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NOVEL COMPOUNDS 

Field of the Invention : 

This invention relates to newly identified polynucelotides, particularly 
Staphylococcal polynucleotides, polypeptides encoded by such polynucleotides, the uses of 
5 such polynucleotides and polypeptides, as well as the production of such polynucleotides 
and polypeptides and recombinant host cells transformed with the polynucleotides. This 
invention also relates to activating or inhibiting the biosynthesis or action of such 
polynucleotides or polypeptides and to the use of such activators or inhibitors in therapy. 
Background of the Invention; 

10 The Staphylococci make up a medically important genera of microbes. They are 

known to produce two types of disease, invasive and toxigenic. Invasive infections are 
characterized generally by abscess formation effecting both skin surfaces and deep tissues. 
Staphylococcus aureus (S. aureus) is the second leading cause of bacteremia in cancer 
patients. Osteomyelitis, septic arthritis, septic thrombophlebitis and acute bacterial 

IS endocarditis are also relatively common. There are at least three clinical conditions 
resulting from the toxigenic properties of Staphylococci. The manifestation of these 
diseases result from the actions of exotoxins as opposed to tissue invasion and bacteremia. 
These conditions include: Staphylococcal food poisoning, scalded skin syndrome and toxic 
shock syndrome. 

20 While certain Staphylococcal proteins associated with pathogenicity have been 

identified, e.g., coagulase, hemolysins, leucocidins and exo and enterotoxins, very little is 
known concerning the temporal expression of such genes during infection and disease 
progression in a mammalian host Discovering the sets of genes the bacterium is likely to be 
expressing at the different stages of infection, particularly when an infection is established, 

25 provides critical information for the screening and characterization of novel antibacterials 
which can interrupt pathogenesis. In addition to providing a fuller understanding of known 
proteins, such an approach will identify previously unrecognized targets. 
Brief Description of the Invention: 

This invention provides novel protein from Staphylococcus aureus WCUH29, 

30 characterized in that it comprises the amino acid sequence given herein or a fragment, 
analogue or derivative thereof. 

Also provided by the invention is an isolated polypeptide comprising an acid 
sequence selected from the group consisting of SEQ ID NO:l through SEQ ID NO:623, or 
any combination of these SEQ ID NOS (SEQ ID NOS:l through 623). 
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Further provided by the invention is an isolated nucleic acid (herein also 
"polynucleotide* 1 ) encoding one of the amino acid sequences of SEQ ID NO: I through SEQ 
ID NO:623, or an isolated polynucleotide sequence selected from the group consisting of 
SEQ ID NO:624 through 1 165, or any combination of these SEQ ID NOS (SEQ ID 
5 NOS:624 through 1 165), or any polynucleotide sequences capable of hybridizing therewith 
under stringent conditions. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 

In particular the invention provides polynucleotides having the DNA sequences 
10 given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
whether or not the gene identified herein, particularly a Staphylococcus aureus gene, in 
whole or in part is expressed in infected tissue. It is recognized that such sequences will 

15 also have utility in diagnosis of the stage of infection and type of infection the pathogen has 
attained. Hie proteins so identified are also useful as targets in screens designed to identify 
antimicrobial compounds. 
Detailed Description of the Invention: 

Each of the polynucleotide sequences provided herein, particularly the DNA 

20 sequences, may be used in the discovery and development of antibacterial compounds. 

Because each of the sequences contains an open reading frame (ORF) or putative gene with 
an appropriate initiation and termination codons, the encoded protein upon expression can 
be used as a target for the screening of antimicrobial compounds, such as drugs. 
Additionally, the DNA sequences encoding the amino terminal regions of the encoded 

25 protein can be used to construct antisense or ribozyme sequences to control the expression 
of the coding sequence of interest Furthermore, many of the sequences disclosed herein 
also provide regions upstream and downstream from the encoding sequence. These 
sequences are useful as a source of regulatory elements for the control of bacterial gene 
expression. Such sequences are conveniently isolated by restriction enzyme action or 

30 synthesized chemically and introduced, for example, into promoter identification strains. 
These strains contain a reporter structural gene sequence located downstream from a 
restriction site such that if an active promoter is inserted, the reporter gene will be 
expressed. 
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Although each of Che sequences may be employed as described above, this 
invention also provides several means' for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Staphylococcal-like form of this gene 
S would likely play an analogous role. For example, a Staphylococcal protein identified as 
homologous to a cell surface protein in another organism would be useful as a vaccine 
candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the coding sequence. 

Recently techniques have become available to evaluate temporal gene expression in 
10 bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se % or 
essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
IS these approaches include: 

1) Signature Tagged Mutagenesis (STM) 

This technique is described by Hensel & al. Science 269: 400-403(1995), the 
contents of which is incorporated by reference for background purposes. 
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance 

20 of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various 
means (e.g., transposons) such that unique DNA sequence tags are inserted in close 
proximity to the site of mutation. The tags from a mixed population of bacterial mutants 
and bacteria recovered from an infected hosts are detected by amplification, radiolabeling 

25 and hybridization analysis. Mutants attenuated in virulence are revealed by absence of the 
tag from the pool of bacteria recovered from infected hosts. 

2) In Vivo Expression Technology (TVET) 

This technique is described by Camilli ct aL Proc. Natl. Acad . Sci . USA . 9 1 :2634- 
2638 (1994), the contents of which is incorporated by reference for background purposes. 
30 IVET identifies genes up-regulated during infection when compared to laboratory 

cultivation, implying an important role in infection. ORF identified by this technique are 
implied to have a significant role in infection establishment/maintenance. 

In this technique random chromosomal fragments of target organism are cloned 
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is 
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introduced into the target organism which carries an antibiotic resistance gene flanked by 
resolvase sites. Growth in the presence f the antibiotic removes from the population those 
fragments cloned into the plasmid vector capable of supporting transcription of the 
recombinase gene and therefore have caused loss of antibiotic resistance. The resistant pool 
5 is introduced into a host and at various times after infection bacteria may be recovered and 
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by 
each antibiotic sensitive bacterium should cany a promoter or portion of a gene normally 
upregulated during infection. Sequencing upstream of the recombinase gene allows 
identification of the up regulated gene. 

10 3) Differential display 

This technique is described by Chuang £i al, J. Bacteriol . 175:2026-2036 (1993), 
the contents of which is incorporated by reference for background purposes. This method 
identifies those genes which are expressed in an organism by identifying mRNA present 
using randomly-primed RT-PCR. By comparing pre-infection and post infection profiles, 

1 5 genes up and down regulated during infection can be identified and the RT-PCR product 
sequenced and matched to ORF "unknowns'. 

4) Generation of conditional lethal mutants by transposon mutagenesis. 

This technique, described by de Lorenzo, V. fit al, Gene 123: 17-24 (1993); 
Neuwald, A. F. fit aL GfiDS 125: 69-73(1993); and Takiff, H. E. fit aL, J. Bacteriol . 

20 174: 1544-1553(1992), the contents of which is incorporated by reference for 

background purposes, identifies genes whose expression arc essential for cell viability. 

In this technique transposons carrying controllable promoters, which provide 
transcription outward from the transposon in one or both directions, arc generated. Random 
insertion of these transposons into target organisms and subsequent isolation of insertion 

25 mutants in the presence of inducer of promoter activity ensures that insertions which 
separate promoter from coding region of a gene whose expression is essential for cell 
viability will be recovered. Subsequent replica plating in the absence of inducer identifies 
such insertions, since they fail to survive. Sequencing of the flanking regions of the 
transposon allows identification of site of insertion and identification of the gene disrupted. 

30 Close monitoring of the changes in cellular processes/morphology during growth in the 
absence of inducer yields information on likely function of the gene. Such monitoring 
could include flow cytometry (cell division, lysis, redox potential, DNA replication), 
incorporation of radiochemical^ labeled precursors into DNA, RNA, protein, lipid. 
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peptidoglycan, monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

5) Generati nofconditi nal lethal mutants by chemical mutagenesis. 

This technique is described by Beckwith, J.. Methods in Enzvmologv 204: 
5 3-18(1991), the contents of which arc incorporated herein by reference for background 
purposes. In this technique random chemical mutagenesis of target organism, growth at 
temperature other than physiological temperature (permissive temperature) and subsequent 
replica plating and growth at different temperature (e.g. 42°C to identify ts, 25°C to identify 
cs) are used to identify those isolates which now fail to grow (conditional mutants). As 

10 above close monitoring of the changes upon growth at the non-permissive temperature 
yields information on the function of the mutated gene. Complementation of conditional 
lethal mutation by library from target organism and sequencing of complementing gene 
allows matching with unknown ORF. 

Each of these techniques may have advantages or disadvantage depending on the 

IS particular application. The skilled artisan would choose the approach that is the most 

relevant with the particular end use in mind. For example, some genes might be recognized 
as essential for infection but in reality are only necessary for the initiation of infection and 
so their products would represent relatively unattractive targets for antibacterials developed 
to cure established and chronic infections. 

20 In yet another variation, a suitably labelled oligonucleotide probe which anneals 

specifically to the bacterial ribosomal RNA in Northern blots of bacterial RNA preparations 
from infected tissue is employed. Using the more abundant ribosomal RNA as a 
hybridization target greatly facilitates the optimization of a protocol to purify bacterial RNA 
of a suitable size for RT-PCR from infected tissue. 

25 Use of the of these technologies when applied to the ORFs of the present invention 

enables identification of bacterial proteins expressed during infection, inhibitors of which 
would have utility in anti-bacterial therapy. 

S. aureus WCUH 29 has been deposited at the National Collection of Industrial and 
Marine Bacteria Ltd. (NCIMB), Aberdeen, Scotland under number NCIMB 4077 1 on 11 

30 September 1995. 

The nucleotide sequences disclosed herein can be obtained by synthetic chemical 
techniques known in the art or can be obtained from S. aureus WCUH 29 by probing a 
DNA preparation with probes constructed from the particular sequences disclosed herein. 
Alternatively, oligonucleotides derived from a disclosed sequence can act as PCR primers 
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in a process f PCR-based cloning of the sequence from a bacterial genomic source. It is. 
recognized that such sequences will also have utility in diagn sis of the stage of infection 
and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given 
5 herein typically a library of clones of chromosomal DNA of S. aureus WCUH29 in E. coli 
or some other suitable host is probed with a radiolabeled oligonucleotide, preferably a 
17mer or longer, derived from the partial sequence. Clones carrying DNA identical to that 
of the probe can then be distinguished using high stringency washes. By sequencing the 
individual clones thus identified with sequencing primers designed from the original 

10 sequence it is then possible to extend the sequence in both directions to determine the full 
gene sequence. Conveniently such sequencing is performed using denatured double 
stranded DNA prepared from a plasmid clone. Suitable techniques are described by 
Maniatis, T., Fritsch, E.R and Sambrook, J. in MOLECULAR CLONING, A Laboratory 
Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening By 

15 Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70). 

A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 

20 may be identical to the coding sequence shown or may be a different coding sequence 
which coding sequence, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 

25 characterized by the de du ced amino acid sequence given herein. The variant of the 

polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, T/U 
representations for nucleic acid bases, the term "N" is also used in certain polynucelotides 
of the invention. * r N" means that any of the four DNA or RNA bases may appear at such a 

30 designated position in the DNA or RNA sequence, except that in preferred embodiments N 
cannot be a base that when taken in combination with adjacent nucleotide positions, when 
read in the correct reading frame, would have the effect of generating a premature 
termination codon in such reading frame. 
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Thus, the present invention includes polynucleotides encoding the same 
polypeptide characterized by the deduced amino acid sequence given herein as well as 
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
allelic variant of the coding sequence characterized by the DNA sequence disclosed herein. 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, may include only 
the coding sequence for the mature polypeptide or the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory sequence or a 
proprotein sequence. 

Thus,, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
controlling transport of a polypeptide from the cell. The polypeptide having a leader 
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and is an inactive form of the protein. Once the prosequence 
is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may encode for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH 2 -terminus. It is appreciated, however, 
that during post-translational modification of the peptide, this residue may be deleted. 
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 
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The polynucleotides of the present invention may also hav the coding sequence - 
fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows 
for purification of the polypeptide of the present inventi n. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Quiagen 
5 Inc.) to provide for purification of the polypeptide fused to the marker in the case of a 
bacterial host. 

The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 50% or 60% and preferably at least 
70%, 80% or 90% identity between the sequences. The present invention particularly 

10 relates to Staphylococcal polynucleotides which hybridize under stringent conditions to the 
hereinabove-described polynucleotides . As herein used, the term "stringent conditions" 
means hybridization will occur only if there is at least 95% and preferably at least 97% 
identity between the sequences. The polynucleotides which hybridize to the hereinabove 
described polynucleotides in a preferred embodiment encode polypeptides which retain 

15 substantially the same biological function or activity as the polypeptide characterized by the 
deduced amino acid sequence given herein. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as convenience to those of skill in 

20 the art and are not an admission that a deposit is required under 35 U.S.C. §112. The 

sequence of the polynucleotides contained in the deposited material* as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and 
are controlling in the event of any conflict with any description of sequences herein. A 
license may be required to make, use or sell the deposited material, and no such license is 

25 hereby granted. 

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, means a polypeptide 
which retains essentially the same biological function or activity as such polypeptide. 
Thus, an analogue includes a proprotein which can be activated by cleavage of the 

30 proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 

The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid 
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residues are substituted with a conserved r non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may n t be 
one encoded by the genetic code* or (ii) ne in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
5 another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues are deemed to be within the scope of those skilled in the art from the teachings 
10 herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

The term "isolated" means that the material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a 

IS naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

20 The present invention also relates to vectors which include polynucleotides of the 

present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore 
provided a process for producing die polypeptide of the invention by recombinant 

25 techniques by expressing a polynucleotide encoding said polypeptide in a host and 

recovering the expressed product Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 

30 expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
genes. The culture conditions, such as temperature, pH and the like, are those previously 
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used with the host cell selected for xpression, and will be apparent to the ordinarily skilled 
artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
5 derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. 

10 Hie DNA sequence in the expression vector is operatively linked to an appropriate 

expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli. 
lac or trp % the phage lambda Pl promoter and other promoters known to control expression 
of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also 

15 contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 

20 tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequ en ce encoding the desired protein is transcribed into RNA 
in the host cell transformed by a vector containing this expression construction. The coding 

25 sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacteria] host in post-translational processing. See, e.g., U.S. Patent Nos. 4,431,739; 
4,425,437; 4,338397. Promoter regions can be selected from any desired gene using CAT 

30 (chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors arc PKK232-8 and PCM7. Particular named bacterial promoters 
include lacl, lacZ, T3, T7, gpt, lambda Pr, Pl and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
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mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
5 of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

10 An expression vector is constructed so that the particular coding sequence is located 

in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is transcribed under the "control" of the control sequences (i.e., RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 

IS sequence). Modification of the coding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with die appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 

20 above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of K coli and 5. cerevisiae TRP1 gene, and a promoter derived from a highly- 

25 expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 

30 identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing die appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
5 embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, 
10 pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, 

pDR540, pRIT5 (Pharmacia). Eukaryotic: pBlueBacm (Invitrogen), pWLNEO, pSV2CAT, 
pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 

15 Examples of recombinant DNA vectors for cloning and host cells which they can 

transform include the bacteriophage 1 (E. coli), pBR322 (£. coli), pACYC177 (E. coli), 
pKT230 (gram-negative bacteria), pGVl 106 (gram-negative bacteria), pLAFRl (gram- 
negative bacteria), pME290 (non-E. coli gram-negative bacteria), pHV14 (£. coli and 
Bacillus subtitis\ pBD9 (Bacillus). pU61 (Streptomyces), pUC6 (Streptomyces\ YIp5 

20 (Saccharomyces), a baculovirus insect cell system, YCpl9 (Saccharomyces). See, 

generally, "DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

25 Polypeptides can be expressed in host cells under the control of appropriate 

promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et al.. Molecular Cloning: A Laboratory Manual, Second Edition, Cold 

30 Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
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Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
5 of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
10 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product Polypeptide expressed in bacterial hosts such as £. coli may require isolation from 
IS inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
20 methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
25 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmtd, chromosome, virus) that 
30 functions as an autonomous unit of DNA replication in vivo; i.e„ capable of replication 
under its own control. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
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A "double-stranded DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
5 term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 

restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
10 mRNA). 

A DNA "coding sequence or or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 

IS polymerase in a cell and initiating transcription of a downstream (3' direction) coding 

sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3' terminus by a translation start codon (e.g., ATG, GTG) of a coding sequence and 
extends upstream (5' direction) to include the minimum number of bases or elements 
necessary to initiate transcription at levels detectable above background. Within the 

20 promoter sequence will be found a transcription initiation site (conveniently defined by 
mapping with nuclease SI), as well as protein binding domains (consensus sequences) 
responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not 
always, contain TATA" boxes and "CAT boxes. Prokaryotic promoters contain Shine- 
Dalgarno sequences in addition to the -10 and -35 consensus sequences. 

25 DNA "control sequences" refers collectively to promoter sequences, ribosome 

binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 

30 RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 
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A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Ex genous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
5 episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cell containing the exogenous DNA. 
10 A "clone*' is a population of cells derived from a single cell or common ancestor by 

mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
1 5 molecule in nature. 

In accordance with yet a further aspect of the present invention, there is provided 
the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
20 of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. 

In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 
25 Another aspect of the invention is a pharmaceutical composition comprising the 

above polypeptide, polynucleotide or inhibitor of the invention and a pbarrnaceutically 
acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent 
30 The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
specific antibodies which have anti-bacterial action. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 

15 
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antibodies. The term antibodies also includes chimeric, single chain, and humanized or - 
simianized antibodies, as well as Fab fragments, or the product of an Fab expression library. 
Various procedures known in the ait may be used for the production f such antibodies and 
fragments. 

5 Antibodies generated against the polypeptides of the present invention can be 

obtained by direct injection of the polypeptides into an animal or by administering the 
polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
10 Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
IS polypeptide or its equivalent which will be specifically recognized by certain antibodies 
which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host 

The term 'immunologically equivalent derivative 1 as used herein encompasses a 
peptide or its equivalent which when used in a suitable formulation to raise antibodies in a 
20 vertebrate, the antibodies act to interfere with the interaction between pathogen and 
mammalian host 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
polypeptides in which one or more of the amino acid residues are modified may be used. 

25 Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those dulled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 

30 as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation , with an immunogenic carrier 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
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polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may .be 
sufficiently antigenic to improve immunogenicity so as to obviate the use f a carrier. 

For preparation f monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used Examples include the hybridoma 
5 technique (Kohler and Milstein, Nature , 256:495-497( 1 975)), the trioma technique, the 

human B-cell hybridoma technique (Kozbor et a!.. Immunology Today 4:72(1983)), and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy t Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
10 4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
products of this invention. 

Using the procedure of Kohler and Milstein (gypm (1975)), antibody-containing 
cells from the immunized mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 
15 The hybridomas arc screened to select a cell line with high binding affinity and 

favorable cross reaction with other staphylococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
20 invention. 

Alternatively phage display technology could be utilized to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries 
(McCafferty, J. et aL. MaQffi 348:552-554(1990), and Marks, J. et ai. 9 Biotechnology 
25 10:779-783(1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et a/., Nature 352:624-628(1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 
30 The antibody may be either intact antibody of M r approx 1 50,000 or a derivative of 

it, for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun, 
A., Science 240:1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed l bispecific' antibodies. 
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The antibody of the invention may be prepared by conventional means for example 
by established monoclonal antibody technology (Kohler, G. and Milstein, C. (supra. (1975) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse, 
W.D. et a/., SOSDCC 246:1275-1281 (1989). 
5 Preferably the antibody is prepared by expression of a DNA polymer encoding said 

antibody in an appropriate expression system such as described above for the expression of 
polypeptides of the invention. The choice of vector for the expression system will be 
determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse CI 27, mouse myeloma, 
10 human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt, A. et aL, Nature 340:76-78(1989). Suitable vectors include plasmids, bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 
The Fab fragment may also be prepared from its parent monoclonal antibody by 
15 enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 
Preferably the antibody or derivative thereof is modified to make it less 
immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be Tmmanized'; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for 
20 example as described in Jones, P. et aL, MatSHfi.321 :522-525 (1986), or Tempest et a/., 
Biotechnology 9:266-273 (1991). 

The modification need not be restricted to one of 'humanizadon' ; other primate 
sequences (for example Newman, R. et aL % Biotechnology 10:1455-1460 (1992)) may also 
be used. 

25 The humanized monoclonal antibody, or its fragment having binding activity, form 

a particular aspect of this invention. 
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This invention provides a method of screening compounds, for example, drugs to. 
identify those which activate or preferably interfere with the proteins selected as targets 
herein, which method comprises measuring the activation or interference of the activity of 
the protein by a test compound or drug. For example if the protein selected has a catalytic 
5 activity, after suitable purification and formulation the activity of the enzyme can be 
followed by its ability to convert its natural substrates. By incorporating different 
chemically synthesized test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

10 The invention also relates to inhibitors identified thereby. 

The use of a polynucleotide of the invention in genetic immunization will 
preferably employ a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et al.. Hum Mol. Genet. 1:363 (1992); Manthorpe et a/.. Hum. Gene Ther . 
4:419 (1963)), delivery of DNA complexed with specific protein carriers ( Wu et a/., L 

15 Biol. Chem . 264:16985 (1989)), coprecipitation of DNA with calcium phosphate 

(Benvenisty & Reshef, Proc. Nat'l Acad. Sci. USA . 83:9551 (1986)), encapsulation of DNA 
in various forms of liposomes (Kaneda et aL 9 Science 243:375 (1989)), particle 
bombardment (Tang et al., Nature 356: 152 (1992)); Eisenbraun et al. 9 DNA Cell Biol. 
12:791 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et aL. Proc. 

20 Nat'l. Acad. Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include 
CMV, RSV, SRa, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

25 Alternatively the composition may be formulated for topical application 

for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 

30 contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
or oleyl alcohol for lotions. Such carriers may constitute from about 1% to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 
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For administration to human patients, it is expected that the daily d sage level of _ 
the active agent will be from 0.01 to 10 nig/kg, typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be roost suitable for an individual 
patient and will vary with the age, weight and response of the particular patient The above 
5 dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
invention. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 
10 A suitable unit dose for vaccination is 0.5-5ugfleg of antigen, and such dose is 

preferably administered 1-3 times and with an interval of 1-3 weeks. 

Within the indicated dosage range, no adverse toxicoiogicals effects are expected 
with the compounds of the invention which would preclude their administration to suitable 
patients. 
15 EXAMPLES 

In order to facilitate understanding of the following non-limiting examples certain 
frequently occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein are either commercially available, 

20 publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

•Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 

25 used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 pg of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 pi of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 ug of DNA are digested with 20 to 250 units of enzyme in a 

30 larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour at 37 C are ordinarily 
used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed direcdy on a polyacrylamide gel to isolate the desired fragment. 
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Size separation of the cleaved fragments is performed using 8 percent 
polyacrylamide gel described by Goeddel, D. er aL, (1980) Nucleic Acids Res., 8:4057. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
5 synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 

oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two 
double stranded nucleic acid fragments (Maniatis, T., et a/., supra., p. 146). Unless 
10 otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units to T4 DNA ligase ("ligase") per 0.5 pg of approximately equimolar amounts of the 
DNA fragments to be ligated. 
Example 1 

Isolation of DNA coding for a Novel Protein from & Aureus WCUH 29 
15 The polynucleotide having the DNA sequence given herein can be obtained from a 

library of clones of chromosomal DNA of S. aureus WCUH 29 in KcolL Libraries may be 
prepared by routine methods, for example: 
Methods 1 and 2. 

Total cellular DNA is isolated from Staphylococcus aureus strain WCUH29 
20 (NC3MB 40771) according to standard procedures and size-fractionated by either of two 
methods. 
Method I . 

Total cellular DNA is mechanically sheared by passage through a needle in order to 
size-fractionate according to standard procedures. DNA fragments of up to 1 Ikbp in size are 
25 rendered blunt by treatment with exonuclease and DNA polymerase, and EcoRl linkers 
added. Fragments are ligated into the vector Lambda Zapn that has been cut with EcoRI, 
the library packaged by standard procedures and Kcoli infected with the packaged library. 
The library is amplified by standard procedures. 
Method 2. 

30 Total cellular DNA is partially hydrolyzed with a combination of four restriction 

enzymes (RsaU Pall, Ahd and Bsh 12351) and size-fractionated according to standard 
procedures. EcoRI linkers are ligated to the DNA and the fragments then ligated into the 
vector Lambda ZapD that have been cut with EcoRI, the library packaged by standard 



21 



WO 97/30070 PCT/US97/02318 

procedures, and Kcoli infected with the packaged library. The library is amplified by 
standard procedures. 
Example 2 

The determination of expression during infection of a gene from Staphylococcus 
5 aureus WCUH29 

Necrotic fatty tissue from a four day groin infection of Staphylococcus aureus 
WCUH29 in the mouse is efficiently disrupted and processed in the presence of chaotropic 
agents and RNAase inhibitor to provide a mixture of animal and bacterial RNA. The 
optimal conditions for disruption and processing to give stable preparations and high yields 

10 of bacterial RNA are followed by the use of hybridization to a radiolabeled oligonucleotide 
specific to Staphylococcus aureus 16S RNA on Northern blots. The RNase free, DNase 
free, DNA and protein free preparations of RNA obtained are suitable for Reverse 
Transcription PCR (RT-PCR) using unique primer pairs designed from the sequence of 
each gene of Staphylococcus aureus WCUH29. 

15 a) Isolation of tissue infected with Staphylococcal aureus WCUH29 from a mouse 
animal model of infection 

10 ml. volumes of sterile nutrient broth (No.2 Oxoid) are seeded with isolated, 
individual colonies of Staphylococcus aureus WCUH29 from an agar culture plate. 

The cultures are incubated aerobicaUy (static culture) at 37 degrees C for 16-20 

20 hours. 4 week old mice (female, 18g-22g, strain MF1) are each infected by subcutaneous 
injection of 0.5ml. of this broth culture of Staphylococcus aureus WCUH29 (diluted in 
broth to approximately 108 cfti/ml.) into the anterior , right lower quadrant (groin area). 
Mice should be monitored regularly during the first 24 hours after infection, then daily until 
termination of study. Animals with signs of systemic infection, i.e. lethargy, ruffled 

25 appearance, isolation from group, should be monitored closely and if signs progress to 
moribundancy, the animal should be culled immediately. 

Visible external signs of lesion development will be seen 24-48h after infection. 
Examination of the abdomen of the animal will show the raised outline of the abscess 
beneath the skin. The localized lesion should remain in the right lower quadrant, but may 

30 occasionally spread to the left lower quadrant, and superiorly to the thorax. On occasions, 
the abscess may rupture through the overlying skin layers. The affected animal should be 
culled immediately and the tissues sampled if possible. Failure to cull the animal may result 
in the necrotic skin tissue overlying the abscess being sloughed off, exposing the abdominal 
muscle wall. 
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Approximately 96h after infection, animals are killed using carbon dioxide 
asphyxiation. To minimize delay between death and tissue processing /storage, mice 
should be killed individually rather than in groups. The dead animal is placed onto its back 
and the fur swabbed liberally with 70% alcohol. An initial incisi n using scissors is made 
S through the skin of the abdominal left lower quadrant, travelling superiorly up to, then 
across the thorax. Hie incision is completed by cutting inferiorly to the abdominal lower 
right quadrant. Care should be taken not to penetrate the abdominal wall. Holding the skin 
flap with forceps, the skin is gently pulled way from the abdomen. The exposed abscess, 
which covers the peritoneal wall but generally does not penetrate the muscle sheet 
10 completely, is excised, taking care not to puncture the viscera. 

The abscess/muscle sheet and other infected tissue, such as the necrotic pads of 
fatty tissue in the abdominal lower right and left quadrants may require cutting in sections, 
prior to flash-freezing in liquid nitrogen, thereby allowing easier storage in plastic 
collecting vials. 

15 b) Isolation of Staphylococcus aureus WCUH29 RNA from infected tissue samples 

4-6 infected tissue samples(each approx 0.5-0.7g) in 2ml screw-cap tubes are 
removed from -80°C storage into a dry ice ethanol bath In a microbiological safety cabinet 
the samples are disrupted individually whilst the remaining samples are kept cold in the dry 
ice ethanol bath. To disrupt the bacteria within the tissue sample 1ml of TRIzol Reagent 

20 (Gibco BRL, Life Technologies) is added followed by enough 0. 1 mm zirconia/silica beads 
to almost fill the tube,the lid is replaced taking care not to get any beads into the screw 
thread so as to ensure a good seal and eliminate aerosol generation. The sample is then 
homogenized in a Mini-BeadBeater Type BX-4 (Biospec Products) Necrotic fatty tissue is 
treated for 100 seconds at 5000 rpm in order to achieve bacterial lysis. In vivo grown 

25 bacteria require longer treatment than in vitro grown S.aureus WCUH29 which are 
disrupted by a 30 second bead-beat. 

After bead-bearing the tubes are dulled on ice before opening in a fume-hood as 
heat generated during disruption may degrade the TRIzol and release cyanide. 

200 microlitres of chloroform is then added and the tubes shaken by hand for 15 

30 seconds to ensure complete mixing. After 2-3 minutes at room temperature the tubes are 
spun down at 12,000 x g, 4°C for 15 minutes and RNA extraction is then continued 
according to the method given by the manufacturers of TRIzol Reagent i.e.:- The aqueous 
phase, approx 0.6 ml, is transferred to a sterile eppendorf tube and 0.5 ml of isopropanol is 
added. After 10 minutes at room temperature the samples are spun at 12,000 x g, 4°C for 10 
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minutes. The supernatant is removed and discarded then the RNA pellet is washed with L 
ml 75% ethanol. A brief vortex is used to mix the sample before centrifuging at 7,500 x g, 4 
°C for 5 minutes. The ethanol is removed and the RNA pellet dried under vacuum f r no 
more than 5 minutes. Samples are then resuspended by repeated pipetting in 100 
5 microlitres of DEPC treated water, followed by 5-10 minutes at 55°C .Finally, after at least 
1 minute on ice, 200 units of Rnasin (Promega) is added 

RNA preparations are stored at -80 °C for up to one month. For longer term storage the 
RNA precipitate can be stored at the wash stage of the protocol in 75% ethanol for at least 
one year at -20 °C. 

10 Quality of the RNA isolated is assessed by running samples on 1% agarose gels. Ix 

TBE gels stained with ethidium bromide are used to visualize total RNA yields. To 
demonstrate the isolation of bacterial RNA from the infected tissue 1 x MOPS, 2.2M 
formaldehyde gels are run and vacuum blotted to Hybond-N (Amersham). The blot is then 
hybridized with a M P labelled oligonucleotide probe specific to 16s rRNA of S.aureus ( K. 

15 Greisen, etal, J, Clin, Microbiftf. 32 335-351(1994)). An oligonucleotide of the sequence: 
S -gctcctaaaaggttactccaccggc-y [SEQ ID NO: 1 1661 is used as a probe. The size of the 
hybridizing band is compared to that of control RNA isolated from in vitro grown S.aureus 
WCUH29. Correct sized bacterial 1 6s rRNA bands can be detected in total RNA samples 
which show extensive degradation of the mammalian RNA when visualized on TBE gels. 

20 c) The removal of DNA from Staphylococcus aureus WCUH29 derived RNA 

DNA was removed from 73 microlitre samples of RNA by a 15 minute treatment 
on ice with 3 units of DNasel, amplification grade (Gibco BRL, Life Technologies) in the 
buffer supplied with the addition of 200 units of Rnasin (Promega) in a final volume of 90 
microlitres. 

25 The DNase was inactivated and removed by treatment with TRIzoI LS Reagent 

(Gibco BRL, Life Technologies) according to the manufacturers protocol. 
DNase treated RNA was resuspended in 73 microlitres of DEPC treated water with the 
addition of Rnasin as described in Method 1. 

d) The preparation of cDNA from RNA samples derived from infected tissue 

30 10 microlitre samples of DNase treated RNA are reverse transcribed using a 

Superscript Preamplification System for First Strand cDNA Synthesis kit (Gibco BRL, Life 
Technologies) according to the manufacturers instructions. 1 nanogram of random hexamers 
is used to prime each reaction. Controls without the addition of SuperScriptl! reverse 
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transcriptase are also run. Both +/-RT samples are treated with RNaseH before proceeding 
to the PCR reaction 

e) The use of PCR to determine the presence of a bacterial cDNA species 
PCR reactions are set up on ice in 0.2ml tubes by adding the f Mowing 
5 components: 

45 microlitres PCR SUPERMK (Gibco BRL, Life Technologies). 
1 microlitre 50mM MgClj , to adjust final concentration to 2.5mM. 
1 microlitre PCR primers(optimaIly 18-25 basepairs designed to possess 
similar annealing temperatures), each primer at lOmM initial concentration. 
10 2 microlitres cDNA. 

PCR reactions are run on a Perkin Elmer GeneAmp PCR System 9600 as follows: 
5 minutes at 95°C, then 50 cycles of 30 seconds each at 94 °C, 42"C and 72 
°C followed by 3 minutes at 72 °C and then a hold temperature of 4°C. 
10 microlitre aliquots are then run out on 1% 1 x TBE gels stained with 
15 ethidium bromide with PCR product sizes estimated by comparison to a 

100 bp DNA Ladder (Gibco BRL, Life Technologies). 
RT/PCR controls may include +/- reverse transcriptase reactions, 16s rRNA primers 
or DNA specific primer pairs designed to produce PCR products from non-transcribed 
S.aureus WCUH29 genomic sequences. 
20 To test the efficiency of the primer pairs they are used in DNA PCR with WCUH29 

total DNA. PCR reactions are set up and run as described above using approx. 1 microgram 
of DNA in place of the cDNA and 35 cycles of PCR rather than 50. 

Primer pairs which fail to give the predicted sized product in either DNA PCR or 
RT/PCR (approx 20%) are PCR failures and as such are uninformative. Of those which give 
25 the correct size product with DNA PCR three classes are distinguished in RT/PCR: 

1. Genes which are not expressed in vivo reproducibly fail to give a product in 
RT/PCR. 

2. Genes which are expressed in vivo reproducibly give the correct size product in 
RT/PCR and show a stronger signal in the +RT samples than in the -RT controls. 

30 3. Genes which may be expressed in vivo give similar amounts of product in both 

+/-RT samples. 
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Table 1 sets forth certain information pertinant to cloning and identification of _ 
ORFs of the invention, such as reading frame direction (forward or reverse) on each 
polynucleotide in the Sequence Listing* a putative identity based n homology searches, the 
numbers of the first and last nucleic acid of the reading frame, and the number of codons in 

S the ORF (including the stop codon). Using this information, expression vectors providing 
the selected protein are prepared and the protein is configured in an appropriate screen for 
the identification of anti-microbial agents. The SEQ ID NOS provided herewith for the 
amino acid (polypeptide) sequences are consecutively numbered to correlate with the 
information forth in Table 1, such that SEQ ID NO: 1 corresponds to the first sequence 

10 referenced in Table 1 and SEQ ID NO:623 corresponds to the last amino acid sequence 
referenced in Table 1. For the polynucleotide sequences SEQ ID NO:624 corresponds to 
the first polynucleotide sequence referenced in Table 1 and SEQ ID NO: 1 165 corresponds 
to the last sequence referenced Table 1. 

Table 1 

15 SEQ ID NO: 1 ENCODED BY SEQ ID NO: _624 

ORF# Start End Direction Codon Length 



20 



1 64 396 F 111 codons 
ORF translation from 64-396, direction F 
Blastp and/or MPSearch Result: 
25 Result: UNK 



SEQ ID NO: 2 ENCODED BY SEQ ID NO: 625 
ORF# Start End Direction Codon Length 



1 127 375 F 83 codons 
35 ORF translation from 1 27-375, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: 3 ENCODED BY SEQ ID NO: 

5 ORF# Start End Direction Codon Length 
1 24 167 R 48 codons 
ORF translation from 24-167, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

20 SEQ ID NO: _ 4 ENCODED BY SEQ ID NO: _627 
ORF# Start End Direction Codon Length 
1 241 378 R 46 codons 

25 

ORF translation from 241-378, direction R 
Blastp and/or MPSearch Result: 
30 Result: UNK 

SEQ ID NO: 5 ENCODED BY SEQ ID NO: 628_ 

35 

ORF# Start End Direction Codon Length 
1 29 286 F 86 codons 
40 ORF translation from 29-286, direction F 
Blastp and/or MPSearch Result: 
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Result: UNK 



PCT/US97/02318 



SEQ ID NO: 6 ENCODED BY SEQ ID NO: _629 

5 

ORF# Start End Direction Codon Length 
1 1 513 R 171codons 
10 ORF translation from 1-513, direction R 
Blastp and/or MPSearch Result 
Description: 

15 ESCHERICHIA COLL PUTATIVE O- ANTIGEN TRANSPORTER. 

SEQ ID NO: 7 ENCODED BY SEQ ID NO: _630 

20 ORF# Start End Direction Codon Length 
1 135 365 R 77codons 

25 ORF translation from 135-365, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: 8 ENCODED BY SEQ ID NO: _631 

ORF# Start End Direction Codon Length 
35 

1 203 451 F 83 codons 
ORF translation from 203-45 1 , direction F 

40 

Blastp and/or MPSearch Result: 
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Description: 

protein-methionine-S-oxide reductase (EC L8.4.6) - Escherichia coli 

5 SEQ ID NO: 9 ENCODED BY SEQ ID NO: _632 

ORF# Start End Direction Codon Length 
1 63 359 R 99codons 

10 

ORF translation from 63-359, direction R 

1 5 Blastp and/or MPSearch Result: 
Result: UNK 

20 SEQ ID NO: _10 ENCODED BY SEQ ID NO: __633 

ORF# Start End Direction Codon Length 
1 480 932 R ISlcodons 

25 

ORF translation from 480-932, direction R 

30 Blastp and/or MPSearch Result- 
Description: 

STREPTOCOCCUS PNEUMONIAE. DNA POLYMERASE I (EC 2.7.7.7) (POL I). 

35 

SEQ ID NO: 1 1 ENCODED BY SEQ ID NO: _634 

ORF# Start End Direction Codon Length 
40 

1 265 444 R 60 codons 
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ORF translation from 265-444, direction R 
Blastp and/or MPSeaich Result: 

Result: UNK 

5 

SEQ ID NO: .12,13 ENCODED BY SEQ ID NO: _635 

ORF # Start End Direction Codon Length 

1 28 258 F 77 codons 

2 368 634 F 89 codons 

1 5 ORF translation from 28-258, direction F 

Blastp and/or MPSearch Result: 
20 Result: UNK 

ORF translation from 368-634, direction F 
25 Blastp and/or MPSearch Result: 
Result: UNK 

30 SEQ ID NO: _14 ENCODED BY SEQ ID NO: 636 

ORF# Start End Direction Codon Length 
1 19 291 R 91 codons 

35 

ORF translation from 19-291, direction R 

40 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTHJS. TECHOIC ACID BIOSYNTHESIS PROTEIN F. 

30 
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5 SEQ ID NO: _15 ENCODED BY SEQ ID NO: _637 

ORF# Start End Direction Codon Length 
1 93 224 R 44codons 

10 

ORF translation from 93-224, direction R 

1 5 Blastp and/or MPSeaich Result: 
Result: UNK 

20 SEQ ID NO: _1 6 ENCODED BY SEQ ID NO: _638. 

ORF# Start End Direction Codon Length 
1 141 386 F 82codons 

25 

ORF translation from 141-386, direction F 

30 Blastp and/or MPSeaich Result: 
Result: UNK 

35 

SEQ ID NO: _17 ENCODED BY SEQ ID NO: _639. 

ORF# Start End Direction Codon Length 
40 2 32 274 R 81 codons 

ORF translation from 32-274, directi nR 
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Blastp and/or MPSearch Result: 

Result: UNK 
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5 

SEQ ED NO: 1 8 ENCODED BY SEQ ID NO: _640. 

ORF# Start End Direction Codon Length 
10 1 348 602 F 85 codons 

ORF translation from 348-602, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _19 ENCODED BY SEQ ID NO: _641. 

ORF# Start End Direction Codon Length 

25 

2 159 305 F 49 codons 

ORF translation from 159-305, direction F 

30 

Blastp and/or MPSearch Result: 

Description: 
35 ATP/GTP-binding protein 

SEQ ID NO: _20 ENCODED BY SEQ ID NO: _642_ 

40 ORF# Start End Direction Codon Length 
1 28 306 F 93 codons 
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ORF translation from 28-306, direction F 

5 Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _21 ENCODED BY SEQ ID NO: _643. 

ORF # Start End Direction Codon Length 
15 1 44 415 R 124codons 

ORF translation from 44-415, direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: 22 ENCODED BY SEQ ID NO: 

ORF # Start End Direction Codon Length 

30 

2 154 372 R 73 codons 

ORF translation from 154-372, direction R 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _23 ENCODED BY SEQ ID NO: 

33 
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ORF# Start End Direction Codon Length 



5 



10 



2 313 411 F 33codons 

ORF translation from 313-411, direction F 

Blastp and/or MPSearch Result: 
Result: UNK 



1 5 SEQ ID NO: 2 4 ENCODED BY SEQ ID NO: _646 
ORF# Start End Direction Codon Length 
1 192 380 F 63 codons 

20 

ORF translation from 192-380, direction F 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _25_ ENCODED BY SEQ ID NO: _647. 
ORF # Start End Direction Codon Length 
35 1 39 275 F 79 codons 

ORF translation from 39-275. direction F 

40 

Blastp and/or MPSearch Result: 
Description: 

34 



WO 97/30070 

THERMOTOGA MARITIMA. 



PCT/US97/02318 

PHOSPHOGLYCERATE KINASE (EC 2.7.2.3). 



5 SEQIDNO:_26_ 
ORF# Start End 
1 83 250 

10 

ORF translation from 83-250, direction R 

1 5 Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: 27 ENCODED BY SEQ ID NO: _649 
ORF# Start End Direction Codon Length 
25 1 159 380 F 74codons 

ORF translation from 159-380, direction F 

30 

Blastp and/or MPSearch Result: 

Description: 
ABC transporter 

35 

SEQ ID NO: 28 ENCODED BY SEQ ID NO: _650. 

ORF # Start End Direction Codon Length 

40 

1 200 328 F 43 codons 



ENCODED BY SEQ ID NO: _648. 
Direction Codon Length 
R 56 codons 
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ORF translation from 200-328, direction F 



Blastp and/or MPSearch Result: 

5 

Description: 

BACILLUS SUBTTLIS. THREONYL-TRNA SYNTHETASE 1 (EC 6.1.1.3) 
(THREONINE-TRNA LIGASE) (THRRS). 

10 

SEQ ID NO: 29 ENCODED BY SEQ ID NO: _65 1 

ORF# Start End Direction Codon Length 



15 1 262 378 F 39codons 



ORF translation from 262-378, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: 30 ENCODED BY SEQ ID NO: _652 

ORF # Start End Direction Codon Length 

30 

1 127 438 F 104codons 



ORF translation from 127-438, direction F 

35 

Blastp and/or MPSearch Result: 
Description: 

40 PSEUDOMONAS AERUGINOSA. ARGININE/ORNITHINE ANTIPORTER. 



36 



WO 97/30070 PCT/TJS97/D2318 

SEQ ID NO: _31 ENCODED BY SEQ ID NO: 653 

ORF# Start End Direction Codon Length 
5 

I 89 358 R 90codons 
ORF translation from 89-358, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _32 ENCODED BY SEQ ID NO: 654 

20 ORF# Start End Direction Codon Length 
1 131 382 F 84codons 

25 ORF translation from 131-382, direction F 

Blastp and/or MPSearch Result 
30 Description: 

SALMONELLA TYPHIMURIUM. HEXOSE PHOSPHATE TRANSPORT PROTEIN. 

35 SEQ ID NO: _33 ENCODED BY SEQ ID NO: 65 5 

ORF# Start End Direction Codon Length 
1 245 361 F 39codons 

40 

ORF translation from 245-361, direction F 
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Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: 34 ENCODED BY SEQ ID NO: _656 
10 ORF# Start End Direction Codon Length 
1 135 245 F 37codons 

15 ORF translation from 135-245, direction F 

Blastp and/or MPSearch Result: 
20 Result: UNK 

SEQ ID NO: 35 ENCODED BY SEQ ID NO: _657. 

25 

ORF# Start End Direction Codon Length 
1 265 426 R 54 codons 

30 

ORF translation from 265-426, direction R 
Blastp and/or MPSearch Result: 

Result: UNK 

35 

SEQ ID NO: 36 ENCODED BY SEQ ID NO: _658. 

40 ORF# Start End Direction Codon Length 
1 277 459 R 61 codons 

38 



PC17US97/02318 



WO 97/30070 PCT/US97/02318 

ORF translation from 277-459, direction R 
Blastp and/or MPSearch Result: 

5 

Description: 

TRANSKETOLASE 1 (EC 2.2.1.1) (TK 1). - ESCHERICHIA COLL 

10 SEQ ID NO: _37 ENCODED BY SEQ ID NO: _659 

ORF# Start End Direction Codon Length 



1 42 326 R 95codons 

15 

ORF translation from 42-326, direction R 
Blastp and/or MPSearch Result: 

20 

Result: UNK 



25 SEQ ID NO: 38 ENCODED BY SEQ ID NO: 660 

ORF# Start End Direction Codon Length 



I 172 294 F 41 codons 

30 

ORF translation from 172-294, direction F 



35 Blastp and/or MPSearch Result: 
Result: UNK 

40 



SEQ ID NO: 39 ENCODED BY SEQ ID NO: _661 

39 
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ORF# Start End Direction Codon Length 



1 227 331 F 35codons 

5 



ORF translation from 227-331, direction F 



10 Blastp and/or MPSearch Result: 
Result: UNK 



15 SEQ ID NO: 40 ENCODED BY SEQ ID NO: _662 

ORF# Start End Direction Codon Length 



1 58 207 R 50codons 

20 

ORF translation from 58-207, direction R 



25 Blastp and/or MPSearch Result- 
Description: 

BACILLUS SUBTILIS. CARBAMOYL-PHOSPHATE SYNTHASE, PYRIMIDINE- 
SPECIFIC. LARGE CHAIN (EC 6.3.5.5) (CARBAMOYL-PHOSPHATE SYNTHETASE 
30 AMMONIA CHAIN). 



SEQ ID NO: _41 ENCODED BY SEQ ID NO: _663 

35 

ORF# Start End Direction Codon Length 
1 121 426 F 102codons 

40 

ORF translation from 121-426, direction F 
Blastp and/or MPSearch Result: 

40 
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Description: 

ESCHERICHIA COLL SEQA PROTEIN. 

5 

SEQ ID NO: 4 2 ENCODED BY SEQ ID NO: 664 

ORF# Start End Direction Codon Length 
10 

1 20 223 R 68codons 

ORF translation from 20-223, direction R 
IS Blastp and/or MPSearch Result: 

Result: UNK 

20 

SEQ ID NO: 43 ENCODED BY SEQ ID NO: 665 
ORF# Start End Direction Codon Length 
25 1 18 329 R 104 codons 

ORF translation from 18-329, direction R 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _44__ ENCODED BY SEQ ID NO: 666 

ORF# Start End Direction Codon Length 
40 

3 142 369 F 76 codons 
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ORF translation from 142-369, direction F 
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5 Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _45 ENCODED BY SEQ ID NO: _667 

ORF # Start End Direction Codon Length 
15 2 181 369 R 63codons 

ORF translation from 181-369, direction R 
20 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTTLIS. PHOSPHORffiOSYLAMINOIMroAZOI^CARBOXAMIDE 
FORMYLTRANSFERASE (EC 2.L2.3) (AICAR TRANSFORMYLASE) / IMP 
25 CYCLOHYDROLASE (EC 3.5.4.10) (INOSINICASE) (IMP SYNTHETASE) (ATIC). 

30 SEQ ID NO: 4 6 ENCODED BY SEQ ID NO: 668 
ORF# Start End Direction Codon Length 
1 65 268 F 68codons 

35 

ORF translation from 65-268, direction F 

40 Blastp and/or MPSearch Result: 
Description: 

LACTOBACILLUS PLANTARUM. REP PROTEIN (REPLICATION PROTEIN). 
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SEQ ID NO: _47 ENCODED BY SEQ ID NO: .669. 

5 ORF# Start End Direction Codon Length 



1 133 375 F 81 codons 

10 ORF translation from 133-375, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _48 ENCODED BY SEQ ID NO: .670. 

20 ORF# Start End Direction Codon Length 



2 202 387 F 62 codons 
25 ORF translation from 202-387, direction F 

Blastp and/or MPSearch Result: 

30 Description: 

ATP/GTP-binding protein 

SEQ ID NO: 4 9 ENCODED BY SEQ ID NO: _671_ 
35 ORF # Start End Direction Codon Length 



1 7 339 R 111 codons 

40 ORF translation from 7-339, direction R 
Blastp and/or MPSearch Result: 
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Description: 

PSEUDOUR1DYLATE SYNTHASE I (EC 4.2.1.70) (PSEUDOURIDINE SYNTHASE I) 
(URACIL HYDROLYASE). - ESCHERICHIA COLL 

5 SEQ ED NO: _50 ENCODED BY SEQ ID NO: _672 

ORF# Start End Direction Codon Length 



1 51 299 F 83codons 

10 

ORF translation from 51-299, direction F 



1 5 Blastp and/or MPSearch Result: 
Result: UNK 



20 SEQ ID NO: _51 ENCODED BY SEQ ID NO: _673 

ORF# Start End Direction Codon Length 



1 109 378 F 90codons 

25 

ORF translation from 109-378, direction F 



30 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL OUTER MEMBRANE PROTEIN F PRECURSOR (OUTER 
MEMBRANE PROTEIN 1 A, IA, OR B). 

35 



SEQ ID NO: _52 ENCODED BY SEQ ID NO: _674 

40 

ORF# Start End Direction Codon Length 



2 268 378 F 37codons 
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5 



15 



ORF translation from 268-378, direction F 



Blastp and/or MPSearch Result: 



Description: 

PROBABLE UDP-N-ACETYLGLUCOSAMINE 1 -CARBOXYVINYLTRANSFERASE 
10 (EC 2.5.1.7) (ENOYLPYRUVATE TRANSFERASE) (UDP-N- 
ACETYLGLUCOSAMINE ENOLPYRUVYL TRANSFERASE). - BACILLUS 
SUBTTLIS. 



SEQ ID NO: _53 ENCODED BY SEQ ID NO: _675_ 

ORF# Start End Direction Codon Length 



20 1 303 437 F 45 codons 



ORF translation from 303-437, direction F 

25 

Blastp and/or MPSearch Result: 
Description: 

UDP-N-ACETYLGLUCOSAMINE 1 CARBOXYVINYLTRANSFERASE (EC 2.5.1.7) 
30 (ENOYLPYRUVATE TRANSFERASE) (UDP-N-ACETYLGLUCOSAMINE 
ENOLPYRUVYL TRANSFERASE). - ACINETOBACTER CALCOACETICUS. 

SEQ ID NO: _54 ENCODED BY SEQ ID NO: _676 

35 ORF# Start End Direction Codon Length 



1 7 216 R 70 codons 



40 ORF translation from 7-216, direction R 



Blastp and/or MPSearch Result: 

45 
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5 

SEQ ID NO: _55 ENCODED BY SEQ ID NO: _677 

ORF # Start End Direction Codon Length 

10 

1 20 160 R 47codons 

ORF translation from 20-160, direction R 

15 Blastp and/or MPSearch Result: 

Description: 

BACILLUS STEAROTHERMOPHILUS. GLUTAMINE TRANSPORT ATP-BINDING 
PROTEIN GLNQ. 

20 

SEQ ID NO: _56 ENCODED BY SEQ ID NO: _678 

ORF# Start End Direction Codon Length 

25 

1 119 328 F 70codons 



ORF translation from 1 19-328, direction F 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 



40 



SEQ ID NO: _57 ENCODED BY SEQ ID NO: _679_ 

ORF # Start End Direction Codon Length 



1 32 163 R 44 codons 
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ORF translation from 32-163, direction R 
5 Blastp and/or MPSearch Result: 
Result: UNK 



10 SEQ ID NO: _58 ENCODED BY SEQ ID NO: _680. 

ORF# Start End Direction Codon Length 



1 7 246 R 80codons 

15 



ORF translation from 7-246, direction R 



20 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTDLIS. ALKALINE PHOSPHATASE HI PRECURSOR (EC 3.1.3.1) 
(APASE ED). 

25 



SEQ ID NO: _59 ENCODED BY SEQ ID NO: _681 

ORF# Start End Direction Codon Length 
30 

1 45 323 R 93codons 



ORF translation from 45-323, direction R 

35 



Blastp and/or MPSearch Result: 
Result: UNK 

40 



SEQ ID NO: 60 ENCODED BY SEQ ID NO: _682. 
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ORF# Start End Direction Codon Length 



2 251 355 F 35codons 

5 

ORF translation from 251-355, direction F 



Blastp and/or MPSearch Result: 

10 

Result: UNK 



SEQ ID NO: _61 ENCODED BY SEQ ID NO: _683 

15 

ORF # Start End Direction Codon Length 



1 24 233 R 70codons 

20 

ORF translation from 24-233, direction R 



Blastp and/or MPSearch Result: 

25 

Result: UNK 
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30 SEQ ID NO: _62 ENCODED BY SEQ ID NO: _684 

ORF# Start End Direction Codon Length 



1 51 302 R 84codons 

35 



ORF translation from 51-302, direction R 
Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: _63 ENCODED BY SEQ ID NO: _685 

ORF# Start End Direction Codon Length 



5 1 7 396 R 130 codon s 

ORF translation from 7-396, direction R 

10 Blastp and/or MPSearch Result: 

Description: 
ABC Transporter 

15 

SEQ ID NO: 64 ENCODED BY SEQ ED NO: _686. 
20 ORF # Start End Direction Codon Length 



1 172 303 F 44codons 



25 ORF translation from 172-303, direction F 



Blastp and/or MPSearch Result: 

30 Description: 

ALCALIGENES EUTROPHUS. MEMBRANE PROTEIN. 



35 



SEQ ID NO: 65 ENCODED BY SEQ ID NO: _687_ 
ORF# Start End Direction Codon Length 



40 



1 41 349 F 103codons 



1 ORF translation from 4 1 -349, direction F 
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Blastp and/or MPSearch Result: 

Result: UNK 



PCT/US97/0ZJ18 



10 



SEQ ID NO: _66 ENCODED BY SEQ ID NO. _688_ 

ORF# Start End Direction Codon Length 



15 



25 



30 



40 



1 72 347 R 92codons 



ORF translation from 72-347, direction R 



Blastp and/or MPSearch Result: 
20 Description: 

ESCHERICHIA COLL IRON(III) DICITRATE TRANSPORT SYSTEM PERMEASE 
PROTEIN FECD. 



SEQ ID NO: 67 ENCODED BY SEQ ID NO: _689_ 
ORF# Start End Direction Codon Length 



1 65 253 F 63codons 
ORF translation from 65-253, direction F 
35 Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _68 ENCODED BY SEQ ID NO: _690_ 

ORF# Start End Direction Codon Length 



1 114 536 F 141codons 

50 
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ORF translation from 1 14-536, direction F 



5 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTILIS. REPLICATTVE DNA HELICASE (EC 3.6.1 

10 



SEQ ID NO: 69 ENCODED BY SEQ ID NO: _69 1 

ORF# Start End Direction Codon Length 

15 

1 46 402 R 119codons 



ORF translation from 46-402, direction R 

20 



Blastp and/or MPSearch Result: 
Description: 

25 ESCHERICHIA COLL NICKEL TRANSPORT SYSTEM PERMEASE PROTEIN NIKC. 



SEQ ID NO: 70 ENCODED BY SEQ ID NO: _692 

30 

ORF# Start End Direction Codon Length 
1 227 334 F 36codons 

35 

ORF translation from 227-334, direction F 
Blastp and/or MPSearch Result: 

40 

Result: UNK 



PCT/US97/D2318 



-). 



51 
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PCT/US97/02318 



SEQ ID NO. _7 1 ENCODED BY SEQ ID NO: _693. 

5 ORF# Start End Direction Codon Length 



I 184 342 F 53 codons 



10 ORF translation from 184-342, direction F 



Blastp and/or MPSearch Result: 
IS Result: UNK 



SEQ ID NO: _72 ENCODED BY SEQ ID NO: _694 

20 

ORF# Start End Direction Codon Length 
1 17 367 R 117 codons 

25 

ORF translation from 17-367, direction R 
Blastp and/or MPSearch Result: 

30 

Result: UNK 



SEQ ID NO: _73 ENCODED BY SEQ ID NO: _695 

35 

ORF# Start End Direction Codon Length 
1 159 317 F 53 codons 

40 

ORF translation from 159-317, direction F 



52 
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Blastp and/or MPSearch Result: 



PCT/US97A>2318 



Description: 

nerve growth factor {promoter} [human, Genomic, 486 ntj. ptsG protein - Staphylococcus 
5 camosus 



10 SEQIDNO:_74 

ORF# Start End 
I 2 409 

15 

ORF translation from 2-409, direction F 

20 Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _75,76 ENCODED BY SEQ ID NO: _697. 

ORF# Start End Direction Codon Length 

30 2 547 720 F 58 codons 

3 915 1079 F 55 codons 

ORF translation from 547-720, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

ORF translation from 915-1079, direction F 

53 



ENCODED BY SEQ ID NO: _696. 
Direction Codon Length 



F 136 codons 



WO 97/30070 PCT/US97/02318 

Blastp and/or MPSearch Result: 

Description: 
5 ATP/GTP-binding protein 



SEQ ID NO: _77 ENCODED BY SEQ ID NO: _698. 

10 

ORF # Start End Direction Codon Length 



1 68 271 F 68codons 
1 5 ORF translation from 68-27 1 , direction F 
Blastp and/or MPSearch Result: 
Description: 

20 BACILLUS SUBTIUS. PROBABLE ASPARTATE AMINOTRANSFERASE (EC 
2.6. 1.1) (TRANSAMINASE A) (ASPAT). 



25 SEQ ID NO: _78 ENCODED BY SEQ ID NO: _699 

ORF# Start End Direction Codon Length 
2 699 797 F 33codons 

30 

ORF translation from 699-797, direction F 
Blastp and/or MPSearch Result: 

35 

Result: UNK 



40 SEQ ID NO: _79 ENCODED BY SEQ ID NO: _700 

ORF# Start End Direction Codon Length 
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2 483 716 F 78 codons 

ORF translation from 483-716, direction F 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: 80 ENCODED BY SEQ ID NO: _701 

15 ORF# Start End Direction Codon Length 
1 223 483 R 87 codons 

20 ORF translation from 223-483, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _81 ENCODED BY SEQ ID NO: 702 

ORF# Start End Direction Codon Length 
30 

1 365 574 R 70 codons 
ORF translation from 365-574, direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

40 BACILLUS SUBTILIS. FERROCHELATASE (EC 4.99. 1.1) (PROTOHEME FERRO- 
LYASE) (HEME SYNTHETASE). 



PCT/US97/02318 



55 
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SEQ ID NO: _82 ENCODED BY SEQ ID NO: _703 

ORF# Start End Direction Codon Length 

1 98 535 F 146codons 

ORF translation from 98-535, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _83 ENCODED BY SEQ ID NO: _704 

20 ORF# Start End Direction Codon Length 

2 460 726 F 89codons 

25 ORF translation from 460-726, direction F 
Blastp and/or MPSearch Result: 

Result: UNK 

30 

SEQ ID NO: 84 ENCODED BY SEQ ID NO: _705 

ORF# Start End Direction Codon Length 

35 

1 22 384 R 121 codons 

ORF translation from 22-384, direction R 

40 

Blastp and/or MPSearch Result: 
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Description: 

ESCHERICHIA COLL MALONYL COA-ACYL CARRIER PROTEIN 
TRANSACYLASE (EC 23.1.39) (MCT). 

5 SEQ ID NO: _85 ENCODED BY SEQ ID NO: _706 

ORF# Start End Direction Codon Length 

I 339 560 F 74codons 

10 

ORF translation from 339-560, direction F 

IS Blastp and/or MPSearch Result: 
Result: UNK 

20 SEQ ID NO: _86,87 ENCODED BY SEP ID NO: 707 

ORF# Start End Direction Codon Length 

1 129 419 R 97codons 
25 2 554 829 R 92 codons 

ORF translation from 129-419, direction R 

30 

Blastp and/or MPSearch Result: 
Result UNK 

35 

ORF translation from 554-829, direction R 
Blastp and/or MPSearch Result: 

40 

Result: UNK 

57 
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SEQ ID NO: _88 ENCODED BY SEQ ID NO: _708 

ORF # Start End Direction Codon Length 
2 159 353 R 65 codons 



ORF translation from 159-353, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 



SEQ ID NO: _89.90,91 ENCODED BY SEQ ID NO: _709 

20 ORF# Start End Direction Codon Length 



2 


1212 


1445 


R 


78 codons 


3 


14S6 


1587 


R 


44 codons 


4 


1705 


2016 


R 


104 codons 



25 

ORF translation from 1212-1445, direction R 

MDQHIhniXSASSDHLMI^^ 

DSKID 

30 TLLQNFDVKSPAMVNQY* 



Blastp and/or MPSearch Result: 
35 Result: UNK 



ORF translation from 1456-1587, direction R 
40 Blastp and/or MPSearch Result: 
Result: UNK 
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ORF translation from 1705-2016, direction R 
Blastp and/or MPSearch Result 

5 

Result: UNK 

SEQ ID NO: _92 ENCODED BY SEQ ID NO: _710 

10 ORF# Start End Direction Codon Length 
1 167 382 F 72codons 

15 ORF translation from 1 67-382, direction F 

Blastp and/or MPSearch Result: 
20 Result: UNK 

SEQ ID NO: _93 ENCODED BY SEQ ID NO: 711 

25 ORF# Start End Direction Codon Length 
I 155 283 F 43codons 

30 ORF translation from 155-283, direction F 

Blastp and/or MPSearch Result: 

35 Description: 

ATP-dependent Clp proteinase (EC 3.4.21 .-) chain clpL - Lactococcus 

SEQ ID NO: _94 ENCODED BY SEQ ID NO: _7 12 

40 

ORF# Start End Direction Codon Length 
1 57 245 R 63codons 
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PCT/US97/02318 



ORF translation from 57-245, direction R 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _95 ENCODED BY SEQ ID NO: _713. 

ORF# Start End Direction Codon Length 
1 57 245 R 63codons 

ORF translation from 57-245, direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: 96 ENCODED BY SEQ ID NO: _7 14. 
30 ORF# Start End Direction Codon Length 
1 179 331 R Slcodons 

35 ORF translation from 179-331, direction R 

Blastp and/or MPSearch Result: 
40 Result: UNK 

SEQ ID NO: _97 ENCODED BY SEQ ID NO: 715_ 

60 
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ORF # Start End Direction Codon Length 



5 



10 



15 



1 3 113 R 37codons 



ORF translation from 3-1 13, direction R 



Blastp and/or MPSearch Result: 



Description: 

BACILLUS SUBTILIS. 2-OXOISOVALERATE DEHYDROGENASE BETA SUBUNIT 
(EC 1.2.4.4) (BRANCHED- CHAIN ALPHA-KETO ACID DEHYDROGENASE 
COMPONENT BETA CHAIN (El)) (BCKDH El-BETA). 



SEQ ID NO: _98 ENCODED BY SEQ ID NO: _716 

20 

ORF# Start End Direction Codon Length 



1 70 228 R 53codons 

25 

ORF translation from 70-228, direction R 



Blastp and/or MPSearch Result: 

30 

Result: UNK 



35 SEQ ID NO: 9 9 ENCODED BY SEQ ID NO: _717. 
ORF# Start End Direction Codon Length 



1 7 441 R 145codons 

40 

ORF translation from 7-441, direction R 
Blastp and/or MPSearch Result: 
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PCT/US97/02318 



Description: 

LACTOCOCCUS LACTIS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). 
ACETOLACTATE SYNTHASE LARGE SUBUNIT (EC 4.13.18) (AHAS) 
5 (ACETOHYDROXY-ACID SYNTHASE LARGE SUBUNIT) (ALS). 



10 SEQ ID NO: _100 ENCODED BY SEQ ID NO; _718 

ORF # Start End Direction Codon LengthDirection Codon Length 
1 279 485 F 69codons 

15 

ORF translation from 279-485, direction F 
Blastp and/or MPSearch Result: 

20 

Result: UNK 



SEQ ID NO: 101 ENCODED BY SEQ ID NO: _719. 

25 

ORF# Start End Direction Codon Length 



1 517 660 R 48codons 

30 

ORF translation from 5 1 7-660, direction R 



Blastp and/or MPSearch Result 

35 

Description: 

BACILLUS SUBTILIS. PROTEASE SYNTHASE AND SPORULATION NEGATIVE 
REGULATORY PROTEIN PAI 1 . 

40 

SEQ ID NO: _102 ENCODED BY SEQ ID NO: _720 
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ORF # Start End Direction Codon Length 



1 172 402 F 77codons 

5 

ORF translation from 172-402, direction F 



Blastp and/or MPSearch Result: 

10 

Result: UNK 



15 SEQ ID NO: _103 ENCODED BY SEQ ID NO: _721. 

ORF# Start End Direction Codon Length 



1 312 482 F 57codons 

20 

ORF translation from 312-482, direction F 



25 Blastp and/or MPSearch Result: 
Result: UNK 

30 SEQ ID NO: _104 ENCODED BY SEQ ID NO: _722_ 

ORF# Start End Direction Codon Length 



2 168 404 F 79 codons 

35 

ORF translation from 168-404, direction F 



40 Blastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: __105 ENCODED BY SEQ ID NO: _723 

5 ORF# Start End Direction Codon Length 
1 S14 933 F 140codons 

10 ORF translation from 514-933, direction F 

Blastp and/or MPSearch Result: 

IS Description: 

THERMOTOGA MARITTMA. GLYCERALDEHYDE 3-PHOSPHATE 
DEHYDROGENASE (EC 1.2.1.12) (GAPDH). 

20 

SEQ ID NO: _106 ENCODED BY SEQ ID NO: _724 

ORF# Start End Direction Codon Length 
25 2 632 961 R UOcodons 

ORF translation from 632-961 , direction R 
30 Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _107 ENCODED BY SEQ ID NO: _725 

ORF# Start End Direction Codon Length 
40 1 141 323 R 61 codons 

ORF translation from 141-323, direction R 

64 



WO 97/30070 



PCT/US97/02318 



Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: _108 t 109 ENCODED BY SEQ ID NO: _726 

ORF# Start End Direction Codon Length 
10 

1 47 943 F 299codons 

2 978 1112 F 45codons 



15 ORF translation from 47-943, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

20 

ORF translation from 978-1 1 12, direction F 



25 Blastp and/or MPSearch Result: 
Result: UNK 



30 

SEQ ID NO: _1 10,1 1 1 ENCODED BY SEQ ID NO: _727 

ORF# Start End Direction Codon Length 



35 1 4 192 R 63 codons 

2 224 352 R 43 codons 



ORF translation from 4-192, direction R 

40 

Blastp and/or MPSearch Result: 
Result: UNK 
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PCT/US97/02318 



ORF translation from 224-352, direction R 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _1 12 ENCODED BY SEQ ID NO: _728 

ORF# Start End Direction Codon Length 
15 1 25 303 R 93codons 

ORF translation from 25-303. direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO. _1 13 ENCODED BY SEQ ID NO: _729 

30 ORF# Start End Direction Codon Length 
1 263 652 R 130codons 

35 ORF translation from 263-652, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _1 14 ENCODED BY SEQ ID NO: _730. 
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ORF # Start End Direction Codon Length 

I 84 617 R 178codons 

5 

ORF translation from 84-617, direction R 
Blastp and/or MPSearch Result: 

10 

Description: 

surfactin synthetase - Bacillus subtilis 

15 SEQ ID NO: _1 15 ENCODED BY SEQ ID NO 

ORF # Start End Direction Codon Length 

1 31 213 R 61codons 

20 

ORF translation from 31-213, direction R 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _1 16 ENCODED BY SEQ ID NO: 

ORF# Start End Direction Codon Length 
35 

2 533 658 F 42codcms 
ORF translation from 533-658, direction F 

40 

Blastp and/or MPSearch Resuh: 

67 
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.731 



732 



WO 97/30070 
Result: UNK 



PCT/US97/02318 



5 

SEQ ID NO: 117 ENCODED BY SEQ ID NO: _733 

ORF# Start End Direction Codon Length 
10 1 175 3% F 74codons 

ORF translation from 175-396, direction F 

15 

Blastp and/or MPSearch Result- 
Result: UNK 

20 SEQ ID NO: 118 ENCODED BY SEQ ID NO: _734 

ORF# Start Bid Direction Codon Length 
1 3 248 R 82codons 

25 

ORF translation from 3-248, direction R 
Blastp and/or MPSearch Result: 

30 

Description: 

ATP/GTP-binding protein 

35 

SEQ ID NO: _1 19 ENCODED BY SEQ ID NO: _735. 

ORF# Start End Direction Codon Length 
40 1 160 378 R 73codons 

ORF translation from 160-378, direction R 
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Blastp and/or MPSearch Result: 
S Result: UNK 

SEQ ID NO: _120 ENCODED BY SEQ ID NO: _736 

10 ORF# Start End Direction Codon Length 
1 202 546 R 115codons 

IS ORF translation from 202-546, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: 121 ENCODED BY SEQ ID NO: _737 

25 ORF# Start End Direction Codon Length 
1 2 349 F 116codons 

30 ORF translation from 2-349, direction F 

Blastp and/or MPSearch Result 
35 Result: UNK 

SEQ ID NO: _122 ENCODED BY SEQ ID NO: _738 

40 ORF# Start End Direction Codon Length 
1 1 285 F 95 codons 
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ORF translation from 1-285, direction F 

5 Blastp and/or MPSearch Result: 
Result: UNK 

10 SEQ ID NO: _123,124 ENCODED BY SEQ ID NO: _739 

ORF # Start End Direction Codon Length 

2 487 828 R 114codons 
15 3 905 1120 R 72codons 

ORF translation from 487-828, direction R 
20 Blastp and/or MPSearch Result: 
Result UNK 

ORF translation from 905-1 120, direction R 
25 MSKIJ-MIGTGPVAIQLANICYLK^ 
IQNE 

AHQHLEGKFGN* 

30 Blastp and/or MPSearch Result: 
Description: 

Putative nucleotide-binding protein 

35 

SEQ ID NO: _125 ENCODED BY SEQ ID NO: 740 

ORF # Start End Direction Codon Length 

40 

2 193 555 F 121 codons 
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ORF translation from 193-555, direction F 



PCT/US97/02318 



Blastp and/or MPSearch Result: 

5 

Result: UNK 



10 SEQ ID NO __126 ENCODED BY SEQ ID NO: 741 

ORF# Start End Direction Codon Length 



1 61 339 F 93codons 

15 

ORF translation from 61-339, direction F 
Blastp and/or MPSearch Result: 

20 

Result: UNK 



SEQ ID NO: _127 ENCODED BY SEQ ID NO: __742 

25 

ORF# Start End Direction Codon Length 



1 251 706 F 152 codons 

30 

ORF translation from 251-706, direction F 



Blastp and/or MPSearch Resuh: 

35 

Description: 

ESCHERICHIA COLL NICKEL TRANSPORT SYSTEM PERMEASE PROTEIN NIKC. 

40 

SEQ ID NO: _128 ENCODED BY SEQ ID NO: _743 

ORF# Start End Direction Codon Length 
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1 126 326 F 67codons 



5 ORF translation from 126-326, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _129 ENCODED BY SEQ ID NO: _744. 

ORF# Start End Direction Codon Length 
1 25 579 R 185codons 



ORF translation from 25-579, direction R 

20 

Blastp and/or MPSearch Result: 
Description: 

LACTOCOCCUS LACTTS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). ATP 
25 PHOSPHORIBOSYLTRANSFERASE (EC 2.4.2.17). 



SEQ ID NO: _130 ENCODED BY SEQ ID NO: _745. 

30 

ORF# Start End Direction Codon Length 



1 641 826 F 62codons 

35 

ORF translation from 641-826, direction F 
Blastp and/or MPSearch Result: 
40 Description: 

STAPHYLOCOCCUS AUREUS. ARSENICAL PUMP MEMBRANE PROTEIN. 
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SEQ ID NO: 131 ENCODED BY SEQ ID NO: _746 

ORF# Start End Direction Codon Length 

5 1 198 473 F 92codons 

ORF translation from 198-473, direction F 
10 Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _132 ENCODED BY SEQ ID NO: _747 

ORF# Start End Direction Codon Length 
20 

1 95 409 R 105codons 
ORF translation from 95-409, direction R 
25 Blastp and/or MPSearch Result: 
Description: 

LACALS NCBI gi: 473900NCBI gi: 473748 - Lactococcus lactis (strain DSM 20384, 
sub_species lactis) DNA. 

30 



SEQ ID NO: 133 ENCODED BY SEQ ID NO: _748 

ORF# Start End Direction Codon Length 



I 342 542 R 67 codons 
40 ORF translation from 342-542, direction R 

Blastp and/or MPSearch Result: 

73 
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Result: UNK 



PCT/US97/02318 



5 

SEQ ID NO: _134 ENCODED BY SEQ ID NO: _749 

ORF# Start End Direction Codon Length 
10 2 179 598 F 140codons 

ORF translation from 179-598, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: 135 ENCODED BY SEQ ID NO: _750 

ORF# Start End Direction Codon Length 
25 1 179 382 R 68codons 

ORF translation from 179-382, direction R 
30 Blastp and/or MPSearch Result: 
Description: 

Leuconostoc gelidum (strain UAL 187) leucocin A ATP-dependent 

35 

SEQ ID NO: _136 ENCODED BY SEQ ID NO: 751 

ORF # Start End Direction Codon Length 

40 

1 96 392 F 99codons 

ORF translation from 96-392, direction F 
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Blastp and/or MPSearch Result: 

DescriptioQ: 

PSEPSTD NCBI gi: 63399 1NCBI gi: 473 - Pseudomonas aeruginosa (strain PAOl, ) DNA. 



SEQ ID NO. _137 ENCODED BY SEQ ID NO: _752 

10 

ORF # Start End Direction Codon Length 
1 298 681 R 128codons 

15 

ORF translation from 298-681, direction R 
Blastp and/or MPSearch Result: 
20 Result: UNK 

SEQ ID NO: _138 ENCODED BY SEQ ID NO: _753 

25 ORF# Start End Direction Codon Length 
1 153 341 F 63codons 

30 ORF translation from 153-341 , direction F 

Blastp and/or MPSearch Result: 

35 Result: UNK 

SEQ ID NO: _139 ENCODED BY SEQ ID NO: _754 

ORF# Start End Direction Codon Length 

40 

1 38 322 F 95codons 
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ORF translation from 38-322, direction F 



PCT/US97/02318 



Blastp and/or MPSearch Result: 

5 

Description: 

THERMOANAEROBACTER THERMOS ULFUROGENES (CLOSTRIDIUM 
THERMOSULFUROGENES). HYPOTHETICAL 35.6 KD PROTEIN IN AMYB 
5'REGION(ORFl). 

10 



SEQ ID NO: _140 ENCODED BY SEQ ID NO: _755. 

IS ORF# Start End Direction Codon Length 



1 73 213 R 47codons 



20 ORF translation from 73-21 3, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: 141 ENCODED BY SEQ ID NO: _756. 

ORF# Start End Direction Codon Length 
30 

1 1 189 R 63codons 



ORF translation from 1-189, direction R 

35 



Blastp and/or MPSearch Result: 
Result: UNK 

40 



SEQ ID NO: _142 ENCODED BY SEQ ID NO: _757. 
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ORF# Start End Direction Codon Length 



15 



20 



25 



35 



2 724 966 F 81 codons 

5 

ORF translation from 724-966, direction F 

Biastp and/or MPSearch Result: 

10 Description: 

ESCHERICHIA COLI. PHOTOREPAIR PROTEIN PHRA. 



SEQ ID NO: _143 ENCODED BY SEQ ID NO: _758_ 

ORF# Start End Direction Codon Length 



1 353 481 F 43 codons 



ORF translation from 353-48 1 , direction F 



Biastp and/or MPSearch Result 
Description: 

ECOPHNAQ NCBI gi: 147192NCBI gi: 4737 - E.coli (strain B) DNA. 



30 SEQ ID NO: _144 ENCODED BY SEQ ID NO: _759_ 

ORF# Start End Direction Codon Length 



1 144 419 F 92 codons 



ORF translation from 144-419, direction F 



40 Biastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: _145 ENCODED BY SEQ ID NO: _760 

S ORF# Start End Direction Codon Length 
1 7 312 R 102codons 

10 ORF translation from 7*312, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _146 ENCODED BY SEQ ID NO: _761. 

20 ORF # Start End Direction Codon Length 
1 89 478 F 130codons 

25 ORF translation from 89-478, direction F 

Blastp and/or MPSearch Result: 
30 Result: UNK 

SEQ ID NO: _147 ENCODED BY SEQ ID NO: _762. 

35 ORF# Start End Direction Codon Length 
1 128 382 F 85 codons 

40 ORF translation from 128-382, direction F 

Blastp and/or MPSearch Result: 
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Result: UNK 



5 SEQIDNO:_148 ENCODED BY SEQ ID NO: _763 ORF# Start End 

Direction Codon Length 



1 234 512 F 93codons 

10 

ORF translation from 234-512, direction F 



Blastp and/or MPSearch Result: 

15 

Description: 

ESCHERICHIA COLL OXYGEN-INSENSITIVE NAD(P)H NITROREDUCTASE (EC 
1.-.-.-). 

20 

SEQ ID NO: _149 ENCODED BY SEQ ID NO: _764 



ORF # Start End Direction Codon Length 



25 1 1 432 F 144 codons 

ORF translation from 1-432, direction F 

30 

Blastp and/or MPSearch Result: 
Description: 

PARACOCCUS DENTTRIFICANS. NADH-UBIQUINONE OXIDOREDUCTASE 
35 CHAIN 14 (EC 1.6.5.3) (NADH DEHYDROGENASE 1, CHAIN 14) (NDH-1, CHAIN 
14). 

SEQ ID NO: _I50 ENCODED BY SEQ ID NO: _765 

40 ORF# Start End Direction Codon Length 



1 125 394 R 90 codons 
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ORF translation from 125-394, direction R 
Blastp and/or MPSearch Result: 

5 

Description: 

BACILLUS STEAROTHERMOPHILUS. GLUCOSE-6-PHOSPHATE ISOMERASE A 
(GPI A) (EC 5.3.1.9) (PHOSPHOGLUCOSE ISOMERASE A). 

10 

SEQ ID NO: 151 ENCODED BY SEQ ID NO: _766 

ORF # Start End Direction Codon Length 

2 256 426 F 57 codons 
ORF translation from 256-426, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _I52 ENCODED BY SEQ ED NO: _767 

ORF# Start End Direction Codon Length 
30 1 322 717 R 132 codons 

ORF translation from 322-717, direction R 

35 Blastp and/or MPSearch Result: 
Description: 

BACRSBU NCBI gi: 642 - Bacillus subtilis (strain 168 Marburg) DNA. 

40 SEQ ID NO: _153 ENCODED BY SEQ ID NO: _768 

ORF# Start End Direction Codon Length 

80 
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1 33 572 R 180codons 
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ORF translation from 33-572, direction R 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 



SEQ ID NO: _154 ENCODED BY SEQ ID NO: _769 

15 

ORF# Start End Direction Codon Length 



1 78 545 F 156codons 

20 

ORF translation from 78-545, direction F 
Blastp and/or MPSearch Result: 

Result: UNK 

25 



SEQ ID NO: _155 ENCODED BY SEQ ID NO: _770. 

30 ORF# Start End Direction Codon Length 



1 310 540 F 77codons 



35 ORF translation from 310-540, direction F 

Blastp and/or MPSearch Result: 

Description: 
40 Potential ATP/GTP-binding protein 



SEQ ID NO: 156 ENCODED BY SEQ ID NO: _771 

81 
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ORF# Start End Direction Codon Length 
1 220 390 F 57codons 

5 

ORF translation from 220-390, direction F 

10 Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: 157 ENCODED BY SEQ ID NO: _772 

15 ORF# Start End Direction Codon Length 

1 104 577 F 158codons 

20 ORF translation from 104-577, direction F 

Blastp and/or MPSearch Result: 

25 Description: 

SALMONELLA TYPHIMURIUM. L-RIBULOKINASE (EC 2.7. 1 . 16). 

30 SEQ ID NO: _1 58 ENCODED BY SEQ ID NO: _773 

ORF# Start End Direction Codon Length 

2 218 352 F 45codons 

35 

ORF translation from 218-352, direction F 

MTlM^ETFANAKNEnNAVNNGEF^ERQNELXGDMINXLFGGN* 

40 

Blastp and/or MPSearch Result: 
Result: UNK 

82 



WO 97/30070 



PCT/US97/02318 



SEQ ID NO: _159 ENCODED BY SEQ ID NO: _774 

5 ORF # Start End Direction Codon Length 

1 90 401 F 104codons 

1 0 ORF translation from 90-40 1 » direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _160 ENCODED BY SEQ ID NO: _775 

20 ORF # Start End Direction Codon Length 

2 296 424 R 43 codons 

25 ORF translation from 296-424, direction R 

Blastp and/or MPSearch Result: 
30 Result: UNK 



SEQIDNO: _161_ 
ORF# Start End 



1 211 378 



ORF translation from 21 1-378, direction F 
Blastp and/or MPSearch Result: 

83 



ENCODED BY SEQ ID NO: _776 
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F 56 codons 
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Description: 

BACILLUS SUBTILIS. AMIDOPHOSPHORIBOSYLTRANSFERASE PRECURSOR 
(EC 2.4.2.14) (GLUT AMINE PHOSPHORIBOSYLPYROPHOSPHATE 
5 AMIDOTRANSFERASE) (ATASE). 



SEQ ID NO: _162 ENCODED BY SEQ ID NO: _777. 

10 

ORF# Start End Direction Codon Length 



2 217 327 F 37 codons 

15 

ORF translation from 217-327. direction F 



Blastp and/or MPSearch Result: 

20 

Result: UNK 



25 

SEQ ID NO: _163 ENCODED BY SEQ ID NO: _778. 

ORF# Start End Direction Codon Length 



30 1 33 275 R 81 codons 



ORF translation from 33-275, direction R 

35 

Blastp and/or MPSearch Result: 

Description: 
Potential peptidase 

40 



SEQ ID NO: _164 ENCODED BY SEQ ID NO: _779. 
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ORF# Start End Direction Codon Length 



65 



253 



R 63 codons 



5 



ORF translation from 65-253, direction R 



10 



Blastp and/or MPSearch Result: 



Result: UNK 



15 



SEQ ID NO: _165 ENCODED BY SEQ ID NO: _780. 

ORF# Start End Direction Codon Length 



ORF translation from 120-284, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _166 ENCODED BY SEQ ID NO: _781 

ORF# Start End Direction Codon Length 
35 I 191 364 F 58 codons 

ORF translation from 191-364, direction F 
Blastp and/or MPSearch Result: 

40 

Description: 

Potential prenylated protein 



20 



120 



284 



F 55 codons 
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SEQ ID NO: _167 ENCODED BY SEQ ID NO: _782 

ORF# Start End Direction Codon Length 
5 

1 219 551 R lllcodons 
ORF translation from 219-551, direction R 

10 

Blastp and/or MPSearch Result: 
Description: 

15 SALMONELLA TYPHIMURIUM. RIBONUCLEOSIDE-DIPHOSPH ATE REDUCTASE 
2 BETA CHAIN (EC 1.17.4.1) (RIBONUCLEOTIDE REDUCTASE 2). 

SEQ ID NO: _168 ENCODED BY SEQ ID NO: _783 

20 ORF# Start End Direction Codon Length 

2 134 346 F 71 codons 

25 ORF translation from 134-346, direction F 

Blastp and/or MPSearch Result: 
30 Result: UNK 

SEQ ID NO: _169 ENCODED BY SEQ ID NO: _784 

35 ORF# Start End Direction Codon Length 
1 28 333 R 102 codons 
ORF translation from 28-333, direction R 

40 

Blastp and/or MPSearch Result: 
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Description: 

ESCHERICHIA COLL MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C. 

5 

SEQ ID NO: _170 ENCODED BY SEQ ID NO: _785 

ORF# Start End Direction Codon Length 
1 108 299 F 64codons 

ORF translation from 108-299, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

20 BACILLUS SUBTILIS. CTP SYNTHASE (EC 6.3.4.2) (UTP— AMMONIA LIGASE) 
(CTP SYNTHETASE). 

25 

SEQ ID NO: 171 ENCODED BY SEQ ID NO: _786 

ORF# Start End Direction Codon Length 
30 1 7 141 R 45codons 

ORF translation from 7-141, direction R 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _172 ENCODED BY SEQ ID NO: _787 

87 
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ORF# Start End Direction Codon Length 



5 



10 



15 



25 



30 



35 



40 



1 106 390 F 95codons 

ORF translation from 106-390, direction F 

Blastp and/or MPSearch Result: 
Description: 

VIBRIO ALGINOLYTICUS. PTS SYSTEM, SUCROSE-SPECIFIC IIBC COMPONENT 
(EIIBC-SCR) (SUCROSE- PERMEASE HBC COMPONENT) 

(PHOSPHOTRANSFERASE ENZYME II, BC COMPONENT) (EC 2.7.1.69) (FJI-SCR). 

SEQ ID NO: 173 ENCODED BY SEQ ID NO: _788 

ORF# Start End Direction Codon Length 



20 1 17 298 R 94 codons 

ORF translation from 17-298. direction R 



Blastp and/or MPSearch Result: 

Description: 
ABC Transporter 



SEQ ID NO: _174 ENCODED BY SEQ ID NO. _789 

ORF# Start End Direction Codon Length 



1 99 290 F 64 codons 
ORF translation from 99-290, direction F 
Blastp and/or MPSearch Result: 

88 
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Description: 

Potential prenylated protein 



5 

SEQ ID NO: _175 ENCODED BY SEQ ID NO: _790 

ORF# Start End Direction Codon Length 
1 68 295 F 76codons 



ORF translation from 68-295, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 SEQ ID NO: _1 76 ENCODED BY SEQ ID NO: 791 

ORF # Start End Direction Codon Length 



2 135 344 R 70codons 

25 

ORF translation from 1 35-344, direction R 

MDSFVMDGTD^LVLKTU^NAQSIGA 

DE 

IKSRIFNLL* 

30 

Blastp and/or MPSearch Result: 
Description: 

35 BACILLUS SUBTILIS. ARGININE HYDROXIMATE RESISTANCE PROTEIN. 

SEQ ID NO: _177 ENCODED BY SEQ ID NO: _792 

40 ORF# Start End Direction Codon Length 



1 20 358 F 113codons 
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ORF translation from 20-358, direction F 



5 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE 
(EC 3.5.1.25). 

10 

SEQ ID NO: 178 ENCODED BY SEQ ID NO: _793 

ORF# Start End Direction Codon Length 
15 

1 9 131 R 41 codons 



ORF translation from 9-131, direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _179 ENCODED BY SEQ ID NO: _794 

ORF# Start End Direction Codon Length 



30 1 59 334 R 92 codons 



ORF translation from 59-334, direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

STAPHYLOCOCCUS EPIDERMIDIS. FOSFOMYCIN RESISTANCE PROTEIN. 

40 



SEQ ID NO: __180 ENCODED BY SEQ ID NO: _795. 

90 
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ORF# Start End 



1 7 219 



Direction Codon Length 
R 71 codons 
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ORF translation from 7-219, direction R 



1 0 Blastp and/or MPSearch Result: 



15 



Description: 

PSEUDOMONAS PUTIDA. UROCANATE HYDRATASE (EC 4.2.1.49) 
(UROCANASE). 



SEQ ID NO: 181 ENCODED BY SEQ ID NO: _796 

ORF# Start End Direction Codon Length 



20 



1 127 315 R 63 codons 



25 



ORF translation from 127-315, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

30 SEQ ID NO: _1 82 ENCODED BY SEQ ID NO: _797„ 

ORF# Start End Direction Codon Length 



1 149 265 F 39 codons 

35 

ORF translation from 149-265, direction F 



40 Blastp and/or MPSearch Result: 
Result: UNK 
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5 SEQ ID NO: 183 ENCODED BY SEQ ID NO: _798 

ORF# Start End Direction Codon Length 



1 87 284 F 66 codons 

10 

ORF translation from 87-284, direction F 
Blastp and/or MPSearch Result: 

15 

Result: UNK 



20 SEQ ID NO: _1 84 ENCODED BY SEQ ID NO: _799. 

ORF # Start End Direction Codon Length 
1 126 299 F 58 codons 

25 

ORF translation from 126-299, direction F 

30 Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _1 85 ENCODED BY SEQ ID NO: _800 

ORF # Start End Direction Codon Length 
40 2 426 845 F 140 codons 

ORF translation from 426-845, direction F 

92 
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Blastp and/or MPSearch Result: 
5 Description: 

ESCHERICHIA COLL TYPE I RESTRICTION ENZYME ECOR 124/3 I M PROTEIN 
(EC 2.1.1.72). 

10 

SEQ ID NO: _186 ENCODED BY SEQ ID NO: _801 

ORF # Start End Direction Codon Length 
15 1 119 391 R 91codons 

ORF translation from 1 19-391. direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _187 ENCODED BY SEQ ID NO: _802 

ORF# Start End Direction Codon Length 
30 I 108 548 F 147 codons 

ORF translation from 108-548, direction F 
35 Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: 188 ENCODED BY SEQ ID NO: _803 

ORF# Start End Direction Codon Length 
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2 421 816 R 132codons 

5 ORF translation from 42 1 -8 1 6, direction R 

Blastp and/or MPSearch Result: 
10 Result: UNK 

SEQ ID NO: _189 ENCODED BY SEQ ID NO: _804. 

15 

ORF# Start End Direction Codon Length 
2 427 609 R 61 codons 

20 

ORF translation from 427-609, direction R 
Blastp and/or MPSearch Result: 

25 

Result: UNK 

30 SEQ ID NO: _1 90 ENCODED BY SEQ ID NO: _805_ 

ORF# Start End Direction Codon Length 
I 216 356 R 47 codons 

35 

ORF translation from 216-356, direction R 

40 Blastp and/or MPSearch Result: 
Result: UNK 

94 
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ENCODED BY SEQ ID NO: _806 
Direction Codon Length 
F 143 codons 

10 ORF translation from 369-797, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _192 ENCODED BY SEQ ID NO: _807. 

ORF # Start End Direction Codon Length 

20 

1 136 411 F 92 codons 

ORF translation from 136-41 1, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _193 ENCODED BY SEQ ID NO: _808_ 

35 ORF # Start End Direction Codon Length 
I 4 372 R 123 codons 

40 ORF translation from 4-372, direction R 
Blastp and/or MPSearch Result: 

95 



SEQIDNO:_191_ 
5 ORF# Start End 
1 369 797 
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Description: 

BACILLUS SUBTTLIS. CELL DIVISION PROTEIN FTSA. 

5 SEQ ID NO: _194 ENCODED BY SEQ ID NO: 809 

ORF# Start End Direction Codon Length 

1 477 692 F 72codons 

10 

ORF translation from 477-692, direction F 
Blastp and/or MPSearch Result: 

15 

Result: UNK 

20 SEQ ID NO: _195 ENCODED BY SEQ ID NO: _810 

ORF# Start End Direction Codon Length 

2 S88 824 R 79codons 

25 

ORF translation from 588-824, direction R 

30 Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _196 ENCODED BY SEQ ID NO: _81 1 

ORF# Start End Direction Codon Length 
40 2 203 325 F 41 codons 

ORF translation from 203-325, direction F 

96 
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Blastp and/or MPSearch Result: 
5 Result: UNK 

SEQ ID NO: _197 ENCODED BY SEQ ID NO: _812 

10 

ORF# Start End Direction Codon Length 
1 23 406 R 128 codons 

15 

ORF translation from 23-406, direction R 
Blastp and/or MPSearch Result: 

20 

Description: 

SALMONELLA TYPHIMURIUM. VIRULENCE TRANSCRIPTIONAL 
REGULATORY PROTEIN PHOP. 

25 

SEQ ID NO: _198 ENCODED BY SEQ ID NO: _813 

30 ORF# Start End Direction Codon Length 
1 66 359 R 98 codons 

35 ORF translation from 66-359, direction R 

Blastp and/or MPSearch Result: 
40 Result: UNK 

97 
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SEQ ID NO: _199 ENCODED BY SEQ ID NO: _814 

ORF# Start End Direction Codon Length 

5 2 387 602 F 72 codons 

ORF translation from 387-602, direction F 
10 Blastp and/or MPSearch Result: 
Result: UNK 

15 SEQ ID NO: _200,201 ENCODED BY SEQ ID NO: 815 

ORF# Start End Direction Codon Length 

1 206 538 R 111 codons 
20 2 572 745 R 58 codons 

ORF translation from 206-538, direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

GLYCINE CLEAVAGE SYSTEM H PROTEIN. - ESCHERICHIA COLL 

30 

ORF translation from 572-745, direction R 

35 Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: .202 ENCODED BY SEQ ID NO: 816 



98 
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ORF # Start End Direction Codon Length 



PCT/US97/02318 



1 145 318 R 58codons 

5 

ORF translation from 145-318, direction R 
Blastp and/or MPSearch Result: 

10 

Result: UNK 

15 SEQ ID NO: _203 ENCODED BY SEQ ID NO: _817 

ORF# Start End Direction Codon Length 

2 609 803 F 65 codons 

20 

ORF translation from 609-803, direction F 
Blastp and/or MPSearch Result: 

25 

Result: UNK 

30 SEQ ID NO: 204 ENCODED BY SEQ ID NO: _818 
ORF # Start End Direction Codon Length 
1 209 754 F 182 codons 

35 

ORF translation from 209-754, direction F 

40 Blastp and/or MPSearch Result: 
Result: UNK 

99 



WO 97/30070 



PCT/US97/02318 



SEQ ID NO: _205 ENCODED BY SEQ ID NO: _819 

5 ORF# Start End Direction Codon Length 



2 444 605 R 54 codons 
ORF translation from 444-605, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: 206 ENCODED BY SEQ ID NO: _820 

ORF# Start End Direction Codon Length 

20 

2 458 823 F 122 codons 



ORF translation from 458-823, direction F 
25 Blastp and/or MPSearch Result: 

Result: UNK 

SEQ ID NO: 207 ENCODED BY SEQ ID NO: _821 

30 

ORF# Start End Direction Codon Length 



2 379 510 F 44 codons 
35 ORF translation from 379-5 10. direction F 



Blastp and/or MPSearch Result: 
40 Description: 

LACTOCOCCUS LACTIS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). KETOL- 
AOD REDUCTOISOMERASE (ECU A .86) (ACETOHYDROXY-ACID 

100 
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ISQMEROREDUCTASE) (ALPHA-KETO-BETA-HYDROXYLACIL 
REDUCTOISOMERASE). 



5 

SEQ ID NO: 208.209 ENCODED BY SEQ ID NO: _822. 
ORF# Start End Direction Codon Length 



10 1 241 429 F 63 codons 

3 1284 1427 F 48 codons 



ORF translation from 241-429, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 

ORF translation from 1284-1427, direction F 



Blastp and/or MPSearch Result: 

25 

Result: UNK 



SEQ ID NO: _210 ENCODED BY SEQ ID NO: _823. 

30 

ORF# Start End Direction Codon Length 



1 88 417 F 110 codons 
35 ORF translation from 88-417, direction F 
Blastp and/or MPSearch Result: 
Description: 

40 STAPHYLOCOCCUS AUREUS, AND BACILLUS SP. REPLICATION PROTEIN 



SEQ ID NO: _21 1 ENCODED BY SEQ ID NO: _824 
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WO 97/30070 

ORF# Start End Direction Codon Length 
1 254 757 R 168codons 

5 

ORF translation from 254-757, direction R 

1 0 Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _212 ENCODED BY SEQ ID NO: 825 

ORF # Start End Direction Codon Length 
20 1 230 1198 R 323codons 

ORF translation from 230-1 198, direction R 
Blastp and/or MPSearch Result: 

25 

Description: 

A/G-speciftc adenine glycosyiase (mutY) homolog - Haemophilus 

30 

SEQ ID NO: _213 ENCODED BY SEQ ID NO: _826 

ORF# Start End Direction Codon Length 
35 2 725 895 F 57codons 

ORF translation from 725-895, direction F 

40 Blastp and/or MPSearch Result: 
Result: UNK 

102 
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SEQ ID NO: _214 ENCODED BY SEQ ID NO: _827 

ORF # Start End Direction Codon Length 
1 253 393 R 47codons 



ORF translation from 253-393, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _215 ENCODED BY SEQ ID NO: _828. 

ORF# Start End Direction Codon Length 

20 

1 245 619 F 125codons 



ORF translation from 245-619, direction F 
25 Blastp and/or MPSearch Result: 

Result: UNK 

SEQ ID NO: _216 ENCODED BY SEQ ID NO: _829 

30 

ORF # Start End Direction Codon Length 



2 753 1049 R 99codons 

35 

ORF translation from 753-1049, direction R 
Blastp and/or MPSearch Result: 

Description: 

40 DNA helicase pcrA - Staphylococcus aureus 
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SEQ ID NO: _21 7 ENCODED BY SEQ ID NO: _830 

ORF# Start End Direction Codon Length 



15 



25 



30 



5 1 77 298 F 74codons 

ORF translation from 77-298, direction F 

10 Blastp and/or MPSearch Result: 
Description: 

crtN protein - Staphylococcus aureus 



SEQ ID NO: 218 ENCODED BY SEQ ID NO: 831. 

ORF# Start End Direction Codon Length 



20 1 71 367 F 99codons 

ORF translation from 71-367, direction F 
Blastp and/or MPSearch Result: 



Description: 

crtN protein - Staphylococcus aureus 



SEQ ED NO: _219 ENCODED BY SEQ ID NO: _832_ 

ORF# Start End Direction Codon Length 



35 1 33 356 R 108codons 

ORF translation from 33-356, direction R 
40 Blastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: _220 ENCODED BY SEQ ID NO: _833 

5 ORF# Start End Direction Codon Length 
2 243 362 F 40 codons 

10 ORF translation from 243-362 t direction F 

Blastp and/or MPSearch Result: 
15 Result: UNK 

SEQ ID NO: _221 ENCODED BY SEQ ID NO: _834 

20 

ORF # Start End Direction Codon Length 
1 56 763 R 236 codons 
25 ORF translation from 56-763, direction R 

Blastp and/or MPSearch Result: 

30 Description: 

regulator protein - Staphylococcus xylosus 

SEQ ID NO: 222 ENCODED BY SEQ ID NO: _835 

35 

ORF# Start End Direction Codon Length 
1 145 312 R 56 codons 

40 

ORF translation from 145-312, direction R 
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Blastp and/or MPSearch Result: 



Result: UNK 



5 



SEQ ID NO: _223 ENCODED BY SEQ ID NO: _836. 



ORF # Start End Direction Codon Length 



10 



3 



868 



1131 



F 88 codons 



ORF translation from 868- 1131, direction F 
1 5 Blastp and/or MPSearch Result: 
Result: UNK 

20 SEQIDNO:_224 ENCODED BY SEQ ID NO: _837. 

ORF# Start End Direction Codon Length 
1 24 368 F IIS codons 



ORF translation from 24-368, direction F 

30 Blastp and/or MPSearch Result: 
Description: 

STREPTOCOCCUS PYOGENES. STREPTOCOCCAL ACID GLYCOPROTEIN 

35 

SEQ ID NO: _225 ENCODED BY SEQ ID NO: _838 

ORF# Start End Direction Codon Length 



25 



40 



556 



696 



F 47 codons 



106 
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ORF translation from 556-696, direction F 



PCT/US97/02318 



Blastp and/or MPSearch Result: 

5 

Result: UNK 

10 SEQ ID NO: _226,227 ENCODED BY SEQ ID NO: _839 

ORF # Start End Direction Codon Length 

1 316 501 F 62codons 
15 2 573 1013 F 147codons 

ORF translation from 316-501, direction F 
20 Blastp and/or MPSearch Result: 
Result: UNK 

25 ORF translation from 573-101 3, direction F 

Blastp and/or MPSearch Result: 
30 Result: UNK 

SEQ ID NO: _228 ENCODED BY SEQ ID NO: 840 

35 ORF# Start End Direction Codon Length 

2 221 460 R 80codons 

40 ORF translation from 22 1 -460, direction R 
Blastp and/or MPSearch Result: 

107 
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SEQ ID NO: _229 ENCODED BY SEQ ID NO: _841 

5 

ORF# Start End Direction Codon Length 



2 275 712 F 146codons 

10 

ORF translation from 275-712, direction F 



Blastp and/or MPSearch Result: 

15 

Result: UNK 



SEQ ID NO: _230 ENCODED BY SEQ ID NO: _842 

20 

ORF# Start End Direction Codon Length 



1 141 365 F 75codons 

25 

ORF translation from 141-365, direction F 
Blastp and/or MPSearch Result: 
30 Result: UNK 



35 SEQ ID NO: _23 1 ENCODED BY SEQ ID NO: _843. 

ORF# Start End Direction Codon Length 



1 89 241 F 51 codons 

40 

ORF translation from 89-241, direction F 
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Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: _232 ENCODED BY SEQ ID NO: _844 

10 ORF# Start End Direction Codon Length 
1 128 595 F 156codons 

1 5 ORF translation from 128-595, direction F 

Blastp and/or MPSearch Result: 

20 Description: 

stringent response-like protein - Streptococcus equisimilis 

25 SEQ ID NO: _233 ENCODED BY SEQ ID NO: _845_ 

ORF# Start End Direction Codon Length 

1 230 382 F 51 codons 

30 

ORF translation from 230-382, direction F 
Blastp and/or MPSearch Result: 

35 

Result: UNK 

SEQ ID NO: _234 ENCODED BY SEQ ID NO: _846_ 

40 ORF# Start End Direction Codon Length 

2 682 843 R 54 codons 
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ORF translation from 682-843, direction R 



S Blastp and/or MPSearch Result: 
Description: 

MSU10425 NCBI gi: 595400NCBI gi: 473748 - Mycobacterium smegmatis. 

10 SEQ ID NO: _235 ENCODED BY SEQ ID NO: _847 

ORF# Start End Direction Codon Length 



I 376 573 F 66codons 

15 

ORF translation from 376-573, direction F 



20 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL GAMMA-GLUTAMYLTRANSPEPTIDASE PRECURSOR (EC 

2.3.2.2). 

25 

SEQ ID NO: _236 ENCODED BY SEQ ID NO: _848 

ORF# Start End Direction Codon Length 

30 

1 72 272 R 67codons 



ORF translation from 72-272, direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

40 STAPHYLOCOCCUS AUREUS. FIBRONECTIN-BINDING PROTEIN PRECURSOR 
(FNBP). 
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SEQ ID NO: _237 ENCODED BY SEQ ID NO: _849. 

5 ORF# Start End Direction Codon Length 



I 207 905 F 233codons 



10 ORF translation from 207-905. direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _238 ENCODED BY SEQ ID NO: _850. 

ORF# Start End Direction Codon Length 

20 

2 234 515 F 94codons 



ORF translation from 234-5 15, direction F 

25 

Blastp and/or MPSearch Result: 
Description: 

30 BACILLUS SUBTILIS. UDP-N-A(ZETYlJrfURAMOYLAI-ANIhfE-D-GLUTAMATE 
LIGASE (EC 6.32.9) (UDP-N- ACETYLMURANOYL-L-ALANYL-D^LUTAMATE 
SYNTHETASE). 



35 

SEQ ID NO: _239 ENCODED BY SEQ ID NO: _85 1 

ORF# Start End Direction Codon Length 



40 1 377 733 R 119codons 



ORF translation from 377-733, direction R 

111 



WO 97/30070 



PCT/US97/02318 



Blastp and/or MPSearch Result: 
5 Result: UNK 

SEQ ID NO: 240 ENCODED BY SEQ ID NO: _852. 

10 

ORF # Start End Direction Codon Length 
1 377 670 R 98codons 

15 

ORF translation from 377-670, direction R 
Blastp and/or MPSearch Result: 
20 Result: UNK 

25 SEQ ID NO: _241 ENCODED BY SEQ ID NO: _853„ 

ORF # Start End Direction Codon Length 
1 132 329 F 66codons 

30 

ORF translation from 132-329, direction F 

35 Blastp and/or MPSearch Result: 
Description: 

Potential prenylated protein 

40 

SEQ ID NO: _242 ENCODED BY SEQ ID NO: _854 

ORF # Start End Direction Codon Length 
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1 179 370 R 64codons 
ORF translation from 179-370, direction R 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _243 ENCODED BY SEQ ID NO: _855. 

ORF # Start End Direction Codon Length 
1 270 875 R 202codons 



ORF translation from 270-875, direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 



SEQ ID NO: 244 ENCODED BY SEQ ID NO: _856 
30 ORF# Start End Direction Codon Length 



1 94 216 F 41 codons 
ORF translation from 94-216, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _245 ENCODED BY SEQ ID NO: _857 

ORF# Start End Direction Codon Length 
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2 261 515 F 85codons 

5 ORF translation from 261-515, direction F 

Blastp and/or MPSearch Result: 
10 Result: UNK 



15 



20 



25 



30 



35 



40 



SEQ ID NO: _246 ENCODED BY SEQ ID NO: _858_ 

ORF# Start End Direction Codon Length 



1 328 597 F 90codons 

ORF translation from 328-597, direction F 

Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: 247 ENCODED BY SEQ ID NO: _859_ 
ORF# Start End Direction Codon Length 



1 76 543 F 156codons 



ORF translation from 76-543, direction F 



Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTTLIS. ADENYLOSUCCINATE LYASE (EC 4.3.2.2) 
(ADENYLOSUCCINASE) (ASL). 
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SEQ ID NO: _248 ENCODED BY SEQ ID NO: _860 

5 ORF# Start End Direction Codon Length 
I 665 982 F 106codons 

10 ORF translation from 665-982, direction F 

Blastp and/or MPSearch Result: 

15 Description: 

membrane-associated component, branched amino acid transport system 

20 SEQ ID NO: _249 ENCODED BY SEQ ID NO: _861 

ORF# Start End Direction Codon Length 
1 45 296 R 84codons 

25 

ORF translation from 45-296, direction R 

30 Blastp and/or MPSearch Result: 
Result: UNK 

35 SEQ ID NO: 250 ENCODED BY SEQ ID NO: _862 

ORF # Start End Direction Codon Length 
1 29 256 F 76codons 

40 

ORF translation from 29-256, direction F 
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Blastp and/or MPSearch Result: 
Description: 

5 BACILLUS SUBTILIS. NAPHTHOATE SYNTHASE (EC 4.1 .3.36) 

(DMYDROXYNAPHTHOIC ACID SYNTHETASE) (DHNA SYNTHETASE). 



10 SEQ ID NO: 251 ENCODED BY SEQ ID NO: _863 

ORF# Start End Direction Codon Length 



1 136 282 R 49codons 

15 

ORF translation from 136-282, direction R 



20 Blastp and/or MPSearch Result: 
Result: UNK 



25 SEQ ID NO: _252 ENCODED BY SEQ ID NO: _864. 

ORF# Start End Direction Codon Length 



1 119 442 F 108 codons 

30 

ORF translation from 1 19-442, direction F 



35 Blastp and/or MPSearch Result 
Description: 

ESCHERICHIA COLI. MOLYBDATE-BINDING PERIPLASMIC PROTEIN 
PRECURSOR. 
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SEQ ID NO: _253 ENCODED BY SEQ ID NO: _865 

ORF # Start End Direction Codon Length 
S 1 1 204 R 68codons 

ORF translation from 1-204, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _254 ENCODED BY SEQ ID NO: _866 

ORF# Start End Direction Codon Length 

20 

1 324 611 F 96 codons 

ORF translation from 324-61 1, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _255 ENCODED BY SEQ ID NO: _867 

35 ORF# Start End Direction Codon Length 
1 93 395 F 101 codons 

40 ORF translation from 93-395, direction F 

Blastp and/or MPSearch Result: 
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Description: 

STREPTOCOCCUS PNEUMONIAE. DNA MISMATCH REPAIR PROTEIN HEXA. 

5 

SEQ ID NO: _256 ENCODED BY SEQ ID NO: _868 

ORF# Start End Direction Codon Length 
2 418 621 R 68codons 

ORF translation from 418-621, direction R 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _257 ENCODED BY SEQ ID NO: _869 

ORF# Start End Direction Codon Length 
25 1 177 398 R 74codons 

ORF translation from 177-398, direction R 

30 Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _258 ENCODED BY SEQ ID NO: _870 

ORF # Start End Direction Codon Length 

40 

2 208 549 F 114codons 
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ORF translation from 208-549, direction F 

Blastp and/or MPSearch Result: 

5 

Description: 

BACILLUS SUBTTLIS. ELONGATION FACTOR TO (EF-TU). 

10 SEQ ID NO: _259 ENCODED BY SEQ ID NO: 871 

ORF# Start End Direction Codon Length 



15 



25 



40 



1 171 299 F 43codons 



ORF translation from 171-299, direction F 



20 Blastp and/or MPSearch Result: 
Result: UNK 



SEQ ID NO: _260 ENCODED BY SEQ ID NO: _872 

ORF# Start End Direction Codon Length 



30 1 11 478 F 156codons 

ORF translation from 1 1-478, direction F 
35 Blastp and/or MPSearch Result: 
Description: 

dehydrogenase/reductase-like protein 



SEQ ID NO: _261 ENCODED BY SEQ ID NO: _873 

ORF# Start End Direction Codon Length 
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1 85 393 F 103 codons 
ORF translation from 85-393, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

10 BACILLUS SUBTILIS. CARBAMOYL-PHOSPHATE SYNTHASE, PYRIMIDINE- 

SPECIFIC, LARGE CHAIN (EC 6.3.5.5) (CARBAMOYL-PHOSPHATE SYNTHETASE 
AMMONIA CHAIN). 

15 

SEQ ID NO: _262 ENCODED BY SEQ ID NO: _874 

ORF# Start End Direction Codon Length 
20 t 152 334 F 61 codons 

ORF translation from 152-334, direction F 
25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: 263 ENCODED BY SEQ ID NO: _875 

ORF# Start End Direction Codon Length 
35 1 124 435 F 104 codons 

ORF translation from 124-435, direction F 

40 

Blastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: _264 ENCODED BY SEQ ID NO: _876 

5 

ORF# Start End Direction Codon Length 



1 126 719 F 198codons 

10 

ORF translation from 126-719, direction F 



Blastp and/or MPSearch Result: 

15 

Description: 

EF protein - Streptococcus suis 

20 SEQ ID NO: _265 ENCODED BY SEQ ID NO: _877 

ORF # Start End Direction Codon Length 



1 83 361 R 93codons 

25 



ORF translation from 83-361, direction R 
Blastp and/or MPSearch Result: 

30 

Description: 

methionyl-tRNA formyltransferase (fmt) homolog - Haemophilus 



35 SEQ ID NO: 266 ENCODED BY SEQ ID NO: _878 
ORF# Start End Direction Codon Length 



1 131 559 R 143codons 

40 



ORF translation from 131-559, direction R 
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Blastp and/or MPSearch Result: 
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Result: UNK 

5 

SEQ ID NO: _267 ENCODED BY SEQ ID NO: _879 

ORF# Start End Direction Codon Length 



10 1 93 290 R 66codons 

ORF translation from 93-290, direction R 

15 

Blastp and/or MPSearch Result: 
Description: 

STAPHYLOCOCCUS CARNOSUS. TRANSCRIPTION ANTITERMINATION 
20 PROTEIN NUSG. 



SEQ ID NO: _268 ENCODED BY SEQ ID NO: _880. 

25 ORF# Start End Direction Codon Length 



2 166 378 R 71 codons 



30 ORF translation from 1 66-378, direction R 



Blastp and/or MPSearch Result: 
35 Description: 

BACILLUS SUBTTLIS. FERRICHROME-BINDING PROTEIN PRECURSOR. 



SEQ ID NO: _269 ENCODED BY SEQ ID NO: 881 

40 

ORF # Start End Direction Codon Length 



2 242 523 F 94 codons 
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ORF translation from 242-523, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

STREPTOCOCCUS MUTANS. GTP-BINDING PROTEIN ERA HOMOLOG. 

10 



SEQ ID NO: _270 ENCODED BY SEQ ED NO: _882 

15 

ORF # Start End Direction Codon Length 



1 38 409 F I24codons 

20 

ORF translation from 38-409, direction F 



Blastp and/or MPSearch Result: 

25 

Description: 

Bacterial regulatory protein, gntR family 

30 SEQ ID NO: _271 ENCODED BY SEQ ID NO: _883. 

ORF # Start End Direction Codon Length 



2 116 448 R lllcodons 

35 

ORF translation from 1 16-448, direction R 
Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: _272 ENCODED BY SEQ ID NO: _884. 

ORF# Start End Direction Codon Length 
1 148 501 R 118codons 



ORF translation from 148-501, direction R 

10 

Blastp and/or MPSearch Result: 
Description: 

15 BACILLUS SUBTTLIS. GMP SYNTHASE (GLUTAMINE-HYDROLYZING) (EC 
6.3.5.2) (GLUT AMINE AMIDOTRANSFERASE). 



SEQ ID NO: _273 ENCODED BY SEQ ID NO: _885. 

20 

ORF# Start End Direction Codon Length 



1 71 415 F 115codons 

25 

ORF translation from 71-415, direction F 



Blastp and/or MPSearch Result: 

30 

Result: UNK 



SEQ ID NO: __274 ENCODED BY SEQ ID NO: _886 

35 

ORF# Start End Direction Codon Length 



2 130 384 R 85codons 

40 

ORF translation from 130-384, direction R 
Blastp and/or MPSearch Result: 
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Description: 

BACILLUS STEAROTHERMOPHILUS. N-ACETYL-GAMMA-GLUTAMYL- 
PHOSPHATE REDUCTASE (EC 1.2.1.38) (N-ACETYL- GLUTAMATE 
5 SEMIALDEHYDE DEHYDROGENASE) (NAGSA DEHYDROGENASE) 
(FRAGMENT). 



SEQ ID NO: _275 ENCODED BY SEQ ID NO: _887 

10 

ORF # Start End Direction Codon Length 



1 111 293 F 61codons 

15 

ORF translation from 1 1 1-293, direction F 
Blastp and/or MPSearch Result: 
20 Description: 

ESCHERICHIA COLL PHOH PROTEIN (PHOSPHATE STARVATTON-INDUCIBLE 
PROTEIN PSIH). 



25 

SEQ ID NO: _276,277 ENCODED BY SEQ ID NO: _888. 

ORF # Start End Direction Codon Length 



30 1 196 348 R 51 codons 

2 481 852 R 124 codons 



ORF translation from 196-348, direction R 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

ORF translation from 481-852, direction R 
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Blastp and/or MPSearch Result: 
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Result: UNK 

5 

SEQ ID NO: _278,279 ENCODED BY SEQ ID NO: _889 

ORF # Start End Direction Codon Length 



10 1 339 1004 F 222codons 

2 1193 1750 F 186codons 



ORF translation from 339-1004, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

S.carnosus mtlA gene for enzyme II(Mannitol). 

20 

ORF translation from 1 193-1750, direction F 



25 Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _280_ ENCODED BY SEQ ID NO: _890 

30 

ORF# Start End Direction Codon Length 



1 111 446 R 112codons 

35 

ORF translation from 1 1 1-446, direction R 



Blastp and/or MPSearch Result: 

40 

Result: UNK 



126 



WO 97/30070 PCT/US97/02318 

SEQ ID NO: _281,282_ ENCODED BY SEQ ID NO: 891 

ORF# Start End Direction Codon Length 



10 



15 



1 305 637 R lllcodons 

2 664 1473 R 270codons 



ORF translation from 305-637, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

ORF translation from 664-1473, direction R 
Blastp and/or MPSearch Result: 



20 Description: 

glucoamylase-like protein 



25 SEQ ID NO: _283_ ENCODED BY SEQ ID NO: _892 
ORF# Start End Direction Codon Length 



2 962 1159 F 66codons 

30 

ORF translation from 962- 1 1 59, direction F 
Blastp and/or MPSearch Result: 

35 

Result: UNK 



SEQ ID NO: _284_ ENCODED BY SEQ ID NO: _893. 

40 

ORF# Start End Direction Codon Length 



2 1102 1683 F 194codons 
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ORF translation from 1 102-1683, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

botulinum neurotoxin type CI precursor - Clostridium botulinum 

10 

SEQ ID NO: _285 ENCODED BY SEQ ID NO: _894 

15 ORF# Start End Direction Codon Length 
1 256 474 F 73codons 

20 ORF translation from 256-474, direction F 

Blastp and/or MPSearch Result: 
25 Result: UNK 

SEQ ID NO: _286_ ENCODED BY SEQ ID NO: _895 

30 ORF# Start End Direction Codon Length 
1 132 353 F 74codons 

35 ORF translation from 1 32-353, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _287_ ENCODED BY SEQ ID NO: _896 
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ORF# Start End Direction Codon Length 
1 82 585 F I68codons 
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ORF translation from 82-585, direction F 

10 Blastp and/or MPSearch Result: 
Result: UNK 



15 



SEQ ID NO: _288_ ENCODED BY SEQ ID NO: _897. 
ORF# Start End Direction Codon Length 



20 1 80 229 F SOcodons 

ORF translation from 80-229, direction F 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 SEQ ID NO: _289,290_ ENCODED BY SEQ ID NO: _898_ 
ORF# Start End Direction Codon Length 



1 271 987 F 239 codons 
35 2 1131 1436 F 102 codons 

ORF translation from 271-987, direction F 



40 Blastp and/or MPSearch Result: 
Result: UNK 
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ORF translation from 1 131-1436, direction F 

5 Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _291 ENCODED BY SEQ ID NO: _899 

ORF # Start End Direction Codon Length 
15 1 62 454 F 131 codons 

ORF translation from 62-454. direction F 

20 Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _292_ ENCODED BY SEQ ID NO: 900 

25 

ORF# Start End Direction Codon Length 
1 225 509 R 95 codons 

30 

ORF translation from 225-509, direction R 
Blastp and/or MPSearch Result: 

35 

Description: 

clumping factor - Staphylococcus aureus 

40 

SEQ ID NO: _293_ ENCODED BY SEQ ID NO: _901 

ORF # Start End Direction Codon Length 

130 



WO 97/30070 

1 1183 1527 R 115 codons 

5 ORF translation from 1 1 83- 1 527 t direction R 
Blastp and/or MPSearch Result: 
Description: 

10 anaerobic ribonucleoside-triphosphate reductase (nrdD) homolog - 

SEQ ID NO: _294_ ENCODED BY SEQ ID NO: 902 
IS ORF# Start End Direction Codon Length 
1 47 364 R 106 codons 

20 ORF translation from 47-364, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _295,296_ ENCODED BY SEQ ID NO: _903 

30 ORF# Start End Direction Codon Length 

1 21 308 R 96 codons 

2 417 590 R 58 codons 

35 

ORF translation from 21-308. direction R 
Blastp and/or MPSearch Result: 
40 Description: 

BACILLUS SUBTTUS. DIPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN 
DPPB. 
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ORF translation from 417-590, direction R 
Blastp and/or MPSearch Result: 

5 

Result: UNK 



SEQ ID NO: _297_ ENCODED BY SEQ ID NO: _904 

10 

ORF# Start End Direction Codon Length 



1 96 443 F 116codons 

15 

ORF translation from 96-443, direction F 

Blastp and/or MPSearch Result: 

20 Description: 

AZOTOBACTER CHROOCOCCUM MCD 1. NIFU PROTEIN. 



SEQ ID NO: _298_ ENCODED BY SEQ ID NO: _905 

25 

ORF# Start End Direction Codon Length 
2 244 381 F 46 codons 

30 

ORF translation from 244-381, direction F 
Blastp and/or MPSearch Result: 
35 Result: UNK 



SEQ ID NO: _299_ ENCODED BY SEQ ID NO: _906. 
40 ORF# Start End Direction Codon Length 



2 565 1086 R 174 codons 
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ORF translation from 565-1086. direction R 
Blastp and/or MPSearch Result: 

5 

Result: UNK 

SEQ ID NO: _300_ ENCODED BY SEQ ID NO: _907 
10 ORF # Start End Direction Codon Length 



25 



30 



35 



40 



1 868 1245 R 126codons 

1 5 ORF translation from 868-1 245, direction R 

Blastp and/or MPSearch Result: 
20 Result: UNK 



SEQ ID NO: _301_ ENCODED BY SEQ ID NO: _908_ 
ORF# Start End Direction Codon Length 



1 129 311 R 61codons 

ORF translation from 129-3 1 1, direction R 

Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _302_ ENCODED BY SEQ ID NO: _909_ 
ORF# Start End Direction Codon Length 



1 859 1233 F I25codons 
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ORF translation from 859-1233, direction F 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQ ID NO: _303_ ENCODED BY SEQ ID NO: _910 
ORF# Start End Direction Codon Length 
1 292 453 F 54codons 



ORF translation from 292-453, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _304,305_ ENCODED BY SEQ ID NO: _91 1 
ORF# Start End Direction Codon Length 



30 1 547 1056 R 170codons 

3 1289 1597 R 103codons 



ORF translation from 547-1056, direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

40 deoxyuridine 5-triphosphatase homologue 
ORF translation from 1289-1597, direction R 
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Blastp and/or MPSearch Result: 

Result: UNK 

5 

SEQ ID NO: _306 _ ENCODED BY SEQ ID NO: _912 

ORF # Start End Direction Codon Length 
10 

1 593 946 F H8codons 
ORF translation from 593-946, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _307 _ ENCODED BY SEQ ID NO: _913 

ORF # Start End Direction Codon Length 
25 1 290 550 R 87codons 

ORF translation from 290-550, direction R 
30 Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _308 _ ENCODED BY SEQ ID NO: _914 

ORF# Start End Direction Codon Length 
40 1 126 374 R 83codons 

ORF translation from 126-374, direction R 
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Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: _309 _ ENCODED BY SEQ ID NO: _915 
10 ORF# Start End Direction Codon Length 
2 716 1018 F lOlcodons 

15 ORF translation from 716-1018, direction F 

Blastp and/or MPSearch Result: 
20 Result: UNK 

SEQ ID NO: _310 _ ENCODED BY SEQ ID NO: _916 
25 ORF # Start End Direction Codon Length 

1 40 546 F 169codons 

ORF translation from 40-546, direction F 
30 Blastp and/or MPSearch Result: 

Result: UNK 

35 

SEQ ID NO: _31 1 _ ENCODED BY SEQ ID NO: _917 

ORF # Start End Direction Codon Length 
40 

2 716 949 R 78codons 

ORF translation from 716-949, direction R 

136 
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Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: 312 ENCODED BY SEQ ID NO: _918 

ORF# Start End Direction Codon Length 
1 417 839 R Hlcodons 



ORF translation from 417-839, direction R 

15 

Blastp and/or MPSearch Result: 
Description: 

20 STAPHYLOCOCCUS AUREUS. CAPD PROTEIN. 



25 



30 



35 



SEQ ID NO: _3 13 _ ENCODED BY SEQ ID NO: _919_ 
ORF # Start End Direction Codon Length 



I 43 231 F 63codons 

ORF translation from 43-23 1 > direction F 

Blastp and/or MPSearch Result: 
Result: UNK 



40 SEQ ID NO: _31431 5 _ ENCODED BY SEQ ID NO: _920_ 
ORF # Start End Direction Codon Length 
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1 429 956 F 176codons 

2 1135 1299 F 55 codons 

ORF translation from 429-956, direction F 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

ORF translation from 1 135-1299, direction F 



Biastp and/or MPSearch Result: 

15 

Result: UNK 



SEQ ID NO: 316 ENCODED BY SEQ ID NO: _921 

20 

ORF# Start End Direction Codon Length 



2 617 862 F 82 codons 

25 

ORF translation from 617-862, direction F 

Blastp and/or MPSearch Result: 

30 Description: 

STAPHYLOCOCCUS AUREUS. CAPB PROTEIN. 



35 SEQ ID NO: _317_ ENCODED BY SEQ ID NO: _922 
ORF # Start End Direction Codon Length 



1 163 777 R 205 codons 

40 

ORF translation from 163-777, direction R 
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Blastp and/or MPSearch Result: 



PCTVUS97/02318 



Description: 

STAPHYLOCOCCUS AUREUS. CAPB PROTEIN. 

5 

SEQ ID NO: _3 1 8_ ENCODED BY SEQ ID NO: _923. 
ORF # Start End Direction Codon Length 
1 385 813 F 143codons 



ORF translation from 385-8 1 3, direction F 
1 5 Blastp and/or MPSearch Result: 

Description: 

ESCHERICHIA COLL THREONINE DEHYDRATASE CATABOLIC (EC 4.2.1.16) 
(THREONINE DEAMINASE). 

20 



SEQ ID NO: _319,320_ ENCODED BY SEQ ID NO: _924 

25 

ORF # Start End Direction Codon Length 



1 69 635 F 189codons 
4 2862 3095 F 78codons 

30 

ORF translation from 69-635, direction F 
Blastp and/or MPSearch Result: 

35 Result: UNK 



ORF translation from 2862-3095, direction F 

40 

Blastp and/or MPSearch Result: 
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Result: UNK 
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5 

SEQ ID NO: _321,322_ ENCODED BY SEQ ID NO: _925. 

ORF# Start End Direction Codon Length 

10 1 213 593 F 127codons 

2 848 1657 F 270codons 



ORF translation from 213-593, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

nerve growth factor (promoter) [human. Genomic, 486 nt]. ptsG protein - Staphylococcus 
20 camosus 



ORF translation from 848- 1 657, direction F 
Blastp and/or MPSearch Result: 

25 

Description: 

Bacillus subtilis phospho-beta-glucosidase (bgIA) gene, complete 
SEQ ID NO: _323_ ENCODED BY SEQ ID NO: 926 

30 

ORF# Start End Direction Codon Length 
1 235 342 F 36 codons 

35 

ORF translation from 235-342, direction F 
Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: _324_ ENCODED BY SEQ ID NO: _927. 
5 ORF # Start End Direction Codon Length 



1 345 743 R 133codons 
ORF translation from 345-743, direction R 

10 

Blastp and/or MPSearch Result: 
Description: 

ENTEROCOCCUS FAECALIS (STREPTOCOCCUS FAECALIS). AUTOLYSIN (EC 
1 5 3.5. 1 .28) (N-ACETYLMURAMOYL-L-ALANINE AMIDASE). 



SEQ ID NO: _325_ ENCODED BY SEQ ID NO: _928 
20 ORF# Start End Direction Codon Length 



2 647 877 F 77 codons 



25 ORF translation from 647-877, direction F 



Blastp and/or MPSearch Result: 

c 

30 Result: UNK 



35 SEQ ID NO: _326_ ENCODED BY SEQ ID NO: _929_ 
ORF# Stan End Direction Codon Length 



40 



1 57 431 F 125 codons 
ORF translation from 57-431, direction F 
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Blastp and/or MPSearch Result: 
Result: UNK 

5 

SEQ ID NO: _327,328,329_ ENCODED BY SEQ ID NO: _930. 
ORF # Start End Direction Codon Length 

1 22 216 R 65codons 

2 231 707 R 159codons 

3 874 1119 R 82codons 

15 

ORF translation from 22-216, direction R 



Blastp and/or MPSearch Result: 

20 

Result: UNK 

ORF translation from 231-707, direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

aspartate carbamoyltransferase (EC 2.1 .3.2) - Bacillus caldolyticus 

30 

ORF translation from 874-1 119, direction R 
Blastp and/or MPSearch Result: 

35 

Description: 

aspartate carbamoyltransferase (EC 2.1 .3.2) - Bacillus caldolyticus 

40 

SEQ ID NO: _330_ ENCODED BY SEQ ID NO: _931 

ORF # Start End Direction Codon Length 
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1 198 506 F 103 codons 
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5 ORF translation from 1 98-506, direction F 



Blastp and/or MPSearch Result: 
10 Description: 

BACILLUS SUBTILIS. ACETYL-COENZYME A SYNTHETASE (EC 6.2. 1.1) 
(ACETATE— COA LIGASE) (ACYL- ACTIVATING ENZYME) (ACETYL-COA 
SYNTHASE). 

15 

SEQ ID NO: _331,332,333,334_ ENCODED BY SEQ ID NO: _932 

ORF# Start End Direction Codon Length 



1 


10 


342 


F 111 codons 


2 


S82 


749 


R 56 codons 


3 


1262 


1597 


F 112 codons 


4 


1634 


1948 


F 105 codons 



25 

ORF translation from 10-342, direction F 

Blastp and/or MPSearch Result: 

30 Description: 

bicyclomycin resistance protein (bcr) homolog - Haemophilus 



ORF translation from 582-749, direction R 

35 



Blastp and/or MPSearch Result: 
Result: UNK 

40 



ORF translation from 1262-1597, direction F 
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Blastp and/or MPSearch Result: 
Description: 

5 BACILLUS SUBTHJS. MEMBRANE-BOUND PROTEIN LYTR. 



ORF translation from 1634-1948, direction F 
Blastp and/or MPSearch Result; 

10 

Description: 

ESCHERICHIA COLL MULTIDRUG RESISTANCE PROTEIN A. 

15 

SEQ ID NO: _335,336,337_ ENCODED BY SEQ ID NO: _933_ 
ORF # Start End Direction Codon Length 

1 319 696 F 126 codons 

2 758 1024 F 89 codons 

3 1318 1596 R 93 codons 

25 

ORF translation from 319-696, direction F 



30 Blastp and/or MPSearch Result: 
Result: UNK 



35 ORF translation from 758- 1024, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

40 

ORF translation from 1318-1596. direction R 
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Blastp and/or MPSearch Result: 

Result: UNK 

5 

SEQ ID NO: _338_ ENCODED BY SEQ ID NO: _934 

10 ORF# Start End Direction Codon Length 
1 72 335 F 88 codons 

15 ORF translation from 72-335, direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _339,340_ ENCODED BY SEQ ID NO: _935 

25 ORF# Start End Direction Codon Length 

1 108 392 R 95 codons 

2 824 1660 R 279 codons 

30 

ORF translation from 108-392, direction R 
Blastp and/or MPSearch Result: 
35 Result: UNK 

ORF translation from 824-1660, direction R 
40 Blastp and/or MPSearch Result: 
Result: UNK 
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5 SEQID NO: _341_ ENCODED BY SEQ ID NO: _936 
ORF# Start End Direction Codon Length 



1 343 747 F 135codons 

10 

ORF translation from 343-747, direction F 



15 Blastp and/or MPSearch Result: 
Description: 

STREPTOCOCCUS MUTANS. M ANNTTOL- 1 -PHOSPHATE 5-DEHYDROGENASE 
(EC 1.1.1.17). 

20 

SEQ ID NO: _342 t 343_ ENCODED BY SEQ ID NO: _937 

ORF# Start End Direction Codon Length 
25 

1 166 510 F 115codons 

2 511 792 F 94codons 

ORF translation from 166-510, direction F 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 

ORF translation from 51 1-792, direction F 
Blastp and/or MPSearch Result: 
40 Result: UNK 
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SEQ ID NO: _344,345_ ENCODED BY SEQ ID NO: _938 

ORF# Stait End Direction Codon Length 

5 1 114 623 R 170codons 

2 874 1248 R 125 codons 

ORF translation from 1 14-623, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

ORF translation from 874-1248, direction R 
Blastp and/or MPSearch Result: 
20 Result: UNK 



SEQ ID NO: _346_ ENCODED BY SEQ ID NO: _939 

25 

ORF# Start End Direction Codon Length 



1 180 1130 R 317 codons 

30 

ORF translation from 180-1 130, direction R 
Blastp and/or MPSearch Result: 

Result: UNK 

35 



SEQ ID NO: _347_ ENCODED BY SEQ ID NO: _940. 
40 ORF# Start End Direction Codon Length 



1 68 271 F 68 codons 

147 



WO 97/30070 



PCT/US97/02318 



ORF translation from 68*271, direction F 



5 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLI. SULFITE REDUCTASE (NADPH) FLAVOPROTEIN ALPHA- 
COMPONENT (EC 1.8.1.2) (SIR-FP). 

10 



SEQ ID NO: _348,349_ ENCODED BY SEQ ID NO: _941 
15 ORF# Start End Direction Codon Length 



1 564 752 F 63 codons 

2 1010 1549 F 180 codons 

20 ORF translation from 564-752, direction F 



Blastp and/or MPSearch Result: 

25 

Result: UNK 



ORF translation from 1010-1549, direction F 

30 

Blastp and/or MPSearch Result: 
Description: 

Pfeptostreptococcus magnus protein L gene, complete cds. 

35 



SEQ ID NO: _350_ ENCODED BY SEQ ID NO: _942 
40 ORF# Start End Direction Codon Length 



2 1472 1843 F 124 codons 

148 
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ORF translation from 1472-1843, direction F 

5 Blastp and/or MPSearch R suit: 
Result: UNK 

10 

SEQ ID NO: _35 1_ ENCODED BY SEQ ID NO: 
ORF# Start End Direction Codon Length 
15 1 222 335 F 38codons 

ORF translation from 222-335, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _352_ ENCODED BY SEQ ID NO: 

ORF# Start End Direction Codon Length 
30 

1 65 511 R 149codons 
ORF translation from 65-51 1. direction R 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 
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.944. 



WO 97/30070 PCT/US97/02318 

SEQ ID NO: _353_ ENCODED BY SEQ ID NO: _945 

ORF# Start End Direction Codon Length 
5 1 259 540 R 94codons 

ORF translation from 259-540, direction R 
Blastp and/or MPSearch Result: 

10 

Result: UNK 

SEQ ID NO: _354_ ENCODED BY SEQ ID NO: 946 
15 ORF# Start End Direction Codon Length 
1 734 1033 F lOOcodons 

20 ORF translation from 734-1033, direction F 
Blastp and/or MPSearch Result 
Description: 

25 BACILLUS SUBTIUS. RIBOFLAVIN BIOSYNTHESIS PROTEIN RIBG. 

SEQ ID NO: _355,356,357_ ENCODED BY SEQ ID NO: _947 

30 ORF # Start End Direction Codon Length 



1 


147 


362 


F 


72 codons 


2 


510 


794 


F 


95 codons 


3 


997 


1461 


F 


155 codons 



35 

ORF translation from 147-362, direction F 

40 Blastp and/or MPSearch Result: 
Result: UNK 

150 
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ORF translation from 510-794, direction F 

5 Blastp and/or MPSearch Result: 
Result: UNK 

10 ORF translation from 997-1461, direction F 

Blastp and/or MPSearch Result: 
15 Result: UNK 



SEQ ID NO: _358_ ENCODED BY SEQ ID NO: _948. 

20 

ORF # Start End Direction Codon Length 



2 188 811 F 208codons 

25 

ORF translation from 188-81 1, direction F 



Blastp and/or MPSearch Result: 

30 

Result: UNK 



SEQ ID NO: _359_ ENCODED BY SEQ ID NO: _949 

35 

ORF# Start End Direction Codon Length 



1 146 859 F 238codons 

40 

ORF translation from 146-859, direction F 
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Blastp and/or MPSearch Result: 



PCIYUS97/02318 



Description: 

BACILLUS SUBTILIS. DNA REPAIR PROTEIN RECN (RECOMBINATION 
5 PROTEIN N). 



SEQ ID NO: _360_ ENCODED BY SEQ ID NO: _950. 

10 

ORF# Start End Direction Codon Length 



1 290 592 R lOlcodons 

15 

ORF translation from 290-592, direction R 



Blastp and/or MPSearch Result: 

20 

Description: 

BACILLUS SUBTILIS. UTP-GLUCOSE- 1 -PHOSPHATE URTOYLYLTRANSFERASE 
(EC 2.7.7.9) (UDP-GLUCOSE PYROPHOSPHORYLASE) (UDPGP) (ALPHA-D- 
GLUCOS YL- 1 -PHOSPHATE URTOYLYLTRANSFERASE) (URIDINE 
25 DIPHOSPHOGLUCOSE PYROPHOSPHORYLASE). 



SEQ ID NO: _361_ ENCODED BY SEQ ID NO: _951 
30 ORF# Start End Direction Codon Length 



1 302 967 F 222codons 



35 ORF translation from 302-967, direction F 
Blastp and/or MPSearch Result: 
Description: 

40 alcohol dehydrogenase (EC 1.1.1.1)- Entamoeba histolytica 
SEQ ID NO: _362_ ENCODED BY SEQ ID NO: _952_ 
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ORF # Start End Direction Codon Length 



PCT/US97/02318 



5 



2 855 1202 R 116codons 



ORF translation from 855-1202, direction R 
Blastp and/or MPSearch Result: 
10 Result: UNK 



SEQ ID NO: _363,364__ ENCODED BY SEQ ID NO: _953. 

15 

ORF # Start End Direction Codon Length 



2 742 1053 F 104codons 

3 1267 1464 F 66 codons 

20 

ORF translation from 742-1053, direction F 



25 Blastp and/or MPSearch Result: 

Description: 

E. coli priA gene. 

30 

ORF translation from 1267-1464, direction F 



Blastp and/or MPSearch Result: 

35 

Result: UNK 



40 SEQ ID NO: _365_ ENCODED BY SEQ ID NO: _954 
ORF# Start Bid Direction Codon Length 
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1 44 646 F 201 codons 
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ORF translation from 44-646, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

GLUCOSAMINE-6-PHOSPHATE ISOMERASE (EC 5.3.1.10) (GLUCOSAMINE-6- 
10 PHOSPHATE DEAMINASE). - ESCHERICHIA COLL 



SEQ ID NO: _366_ ENCODED BY SEQ ID NO: _955. 

15 

ORF# Start End Direction Codon Length 



1 73 408 F 112 codons 

20 

ORF translation from 73-408, direction F 
Blastp and/or MPSearch Result: 
25 Result: UNK 



SEQ ID NO: _367_ ENCODED BY SEQ ID NO: _956 

30 

ORF# Start End Direction Codon Length 



1 74 937 F 288 codons 

35 

ORF translation from 74-937, direction F 
Blastp and/or MPSearch Result: 

Result: UNK 

40 
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SEQ ID NO: _368_ ENCODED BY SEQ ID NO: _957 
ORF# Start End Direction Codon Length 
5 2 797 976 R 60codons 

ORF translation from 797-976, direction R 

10 

Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTTLIS. QUINOL OXIDASE POLYPEPTIDE I (EC 1 .9.3.-) (QUINOL 
15 OXIDASE codons3-600, SUB UNIT QOXB). 



PCT/US97/02318 



SEQ ID NO: _369_ ENCODED BY SEQ ID NO: _958. 

20 

ORF# Start End Direction Codon Length 



2 672 974 R 101 codons 

25 

ORF translation from 672-974, direction R 



Blastp and/or MPSearch Result: 

30 

Description: 

BACILLUS SUBTTLIS. PTS SYSTEM, GLUCOSE-SPECIFIC IIABC COMPONENT 
(EHABC-GLC) (GLUCOSE- PERMEASE IIABC COMPONENT) 
(PHOSPHOTRANSFERASE ENZYME II, ABC COMPONENT) (EC 2.7.1.69) (EH-GLC 
35 /FJE-GLC). 



SEQ ID NO: _370_ ENCODED BY SEQ ID NO: _959. 

40 

ORF# Start End Direction Codon Length 



1 195 791 R 199 codons 
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ORF translation from 195-791, direction R 

5 

Blastp and/or MPSearch Result: 
Description: 

STAPHYLOCOCCUS AUREUS. PTS SYSTEM, LACTOSE-SPECIFIC IIBC 
10 COMPONENT (EHBC-LAC) (LACTOSE- PERMEASE IIBC COMPONENT) 

(PHOSPHOTRANSFERASE ENZYME H BC COMPONENT) (EC 2.7.1.69) (EII-LAC). 



15 SEQ ID NO: _371,372*373_ ENCODED BY SEQ ID NO: _960 
ORF# Start End Direction Codon Length 





1 


190 


480 


F 


97 codons 


20 


2 


536 


703 


F 


56 codons 




3 


761 


1186 


F 


142 codons 



ORF translation from 190-480, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

ORF translation from 536-703, direction F 



Blastp and/or MPSearch Result: 

35 

Result: UNK 

ORF translation from 761-1 186, direction F 
40 Blastp and/or MPSearch Result: 
Description: 
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BACILLUS SUBTTLIS. DIVISION INITIATION PROTEIN (CELL DIVISION AND 
SPORULATION PROTEIN). 



5 



SEQ ID NO: _374_ ENCODED BY SEQ ID NO: _961. 
ORF# Start End Direction Codon Length 



10 ! 154 585 F 144codons 

ORF translation from 154-585, direction F 
1 5 Blastp and/or MPSearch Result: 
Result: UNK 



20 



30 



SEQ ID NO: _375_ ENCODED BY SEQ ID NO: _962. 
ORF# Start End Direction Codon Length 



25 2 187 477 F 97 codons 



ORF translation from 187-477, direction F 
Blastp and/or MPSearch Result: 

Result: UNK 



35 SEQ ID NO: _376_ ENCODED BY SEQ ID NO: _ 963 
ORF# Start End Direction Codon Length 



40 



1 180 479 F 100 codons 
ORF translation from 180-479, direction F 
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Blastp and/or MPSearch Result: 



PCT/US97/02318 



Description: 

ESCHERICHIA COLL POTASSIUM-TRANSPORTING ATPASE (EC 3.6.1.36). B 
5 CHAIN (ATP PHOSPHOHYDROLASE (POTASSIUM-TRANSPORTING), B CHAIN). 

SEQ ID NO: _377_ ENCODED BY SEQ ID NO: _964 

ORF# Start End Direction Codon Length 

1 285 524 R 80codons 



ORF translation from 285-524, direction R 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 



SEQ ID NO: _378379_ ENCODED BY SEQ ID NO: _965 
25 ORF# Start End Direction Codon Length 



1 257 487 F 77 codons 

2 551 919 F 123 codons 

30 

Blastp and/or MPSearch Result: 
Description: 

35 ESCHERICHIA COLL CYTOCHROME O UBIQUINOL OXIDASE SUBUNTT I (EC 

1.10.3.-). 



ORF translation from 55 1-919, direction F 

40 

Blastp and/or MPSearch Result: 
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Description: 

BACILLUS SUBTTLIS. QUINOL OXIDASE POLYPEPTIDE III (EC 1.9.3.-) (QUINOL 
OXIDASE codons3-600 t SUBUNIT QOXC). 

5 

SEQ ID NO: _380,38 1_ ENCODED BY SEQ ID NO: _966 

ORF# Start End 
10 

1 23 448 

2 525 965 

ORF translation from 23-448, direction F 
15 Blastp and/or MPSearch Result: 

Description: 

BACILLUS SUBTTLIS. DIPEPTTDE TRANSPORT SYSTEM PERMEASE PROTEIN 
DPPC. 

20 

ORF translation from 525-965, direction F 
Blastp and/or MPSearch Result: 

25 Description: 

BACILLUS SUBTTLIS. OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE 
PROTEIN OPPC. 



30 SEQ ID NO: _382_ ENCODED BY SEQ ID NO: _967 
ORF# Start End Direction Codon Length 



2 475 636 F 54codons 

35 

ORF translation from 475-636, direction F 

40 Blastp and/or MPSearch Result: 
Result: UNK 



Direction Codon Length 



F 142 codons 
F 147 codons 
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SEQ ID NO: _383_ ENCODED BY SEQ ID NO: _968 
5 ORF# Start End Direction Codon Length 



1 124 555 F 144codons 
ORF translation from 124-555, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _384_ ENCODED BY SEQ ID NO: _969 
ORF# Start End Direction Codon Length 
1 26 394 R 123 codons 



ORF translation from 26-394, direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

SALMONELLA TYPHIMURIUM. SULFITE REDUCTASE (NADPH) 
30 FLAVOPROTEIN ALPHA-COMPONENT (EC 1.8.1 .2) (SIR-FP). 



35 SEQ ID NO: _385_ ENCODED BY SEQ ID NO: _970 

ORF # Start End Direction Codon Length 



40 



1 24 455 F 144 codons 
ORF translation from 24-455, direction F 
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Blastp and/or MPSeaich Result 
Result: UNK 

5 SEQ ID NO: _386_ ENCODED BY SEQ ID NO: 971 

ORF# Start End Direction Codon Length 



1 347 697 F 117codons 

10 

ORF translation from 347-697, direction F 
Blastp and/or MPSearch Result: 

15 

Description: 

BACILLUS MEGATERIUM. ATP SYNTHASE ALPHA CHAIN (EC 3.6. 1 .34). 

20 

SEQ ID NO: _387_ ENCODED BY SEQ ID NO: _972 

ORF# Start End Direction Codon Length 
25 2 406 879 F lS8codons 

ORF translation from 406-879, direction F 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _388_ ENCODED BY SEQ ID NO: _973 

ORF# Start End Direction Codon Length 
40 

1 654 1361 F 236codons 
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ORF translation from 654-1361, direction F 
Blastp and/or MPSearch Result: 



Description: 

5 ACETYL-COENZYME A SYNTHETASE (EC 6.2. 1 . 1) (ACETATE-COA LIGASE) 
(ACYL- ACTIVATING ENZYME) (ACETYL-COA SYNTHASE). - BACILLUS 
SUBTILIS. 

10 

SEQ ID NO: _389,390_ ENCODED BY SEQ ID NO: 974 
ORF# Start End Direction Codon Length 



15 1 1 177 F 59codons 

2 197 568 F 124codons 



ORF translation from 1-177, direction F 

20 

Blastp and/or MPSearch Result: 
Description: 

25 ESCHERICHIA COLL NUCLEOSIDE PERMEASE NUPC (NUCLEOSIDE- 
TRANSPORT SYSTEM PROTEIN NUPC). 



ORF translation from 197-568, direction F 
30 Blastp and/or MPSearch Result: 

Description: 

BACILLUS SUBTILIS. PYRIMIDINE NUCLEOSIDE TRANSPORT PROTEIN. 

35 

SEQ ID NO: _391_ ENCODED BY SEQ ID NO: _975 

ORF# Start End Direction Codon Length 



40 2 285 452 F 56 codons 



ORF translation from 285-452, direction F 
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Blastp and/or MPSearch Result: 
5 Result: UNK 

SEQ ID NO: _392_ ENCODED BY SEQ ID NO: _976 

10 ORF # Start End Direction Codon Length 

1 276 521 F 82codons 

ORF translation from 276-521, direction F 
15 Blastp and/or MPSearch Result: 

Result: UNK 

20 SEQ ID NO: _393,394_ ENCODED BY SEQ ID NO: _977 

ORF# Start End Direction Codon Length 

1 97 444 F 116codons 
25 2 582 1010 F 143codons 

ORF translation from 97-444, direction F 

Blastp and/or MPSearch Result: 

30 

Result: UNK 

ORF translation from 582-1010, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 
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SEQ ID NO: _395,396,397_ ENCODED BY SEQ ID NO: _978 

ORF# Start End Direction Codon Length 



5 


1 


71 


235 


F 


55 codons 




2 


295 


477 


F 


61 codons 




3 


546 


647 


F 


34 codons 



ORF translation from 71-235, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

ORF translation from 295-477, direction F 

20 Blastp and/or MPSearch Result: 
Result: UNK 

25 ORF translation from 546-647, direction F 

Blastp and/or MPSearch Result 
30 Result: UNK 

SEQ ID NO: _398 ENCODED BY SEQ ID NO: _979. 

35 

ORF# Start End Direction Codon Length 
1 132 653 F 174 codons 

40 

ORF translation from 132-653, direction F 
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Blastp and/or MPSearch Result: 
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Description: 

CALDOCELLUM S ACCHAROLYTICUM . ACETYL ESTERASE (EC 3. 1 .-.-). 

5 

SEQ ID NO: _399 ENCODED BY SEQ ID NO: 980 
10 ORF# Start End Direction Codon Length 
1 211 381 F 57codons 

IS ORF translation from 21 1-381, direction F 

Blastp and/or MPSearch Result: 
20 Result: UNK 



SEQ ID NO: _400, 40 1_ ENCODED BY SEQ ID NO: 981 

25 

ORF# Start End Direction Codon Length 



1 1 294 F 98codons 

2 753 1070 F 106codons 

30 

ORF translation from 1-294, direction F 



35 Blastp and/or MPSearch Result: 
Description: 

BACILLUS STEAROTHERMOPHILUS. PYRUVATE DEHYDROGENASE El 
COMPONENT. ALPHA SUB UNIT (EC 1.2.4.1). 

40 

ORF translation from 753-1070, direction F 
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Blastp and/or MPSearch Result: 
Description: 

5 BACILLUS STEAROTHERMOPHILUS. PYRUVATE DEHYDROGENASE El 
COMPONENT, ALPHA SUBUNIT (EC 1.2.4.1). 



SEQ ID NO: _402_ ENCODED BY SEQ ID NO: _982 

10 

ORF# Start End Direction Codon Length 



1 173 586 F 138codons 

15 

ORF translation from 173-586, direction F 
Blastp and/or MPSearch Result: 
20 Description: 

ESCHERICHIA COLL PROLINE/BETAINE TRANSPORTER (PROLINE PORTER II) 
(PPH). 



25 

SEQ ID NO: _403,404_ ENCODED BY SEQ ID NO: _983. 

ORF# Start End Direction Codon Length 
30 

2 259 477 R 73codons 

3 803 1147 R 115codons 



35 ORF translation from 259-477, direction R 



Blastp and/or MPSearch Result: 

40 Description: 

BACILLUS SUBTILIS. INITIATION FACTOR IF-1. 
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ORF translation from 803-1 147, direction R 
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Blastp and/or MPSearch Result: 

5 

Description: 

BACILLUS SUBITLIS. ADENYLATE KINASE (EC 2.7.43) (ATP-AMP 
TRANSPHOSPHORYLASE). 

10 

SEQ ID NO: _405_ENCODED BY SEQ ID NO: _984 

ORF# Start End Direction Codon Length 
15 

1 125 436 F 104codons 



ORF translation from 125-436, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 



SEQ ID NO: _406_ENCODED BY SEQ ID NO: _985, 
30 ORF# Start End Direction Codon Length 



2 394 525 F 44 codons 



35 ORF translation from 394-525, direction F 



Blastp and/or MPSearch Result: 
40 Result: UNK 



SEQ ID NO: _407_ENCODED BY SEQ ID NO: _986. 
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ORF# Start End Direction Codon Length 



1 350 709 F 120codons 

5 

ORF translation from 350-709, direction F 
Blastp and/or MPSearch Result: 

10 

Description: 

RHODOBACTER CAPSULATUS (RHODOPSEUDOMONAS CAPSULATA). PETP 
PROTEIN. 

15 

SEQ ID NO: _408,409 _ENCODED BY SEQ ID NO: _987 

ORF# Start End Direction Codon Length 
20 

1 657 770 F 38codons 

2 800 928 F 43codons 



25 ORF translation from 657-770, direction F 



Blastp and/or MPSearch Result: 
30 Result- UNK 



ORF translation from 800-928, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _410 ENCODED BY SEQ ID NO: _988. 

ORF # Start End Direction Codon Length 
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i 98 562 F ISScodons 
5 ORF translation from 98-562, direction F 

Blastp and/or MPSearch Result: 

10 Description: 

CbrB protein - Erwinia chrysanthemi 

15 SEQ ID NO: _41 1 ENCODED BY SEQ ID NO: _989_ 

ORF # Start End Direction Codon Length 
1 125 271 F 49codons 

20 

ORF translation from 125-271, direction F 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _412 ENCODED BY SEQ ID NO: _990 

ORF # Start End Direction Codon Length 
35 1 52 354 R 101 codons 

ORF translation from 52-354, direction R 

40 

Blastp and/or MPSearch Result: 
Description: 
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BACILLUS SUBTILIS. PHOSPHORIBOSYLGLYCIN AMIDE 
FORMYLTRANSFERASE (EC 2.1 .12) (GART) (GAR TRANSFORMYLASE) (5 - 
PHOSPHORIBOS YLGLYCINAMIDE TRANSFORMYLASE). 

5 

SEQ ID NO: 4 1 3 _ENCODED BY SEQ ID NO: 991 

ORF# Start End Direction Codon Length 

10 

2 105 527 F 141codons 

ORF translation from 105-527, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTILIS. ATP-DEPENDENT NUCLEASE SUB UNIT B. 

20 

SEQ ID NO: _414 ENCODED BY SEQ ID NO: _992 

ORF# Start End Direction Codon Length 
25 

2 489 668 F 60codons 



ORF translation from 489-668, direction F 
30 Blastp and/or MPSearch Result: 

Result: UNK 



35 SEQ ID NO: _415_ENCODED BY SEQ ID NO: _993 
ORF# Start End Direction Codon Length 



1 193 570 R 126codons 

40 

ORF translation from 193-570, direction R 
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Blastp and/or MPSearch Result: 

S Description: 

BACILLUS SUB1TLIS. COMA OPERON PROTEIN 2. 



SEQ ID NO: _416 _ENCODED BY SEQ ID NO: _994 

10 

ORF # Start End Direction Codon Length 



1 302 433 F 44codons 

15 

ORF translation from 302-433, direction F 

Blastp and/or MPSearch Result: 

20 Description: 

ALCALIGENES EUTROPHUS. MEMBRANE PROTEIN. 



SEQ ID NO: _417 ENCODED BY SEQ ID NO: _995 

25 

ORF # Start End Direction Codon Length 



2 569 706 F 46 codons 

30 

ORF translation from 569-706, direction F 



Blastp and/or MPSearch Result: 

35 

Result: UNK 



40 

SEQ ID NO: _41 8 _ENCODED BY SEQ ID NO: _996 

ORF# Start End Direction Codon Length 

171 



WO 97/30070 PCT/US97/02318 

1 248 541 F 98codons 

5 ORF translation from 248-54 1 , direction F 

Blastp and/or MPSearch Result: 
10 Result: UNK 

SEQ ID NO: _419 _ENCODED BY SEQ ID NO: _997 

15 

ORF# Start End Direction Codon Length 

2 113 481 F 123codons 

20 

ORF translation from 1 13-481, direction F 
Blastp and/or MPSearch Result: 

25 

Result: UNK 

30 

SEQ ID NO: _420 _ENCODED BY SEQ ID NO: _998 

ORF# Start End Direction Codon Length 
35 1 347 586 F 80codons 

ORF translation from 347-586, direction F 
40 Blastp and/or MPSearch Result: 
Result: UNK 
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SEQ ID NO: _421_ENCODED BY SEQ ID NO: _999 
5 ORF# Start End Direction Codon Length 



1 75 521 F 149codons 



10 ORF translation from 75-521 , direction F 
Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _422 _ENCODED BY SEQ ID NO: _1000 
ORF# Start End Direction Codon Length 
1 192 374 R 61codons 



ORF translation from 192-374, direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

30 LEGIONELLA PNEUMOPHILA. ACONITATE HYDRATASE (EC 4.2.1.3) (CITRATE 
HYDRO-LYASE) (ACONITASE) (MAJOR IRON-CONTAINING PROTEIN) (MICP) 
(IP2I0). 



35 

SEQ ID NO: _423 _ENCODED BY SEQ ID NO: _1001 
ORF # Start End Direction Codon Length 



40 1 149 499 F 117codons 



ORF translation from 149-499, direction F 
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Blastp and/or MPSearch Result: 
5 Result: UNK 

SEQ ID NO: _424 _ENCODED BY SEQ ID NO: _1002 

10 

ORF # Start End Direction Codon Length 
1 79 519 R 147codons 
15 ORF translation from 79-519, direction R 
Blastp and/or MPSearch Result: 
Description: 

20 pyruvate synthase (EC 1 .2.7. 1 ) - Halobacterium halobium 

SEQ ID NO: _425 _ENCODED BY SEQ ID NO: _1003. 
25 ORF # Start End Direction Codon Length 
1 168 476 F I03codons 

30 ORF translation from 168-476, direction F 

Blastp and/or MPSearch Result: 

Description: 
35 ATP/GTP-bmding protein 

SEQ ID NO: _426 ENCODED BY SEQ ED NO: _1004 

40 

ORF# Start End Direction Codon Length 
1 265 843 R 193codons 
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ORF translation from 265-843, direction R 

5 

Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTILIS. ATP SYNTHASE BETA CHAIN (EC 3.6. 1 .34). 

10 

SEQ ID NO: _427 _ENCODED BY SEQ ID NO: _1005 

ORF # Start End Direction Codon Length 



15 1 83 301 F 73codons 



ORF translation from 83-301, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _428 ENCODED BY SEQ ID NO: _1006 

ORF# Start End Direction Codon Length 

30 

2 1014 1160 F 49codons 



ORF translation from 1014-1 160. direction F 

35 

Blastp and/or MPSearch Result: 

Description: 
40 scrA protein - Staphylococcus xylosus 
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SEQ ID NO: _429_ENCODED BY SEQ ID NO: _I007 

ORF# Start End Direction Codon Length 
5 2 718 894 R 59codons 

ORF translation from 71 8-894, direction R 
1 0 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL POTASSIUM-TRANSPORTING ATPASE (EC 3.6.1.36), A 
CHAIN (ATP PHOSPHOHYDROLASE (POTASSIUM-TRANSPORTING), A CHAIN) 

15 

SEQ ID NO: _430 ENCODED BY SEQ ID NO: _1008 

ORF# Start End Direction Codon Length 
20 

1 467 667 F 67 codons 
ORF translation from 467-667, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _431 _ENCODED BY SEQ ID NO: 1009 
ORF# Start End Direction Codon Length 
35 I 102 347 F 82 codons 

ORF translation from 102-347, direction F 
40 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL LARGE CONDUCTANCE MECHANOSENSITIVE CHANNEL. 
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SEQ ID NO: _432_ENCODED BY SEQ ID NO: _I010 

S ORF# Start End Direction Codon Length 

1 411 542 F 44 codons 

ORF translation from 41 1-542, direction F 
1 0 Blastp and/or MPSeaich Result: 

Description: 

ESCHERICHIA COLL MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A. 

15 

SEQ ID NO: _433_ENCODED BY SEQ ID NO: 101 1 

20 

ORF# Start End Direction Codon Length 

1 109 399 F 97 codons 

25 

ORF translation from 109-399, direction F 
Blastp and/or MPSearch Result. 

30 

Result: UNK 

35 SEQ ID NO: _434_ENCODED BY SEQ ID NO: _1012 

ORF# Start End Direction Codon Length 

2 411 845 R 145 codons 

40 

ORF translation from 4 1 1 -845, direction R 

177 



WO 97/30070 

Blastp and/or MPSeaich Result: 
Description: 

5 Aminoacyl-transfer RNA synthetase class-D 

SEQ ID NO: _435_ENCODED BY SEQ ID NO: 
10 ORF# Start End Direction Codon Length 
2 333 506 R 58 codons 

1 5 ORF translation from 333-506, direction R 

Blastp and/or MPSearch Result: 
20 Result: UNK 

SEQ ID NO: _436_ENCODED BY SEQ ID NO: 

25 

ORF# Start End Direction Codon Length 
2 502 849 F 116 codons 
30 ORF translation from 502-849, direction F 

Blastp and/or MPSearch Result: 
35 Result: UNK 

SEQ ID NO: _437_ENCODED BY SEQ ID NO. 

40 

ORF# Start End Direction Codon Length 
1 92 391 F 100 codons 
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5 



10 



20 



30 



ORF translation from 92-391, direction F 

Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _438_ENCODED BY SEQ ID NO: _1016_ 
ORF# Start End Direction Codon Length 



15 1 45 305 R 87codons 



ORF translation from 45-305, direction R 



Blastp and/or MPSearch Result: 



Description: 

KLEBSIELLA AEROGENES. FORMIMINOGLUTAMASE (EC 3.5.3.8) 
25 (FORMIMINOGLUTAMATE HYDROLASE) (fflSTTDINE UTILIZATION PROTEIN G) 
(FRAGMENT). 



SEQ ID NO: _439_ENCODED BY SEQ ID NO: _1017_ 
ORF# Start End Direction Codon Length 



1 100 372 R 91codons 

35 

ORF translation from 100-372, direction R 

Blastp and/or MPSearch Result: 

40 Description: 

BACILLUS SUBTILIS. PEPTIDE SYNTHETASE 1 . 
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SEQ ID NO: _440_ENCODED BY SEQ ID NO: 1018 

ORF# Start End Direction Codon Length 
5 1 253 537 F 95codons 

ORF translation from 253-537, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 SEQ ID NO: _441_ENCODED BY SEQ ID NO: _J019 

ORF# Start End Direction Codon Length 
1 474 692 R 73codons 

20 

ORF translation from 474-692, direction R 
Blastp and/or MPSearch Result 

25 

Description: 
ABC Transporter 

30 SEQ ID NO: _442_ENCODED BY SEQ ID NO: 1020 
ORF# Start End Direction Codon Length 
1 234 401 R 56codons 

35 

ORF translation from 234-401, direction R 
Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: _443.444_ENCODED BY SEQ ID NO: _1021 

ORF# Start End Direction Codon Length 

5 1 20 244 R 75codons 

2 303 584 R 94 codons 

ORF translation from 20-244, direction R 

10 

BJastp and/or MPSearch Result: 
Result: UNK 

15 

ORF translation from 303-584, direction R 

20 Blastp and/or MPSearch Result: 
Result: UNK 

25 SEQ ID NO: _445_ENCODED BY SEQ ID NO: 1022 
ORF# Start End Direction Codon Length 
1 98 232 R 45 codons 

30 

ORF translation from 98-232, direction R 

35 Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL ALKYL HYDROPEROXIDE REDUCTASE F52A PROTEIN 
(EC 1.6.4.-) (FRAGMENT). 

40 

SEQ ID NO: _446_ENCODED BY SEQ ID NO: _1023 
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ORF# Start End Direction Codon Length 



2 301 756 R 152codons 

5 

ORF translation from 301-756. direction R 
Blastp and/or MPSearch Result: 

10 

Description: 

BACILLUS STEAROTHERMOPH1LUS. MALATE OXIDOREDUCTASE (NAD) (EC 
1.1.1 .38) (MALIC ENZYME). 

15 



SEQ ID NO: _447_ENCODED BY SEQ ID NO: _1024 

ORF# Start End Direction Codon Length 

20 

2 351 578 R 76codons 



ORF translation from 35 1-578, direction R 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 



SEQ ID NO: _448_ENCODED BY SEQ ID NO: _1025. 

ORF# Start End Direction Codon Length 
35 

1 233 475 R 81 codons 



ORF translation from 233-475, direction R 

40 



Blastp and/or MPSearch Result: 
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5 SEQ ID NO: _449_ENCODED BY SEQ ID NO: 1026 
ORF# Start End Direction Codon Length 
1 162 437 F 92codons 

10 

ORF translation from 162-437, direction F 

1 5 Blastp and/or MPSearch Result: 
Result: UNK 

20 SEQ ID NO: _450_ENCODED BY SEQ ID NO: _1027 

ORF# Start fed Direction Codon Length 
1 274 447 F 58codons 

25 

ORF translation from 274-447, direction F 
Blastp and/or MPSearch Result: 

30 

Description: 

ESCHERICHIA COLL FERRIC ENTEROBACITN TRANSPORT PROTEIN FEPD. 

35 

45 l_ENCODED BY SEQ ID NO: _1028 

End Direction Codon Length 
138 F 46codons 



SEQ ID NO: _ 



40 



ORF# Start 



1 1 
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Blastp and/or MPSearch Result: 

5 

Result: UNK 

SEQ ID NO: _452_ENCODED BY SEQ ID NO: _1029 
10 ORF# Start End Direction Codon Length 



I 9 575 R 189codons 



1 5 ORF translation from 9-575, direction R 
Blastp and/or MPSearch Result: 

Description: 

ASCOBOLUS IMMERSUS. HOMOSERINE O-ACETYLTRANSFERASE (EC 2.3.1.31) 
20 (HOMOSERINE O-TRANS- ACETYLASE). 



SEQ ID NO: _453_ENCODED BY SEQ ID NO: _1030. 

25 

ORF# Start End Direction Codon Length 



1 117 512 F 132codons 

30 

ORF translation from 1 17-512, direction F 
Blastp and/or MPSearch Result: 
35 Result: UNK 



SEQ ID NO: _454_ENCODED BY SEQ ID NO: 1031 

40 

ORF # Start End Direction Codon Length 



1 115 225 F 37codons 
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ORF translation from 1 15-225, direction F 

5 

Blastp and/or MPSearch Result: 
Result: UNK 

10 

SEQIDNO:_ 
ORF# Start 
15 1 101 

ORF translation from 101-202. direction F 
20 Blastp and/or MPSearch Result: 
Result: UNK 

25 SEQ ID NO: _456_ENCODED BY SEQ ID NO: _1033 

ORF# Start End Direction Codon Length 
1 79 294 R 72codons 

30 

ORF translation from 79-294, direction R 

35 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTILIS. GLYCEROL-3-PHOSPHATASE TRANSPORTER (G-3-P 
TRANSPORTER) (G-3-P PERMEASE). 

40 

SEQ ID NO: _457_ENCODED BY SEQ ID NO: 1034 
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ORF# Start End 



Direction Codon Length 



156 



344 



R 63 codons 



5 



ORF translation from 156-344, direction R 



10 



Blastp and/or MPSearch Result: 



Result: UNK 



15 



SEQ ID NO: _458_ENCODED BY SEQ ID NO: _1035. 

ORF# Start End Direction Codon Length 

20 

2 75 281 R 69 codons 



ORF translation from 75-28 U direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

30 BACILLUS SUBTHJS. GLUCONATE PERMEASE. 



SEQ ID NO: _459_ENCODED BY SEQ ID NO: _1036 

35 

ORF# Start End Direction Codon Length 
1 87 248 R 54 codons 

40 

ORF translation from 87-248, direction R 
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Blastp and/or MPSearch Result: 
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Description: 

BACILLUS SUBTILIS. GLUTAMYL-TRNA REDUCTASE (EC L2.1.-) (GLUTR). 

5 

SEQ ID NO: _460_ENCODED BY SEQ ID NO: _1037 

10 ORF# Start End Direction Codon Length 

1 35 160 R 42codons 

1 5 ORF translation from 35- 1 60, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _46I_ENCODED BY SEQ ID NO: _1038 

ORF# Start End Direction Codon Length 
25 

2 390 494 F 35 codons 

ORF translation from 390-494, direction F 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _462_ENCODED BY SEQ ID NO: _1039 

40 ORF# Start End Direction Codon Length 
1 240 374 F 45 codons 
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ORF translation from 240-374, direction F 

5 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTTLIS. ARGINASE (EC 3.5.3.1). 

10 

SEQ ID NO: _463_ENCODED BY SEQ ID NO: 1040 
15 ORF# Start End Direction Codon Length 

1 65 385 F 107codons 

20 ORF translation from 65-385, direction F 
Blastp and/or MPSearch Result: 
Description: 

25 BACILLUS STEAROTHERMOPHILUS. CARBOXYLESTERASE PRECURSOR (EC 
3.1.1.1). 

SEQ ID NO: _464 ENCODED BY SEQ ID NO: 1041 

30 ORF# Start End Direction Codon Length 

2 309 557 F 83 codons 

35 ORF translation from 309-557, direction F 

Blastp and/or MPSearch Result: 

40 Description: 

ABC transporter 
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SEQ ID NO: _465_ENCODED BY SEQ ID NO: _1042 

ORF# Start End Direction Codon Length 



10 



1 487 660 F 58codons 



ORF translation from 487-660, direction F 



Blastp and/or MPSearch Result 



Description: 

THERMOTOGA MARITIMA. ANTHRANILATE SYNTHASE COMPONENT II (EC 
15 4.13.27) (CONTAINS: GLUTAMINE AMIDOTRANSFERASE; ANTHRANILATE 
PHOSPHORIBOSYLTRANSFERASE (EC 2.4.2.18)). 



20 SEQ ID NO: _466_ENCODED BY SEQ ID NO: _1043. 
ORF# Start End Direction Codon Length 



1 79 303 R 75 codons 

25 

ORF translation from 79-303, direction R 

30 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTJLIS. DNA TOPOISOMERASE I (EC 5.99.1.2) (OMEGA-PROTEIN) 
(RELAXING ENZYME) (UNTWISTING ENZYME) (SWIVELASE). 

35 

SEQ ID NO: _467_ENCODED BY SEQ ID NO: _1044 

ORF# Start End Direction Codon Length 

40 

I 87 311 F 75 codons 
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ORF translation from 87-3 1 1, direction F 



PCTAIS97/02318 



Blastp and/or MPSearch Result: 

5 

Description: 

ESCHERICHIA COLL FERRIC ENTEROBACTIN TRANSPORT ATP-BINDING 
PROTEIN FEPC. 

10 

SEQ ID NO: _468_ENCODED BY SEQ ID NO: _1045 

ORF# Start End Direction Codon Length 

15 

2 190 435 F 82codons 



ORF translation from 190-435, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _469_ENCODED BY SEQ ID NO: _1046 

ORF# Start End Direction Codon Length 
30 

1 65 664 F 200 codons 



ORF translation from 65-664, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _470_ENCODED BY SEQ ED NO: _1047. 

190 
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ORF# Start End Direction Codon Length 

1 287 484 F 66 codons 

5 

ORF translation from 287-484, direction F 
Blastp and/or MPSearch Result: 

10 

Result: UNK 

15 SEQ ID NO: _471_ENCODED BY SEQ ID NO: _1048 
ORF# Start End Direction Codon Length 

1 36 416 R 127 codons 

20 

ORF translation from 36-416, direction R 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _472 > 473_ENCODED BY SEQ ID NO: _1049 
ORF # Start End Direction Codon Length 

35 1 178 312 F 45 codons 

2 359 619 F 87 codons 

ORF translation from 178-312, direction F 

40 

Blastp and/or MPSearch Result: 
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Result: UNK 
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5 



ORF translation from 359-619, direction F 

Blastp and/or MPSearch Result: 
Resuk: UNK 



10 



SEQ ID NO: _474_ENCODED BY SEQ ID NO: _1050 

15 ORF# Start End Direction Codon Length 



1 211 486 R 92codons 



20 ORF translation from 21 1 -486, direction R 
Blastp and/or MPSearch Result: 

Result: UNK 



25 



30 



SEQ ID NO: _475,476_ENCODED BY SEQ ID NO: _1051_ 
ORF# Start End Direction Codon Length 



35 



40 



2 131 340 F 70codons 

3 525 626 F 34codons 

ORF translation from 131-340, direction F 



Blastp and/or MPSearch Result: 
Result: UNK 

ORF translation from 525-626, direction F 
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Blastp and/or MPSearch Result: 
Result: UNK 



PCT7US97/02318 



SEQ ID NO: _477_ENCODED BY SEQ ID NO: _1052_ 
10 ORF # Start End Direction Codon Length 



25 



30 



35 



40 



1 187 474 F 96codons 



1 5 ORF translation from 1 87-474, direction F 



Blastp and/or MPSearch Result: 

20 Description: 

ESCHERICHIA COLL OSMOTICALLY INDUCIBLE PROTEIN C. 



SEQ ID NO: _478_ENCODED BY SEQ ID NO: _1053_ 
ORF # Start End Direction Codon Length 



1 107 520 F 138codons 

ORF translation from 107-520, direction F 

Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _479_ENCODED BY SEQ ID NO: _1054. 
ORF # Start End Direction Codon Length 



I 90 377 F 96codons 
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ORF translation from 90-377, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

Helix-Turn-Helix containing protein 

10 

SEQ ID NO: _480_ENCODED BY SEQ ID NO: _1055 

ORF# Start End Direction Codon Length 
15 1 167 475 R 103codons 

ORF translation from 167-475, direction R 

20 

Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTTUS. 2,3-BISPHOSPHOGLYCERATE-INDEPENDENT 
25 PHOSPHOGLYCERATE MUTASE (EC 5.4.2. 1) (PHOSPHOGLYCEROMUTASE) 
(BPG-INDEPENDENT PGAM). 

SEQ ID NO: _481 ENCODED BY SEQ ID NO: _I056 

30 ORF# Start End Direction Codon Length 



1 50 268 F 73 codons 
35 ORF translation from 50-268, direction F 



Blastp and/or MPSearch Result: 
40 Result: UNK 
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SEQ ID NO: _482 ENCODED BY SEQ ID NO: .1057 

ORF# Start End Direction Codon Length 



30 



35 



40 



5 2 260 418 R 53codons 

ORF translation from 260-418, direction R 
10 Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: .483 ENCODED BY SEQ ID NO: _1058_ 

ORF# Start End Direction Codon Length 



2 214 441 R 76codons 
ORF translation from 214-441, direction R 
25 Blastp and/or MPSearch Result: 
Result: UNK 



SEQ ID NO: 48 4 ENCODED BY SEQ ID NO: _1059_ 
ORF# Start End Direction Codon Length 



1 40 273 R 78 codons 
ORF translation from 40-273, direction R 
Blastp and/or MPSearch Result: 
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SEQ ID NO: _485 ENCODED BY SEQ ID NO: _1060 

5 

ORF # Start End Direction Codon Length 
2 676 1125 R 150codons 
10 ORF translation from 676-1 125, direction R 
Blastp and/or MPSearch Result: 
Description: 

15 HAEMOPHILUS INFLUENZAE. LIPOPROTEIN E PRECURSOR (OUTER 
MEMBRANE PROTEIN P4) (OMP P4). 

SEQ ID NO: .486,487 ENCODED BY SEQ ID NO: _1061 

20 ORF # Start End Direction Codon Length 



2 1018 1344 F I09codons 

3 1498 1647 R 50codons 

25 ORF translation from 1018-1344, direction F 
Blastp and/or MPSearch Result: 
Description: 

30 BACILLUS SUBTHJS. TRYPTOPHANYL TRNA SYNTHETASE (EC 6.1.1.2) 
(TRYPTOPHAN-TRNA LIGASE) (TRPRS). 



ORF translation from 1498-1647, direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

40 LACTOCOCCUS LACTIS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). 
OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN OPPC. 
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SEQ ID NO: .488,489,490 ENCODED BY SEQ ID NO: _1062 

ORF# Start End Direction Codon Length 



1 


415 


552 


F 


46 codons 


2 


964 


1116 


F 


51 codons 


3 


1132 


1401 


F 


90 codons 



10 

ORF translation from 415-552, direction F 



BLastp and/or MPSearch Result: 

15 

Result: UNK 



ORF translation from 964-1 1 16, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

ORF translation from 1 132-1401, direction F 



30 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTHJS. D-ALANINE-ACTIVATING ENZYME (EC 6.3.2. ) (DAE) (D- 
ALANINE^D-ALANYL CARRIER PROTEIN LIGASE) (DCL). 

35 

SEQ ID NO: .491 ENCODED BY SEQ ID NO: 1063 

ORF# Start End Direction Codon Length 



40 2 393 1070 F 226 codons 



ORF translation from 393-1070, direction F 
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Blastp and/or MPSearch Result: 

5 Description: 

ESCHERICHIA COLL VACB PROTEIN. 

SEQ ID NO: .492 ENCODED BY SEQ ID NO: 1064 

10 

ORF # Start End Direction Codon Length 
2 283 846 R 188 codons 

15 

ORF translation from 283-846, direction R 

Blastp and/or MPSearch Result: 

20 Description: 

BACILLUS SUBTILIS. PEPTIDE SYNTHETASE 1. 

SEQ ID NO: .493,494,495 ENCODED BY SEQ ID NO: J 065 

25 

ORF# Start End Direction Codon Length 

1 1 321 F 107 codons 

2 698 814 F 39 codons 
30 3 1690 2166 F 159 codons 

ORF translation from 1-321, direction F 
35 Blastp and/or MPSearch Result: 
Result: UNK 

40 ORF translation from 698-814, direction F 

Blastp and/or MPSearch Result: 
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Result: UNK 
5 ORF translation from 1690-2166, direction F 

Blastp and/or MPSearch Result: 

10 Description: 

BACILLUS SUBTTLIS. BMRU PROTEIN. 

SEQ ID NO: ,496 ENCODED BY SEQ ID NO: 1066 

15 ORF# Start End Direction Codon Length 

2 785 1003 R 73 codons 

20 ORF translation from 785-1 003, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: .497 ENCODED BY SEQ ID NO: .1067 

ORF # Start End Direction Codon Length 
30 1 459 755 F 99 codons 

ORF translation from 459-755, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: 498.49 9 ENCODED BY SEQ ID NO: _1068 

ORF# Start End Direction Codon Length 
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1 186 320 F 45codons 
3 1135 1251 F 39codons 

5 

ORF translation from 186-320, direction F 



Blastp and/or MPSearch Result: 

10 

Result: UNK 



ORF translation from 1 135-1251, direction F 

15 

Blastp and/or MPSearch Result: 

Description: 
20 peptidase-like protein 

SEQ ID NO: .500 ENCODED BY SEQ ID NO. .1069. 

25 ORF# Start End Direction Codon Length 



1 437 646 R 70 codons 



30 ORF translation from 437-646, direction R 



Blastp and/or MPSearch Result: 
35 Description: 

ENTEROCOCCUS FAECALIS (STREPTOCOCCUS FAECALIS). CELL WALL 
ENZYME EBSB. 



40 

SEQ ID NO: _501 ENCODED BY SEQ ID NO: .1070. 

ORF# Start End Direction Codon Length 
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1 68 259 F 64codons 

5 ORF translation from 68-259, direction F 
Blastp and/or MPSearch Result: 
Description: 

10 BACILLUS SUBTTUS. AMPS PROTEIN (FRAGMENT). 



SEQ ID NO: .502 ENCODED BY SEQ ID NO: _1071 

15 

ORF # Start End Direction Codon Length 



1 1199 1648 R 150codons 

20 

ORF translation from 1 199-1648, direction R 
Blastp and/or MPSearch Result: 

25 

Description: 

BACILLUS SUBTILIS. BMRU PROTEIN. 

30 SEQ ID NO: _503,504 ENCODED BY SEQ ID NO: _1072 

ORF# Start End Direction Codon Length 



2 701 970 F 90codons 
35 3 1941 2075 R 45codons 

ORF translation from 701-970, direction F 
40 Blastp and/or MPSearch Result: 
Result: UNK 
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ORF translation from 1941-2075, direction R 

5 Blastp and/or MPSearch Result: 
Description: 

ATP/GTP-binding protein 

10 

SEQ ID NO: .505 ENCODED BY SEQ ID NO: _1073 

15 ORF# Start End Direction Codon Length 

1 185 928 R 248codons 

20 ORF translation from 1 85-928, direction R 
Blastp and/or MPSearch Result: 

Description: 

nerve growth factor {promoter} [human, Genomic, 486 nt]. ptsG protein - Staphylococcus 
25 carnosus 

SEQ ID NO: _506 ENCODED BY SEQ ID NO: 1074 

30 ORF# Start End Direction Codon Length 

2 323 1648 R 442 codons 

35 ORF translation from 323-1648, direction R 
Blastp and/or MPSearch Result: 
Description: 

40 STAPHYLOCOCCUS AUREUS. MERCURIC REDUCTASE (EC 1.16.1.1) (HG(D) 
REDUCTASE). 
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SEQ ID NO: _507 ENCODED BY SEQ ID NO: _1075 

ORF# Start End Direction Codon Length 
1 627 1061 F 145codons 



ORF translation from 627-1061, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _508,509.5 10,51 1 ENCODED BY SEQ ID NO: _1076 

ORF# Start End Direction Codon Length 
20 



1 


204 


344 


F 


47 codons 


2 


491 


808 


F 


106 codons 


3 


961 


1227 


F 


89 codons 


5 


2808 


3092 


F 


95 codons 



25 

ORF translation from 204-344, direction F 

30 Blastp and/or MPSearch Result: 
Description: 

BACILLUS SUBTHJS. SUCCINATE DEHYDROGENASE FLAVOPROTEIN 
SUBUNTT(EC 1.3.99.1). 

35 

ORF translation from 491-808, direction F 

40 Blastp and/or MPSearch Result: 
Description: 



203 



WO 97/30070 PCT/US97/02318 

BACILLUS SUBTILIS. SUCCINATE DEHYDROGENASE FLAVOPROTEIN 
SUBUNTT (EC 1 .3.99. 1 ). 

5 ORF translation from 96 1 - 1 227, direction F 

Blastp and/or MPSearch Result: 
10 Description: 

BACILLUS SUBTILIS. SUCCINATE DEHYDROGENASE FLAVOPROTEIN 
SUBUNTT (EC 1.3.99.1). 

15 ORF translation from 2808-3092, direction F 
Blastp and/or MPSearch Result: 
Description: 

20 hydroxyneurosporene dehydrogenase - Myxococcus xanthus 

SEQ ID NO: _5 12,513,514,5 15,5 16. ENCODED BY SEQ ID NO:_1077 

25 ORF# Start Bid Direction Codon Length 



1 


257 


934 


F 


226 codons 


2 


945 


1280 


F 


112 codons 


3 


1610 


2362 


F 


251 codons 


5 


3173 


3364 


F 


64 codons 


6 


3380 


3559 


F 


60 codons 



ORF translation from 257-934, direction F 

35 

Blastp and/or MPSearch Result: 
Description: 

BACILLUS SP. (STRAIN C-125). PREPROTEIN TRANSLOCASE SECY SUBUNIT. 

40 

ORF translation from 945-1280, direction F 
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Blastp and/or MPSearch Result: 
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Result: UNK 

5 

ORF translation from 1610-2362. direction F 
Blastp and/or MPSearch Result: 

Result: UNK 

10 

ORF translation from 3173-3364, direction F 



1 5 Blastp and/or MPSearch Result: 
Result: UNK 



20 ORF translation from 3380-3559, direction F 



Blastp and/or MPSearch Result: 
25 Result: UNK 



SEQ ID NO: .517,518 ENCODED BY SEQ ID NO: .1078 

30 

ORF# Start End Direction Codon Length 



3 1310 2068 F 253 codons 

4 2110 2355 F 82 codons 

35 

ORF translation from 1310-2068, direction F 
Blastp and/or MPSearch Result: 

Description: 

40 STAPHYLOCOCCUS AUREUS. CAPM PROTEIN. 



ORF translation from 21 10-2355, direction F 
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Blastp and/or MPSearch Result: 
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Result: UNK 

5 

SEQ ID NO: .5 1 9 ENCODED BY SEQ ID NO: _1079 

ORF# Start End Direction Codon Length 

1 92 484 F BIcodons 
ORF translation from 92-484, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

Rhizobium leguminosarum pss4 gene encoding a protein involved in 

20 

SEQ ID NO: .520 ENCODED BY SEQ ID NO: _1080 

25 ORF# Start End Direction Codon Length 

2 866 1414 F 183codons 

30 ORF translation from 866-1414, direction F 

Blastp and/or MPSearch Result: 
35 Description: 

ESCHERICHIA COLL GALACITTOL- 1 -PHOSPHATE DEHYDROGENASE (EC 
1.1.1.-). 

40 

SEQ ID NO: _521 ENCODED BY SEQ ID NO: _1081 

ORF# Start End Direction Codon Length 
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1 642 1322 R 227codons 



5 ORF translation from 642- 1 322, direction R 



Blastp and/or MPSearch Result: 

10 Description: 

BACILLUS SUBTIUS. EXCINUCLEASE ABC SUBUNIT C. 



15 

SEQ ID NO: _522,523,524,525 ENCODED BY SEQ ID NO: J082. 

Start End Direction Codon Length 



144 866 R 241codons 

1267 1857 R 197codons 

1978 2250 R 91 codons 

2561 2794 R 78 codons 

25 

ORF translation from 144-866, direction R 
Blastp and/or MPSearch Result: 
30 Description: 

ESCHERICHIA COLL RESPIRATORY NITRATE REDUCTASE 2 ALPHA CHAIN 
fEC 1.7.99.4). 

ORF translation from 1267-1857. direction R 

35 

Blastp and/or MPSearch Result: 
Description: 

40 BACILLUS MEGATERIUM. UROPORPHYRIN-III C-METHYLTRANSFERASE (EC 
2.1.1.107) (UROGEN ffl METHYLASE) (SUMT) (UROPORPHYRINOGEN UI 
METHYLASE) (UROM). 



ORF# 



20 



1 
2 
3 
5 
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ORF translation from 1978-2250, direction R 
Blastp and/or MPSearch Result: 

5 

Description: 

BACILLUS MEGATERIUM. UROPORPHYRINS C-METHYLTRANSFERASE (EC 
2.1.1.107) (UROGEN ffl METHYLASE) (SUMT) (UROPORPHYRINOGEN in 
METHYLASE) (UROM). 

10 

ORF translation from 2561-2794, direction R 

15 Blastp and/or MPSearch Result: 
Description: 

BACTRPC2 NCBI gi: 710- Bacillus subtilis (strain 168TrpC2, ) DNA. 

20 

SEQ ID NO: .526,527 ENCODED BY SEQ ID NO: _1083 

25 ORF* Start End Direction Codon Length 

1 384 596 R 71 codons 

2 626 970 R 115 codons 

30 ORF translation from 384-596, direction R 

Blastp and/or MPSearch Result: 
35 Result UNK 

ORF translation from 626-970, direction R 
40 Blastp and/or MPSearch Result: 
Description: 
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BACILLUS SUBTILIS. OROTATE PHOSPHORIBOSYLTRANSFERASE (EC 2.4.2.10) 
(OPRT). 



5 



SEQ ID NO: _528 ENCODED BY SEQ ID NO: _J084_ 

ORF# Start End Direction Codon Length 



1 24 893 R 290codons 

10 

ORF translation from 24-893, direction R 
Blastp and/or MPSearch Result: 
IS Description: 

ESCHERICHIA COLL ALCOHOL DEHYDROGENASE (EC 1.1.1.1) (ADH) / 
ACETALDEHYDE DEHYDROGENASE (ACETYLATING) (EC 1.2.1.10) (ACDH) / 
PYRUVATE-FORMATE-LYASE DEACTTVASE (PEL DEACTTVASE). 



20 



SEQ ID NO: _529 ENCODED BY SEQ ID NO: _1085_ 

ORF# Start End Direction Codon Length 



25 5 2558 3331 R 258codons 

ORF translation from 2558-3331, direction R 
Blastp and/or MPSearch Result: 



30 



35 



40 



Description: 

STAPHYLOCOCCUS STAPHYLOLYTICUS . LYSOSTAPHIN PRECURSOR (EC 

3.5.1.-). 



SEQ ID NO: .530 ENCODED BY SEQ ID NO: .1086. 

ORF# Start End Direction Codon Length 



2 280 1095 F 272codons 
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ORF translation from 280-1095, direction F 
Blastp and/or MPSearch Result: 
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Description: 

5 ESCHERICHIA COLL RESPIRATORY NITRATE REDUCTASE 1 ALPHA CHAIN 
(EC 1.7.99.4). 



SEQ ID NO: .531 ENCODED BY SEQ ID NO: _1087 

10 

ORF# Start End Direction Codon Length 



30 



3 701 1195 R 165codons 

15 

ORF translation from 701-1 195, direction R 
Blastp and/or MPSearch Result: 
20 Description: 

LACTOCOCCUS LACTTS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). 3- 
ISOPROPYLMALATE DEHYDRATASE (EC 4.2. 1 .33) (ISOPROPYLMALATE 
ISOMERASE) (ALPHA-IPM ISOMERASE). 

25 

SEQ ID NO: __532 ENCODED BY SEQ ID NO: J 088 

ORF # Start End Direction Codon Length 



1 16 288 F 91codons 
ORF translation from 16-288, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 
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SEQ ID NO: _533 ENCODED BY SEQ ID NO: _I089. 

ORF# Start End Direction Codon Length 
1 302 508 R 69codons 



ORF translation from 302-508. direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 



SEQ ID NO: _534 ENCODED BY SEQ ID NO: _1090 

20 ORF# Start End Direction Codon Length 



1 182 505 F KBcodons 



25 ORF translation from 1 82-505, direction F 



Blastp and/or MPSearch Result: 
30 Description: 

BACILLUS SUBULIS. PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN (EC 
6-1-1-20) (PHENYLALANINE- TRNA LIGASE BETA CHAIN) (PHERS). 



35 

SEQ ID NO: _535,536 ENCODED BY SEQ ID NO: _1091 

ORF# Start End Direction Codon Length 



40 2 618 1097 F 160codons 

3 1150 1542 F 131codons 



PCTVUS97/D2318 
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ORF translation from 618-1097, direction F 



PCT/US97/D2318 



Blastp and/or MPSearch Result: 

5 

Description: 

ANACYSTIS NIDULANS. ELONGATION FACTOR G (EF-G). 
10 ORF translation from 1 1 50- 1 542, direction F 

Blastp and/or MPSearch Result: 

15 Description: 

MYCOBACTERIUM LEPRAE. ELONGATION FACTOR G (EF-G). 

20 SEQ ID NO: _537 ENCODED BY SEQ ID NO: _1092 

ORF# Start End Direction Codon Length 

1 637 1287 R 217 codons 

25 

ORF translation from 637-1287, direction R 
Blastp and/or MPSearch Result: 

30 

Result: UNK 

SEQ ID NO: .538 ENCODED BY SEQ ID NO: _1093 

35 ORF# Start End Direction Codon Length 

2 513 1124 F 204 codons 

40 ORF translation from 5 1 3- 1 1 24, direction F 
Blastp and/or MPSearch Result: 
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Description: 

Human KIP2 gene for p57KIP2. high-affinity choline transport protein (betT) homolog - 

5 SEQ ID NO: 539.540 ENCODED BY SEQ ID NO: 1094 
ORF # Start End Direction Codon Length 



20 



25 



40 



1 66 368 F lOlcodons 
10 2 580 1011 F 144codons 

ORF translation from 66-368, direction F 



15 Blastp and/or MPSearch Result: 
Description: 

secA protein - Staphylococcus camosus 



ORF translation from 580-101 1, direction F 



Blastp and/or MPSearch Result: 



Description: 

BACILLUS SUBTHJS. PREPROTEIN TRANSLOCASE SECA SUBUNIT. 



30 SEQ ID NO: 541-542 ENCODED BY SEQ ID NO: .1095. 
ORF # Start End Direction Codon Length 



1 747 926 F 60codons 
35 2 998 1285 F 96codons 



ORF translation from 747-926, direction F 

Blastp and/or MPSearch Result: 
Description: 
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LACTOCOCCUS LACTIS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). DI- 
TRIPEPTIDE TRANSPORTER. 

5 ORF translation from 998-1285, direction F 
Blastp and/or MPSearch Result: 
Description: 

10 LACTOCOCCUS LACTIS (SUBSP. LACTIS) (STREPTOCOCCUS LACTIS). DI- 
TRIPEPTIDE TRANSPORTER. 

SEQ ID NO: __543 ENCODED BY SEQ ID NO: .1096 

15 ORF# Start End Direction Codon Length 
3 914 1297 R 128codons 

20 ORF translation from 914-1297, direction R 
Blastp and/or MPSearch Result: 
Result: UNK 

25 

SEQ ID NO: _544 ENCODED BY SEQ ID NO: _1097 

30 ORF# Start End Direction Codon Length 
1 89 763 R 225codons 

35 ORF translation from 89-763, direction R 
Blastp and/or MPSearch Result: 
Description: 

40 ESCHERICHIA COLL BETAINE ALDEHYDE DEHYDROGENASE (EC 1 .2. 1 .8) 
(BADH). 



214 



WO 97/30070 PCIYUS97/02318 

SEQ ID NO: _545 ENCODED BY SEQ ID NO: _1098 

ORF# Start End Direction Codon Length 
1 126 236 F 37codons 



ORF translation from 126-236, direction F 

10 

Blastp and/or MPSearch Result- 
Description: 

1 5 STAPHYLOCOCCUS AUREUS. PROBABLE CADMIUM-TRANSPORTING ATPASE 
(EC 3.6.1.-) (CADMIUM EFFLUX ATPASE). 



20 SEQ ID NO: _546 ENCODED BY SEQ ID NO: _1099 

ORF# Start End Direction Codon Length 
2 2412 2747 F 112codons 

25 

ORF translation from 2412-2747, direction F 

30 Blastp and/or MPSearch Result: 
Description: 

mecuric reductase-like protein 

35 SEQ ID NO: _547 ENCODED BY SEQ ID NO: _1 100. 

ORF # Start End Direction Codon Length 
2 843 1154 R I04codons 

40 



ORF translation from 843-1 154, direction R 
Blastp and/or MPSearch Result: 
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Description: 

thiolase (EC 2.3.1.9) - Clostridium acetobutylicum 

5 

SEQ ID NO: _548 ENCODED BY SEQ ID NO: _1 101 

10 ORF# Start End Direction Codon Length 
1 420 938 F 173codons 

1 5 ORF translation from 420-938, direction F 
Blastp and/or MPSearch Result: 

Description: 

ALCALIGENES EUTROPHUS. ACETYL-COA ACETYLTRANSFERASE (EC 2.3.1.9) 
20 (ACETOACETYL-COA THIOLASE). 

SEQ ID NO: _549,550 ENCODED BY SEQ IDNO:_l 102 

ORF# Start End Direction Codon Length 

1 386 562 F 59codons 

2 845 1195 F I17codons 

30 ORF translation from 386-562, direction F 

Blastp and/or MPSearch Result: 
35 Result: UNK 

ORF translation from 845-1 195, direction F 

40 

Blastp and/or MPSearch Result: 
Result: UNK 
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5 SEQ ID NO: __551 ENCODED BY SEQ ID NO: _1 103_ 

ORF# Start End Direction Codon Length 



2 357 647 R 97 codons 

10 

ORF translation from 357-647, direction R 



15 Blastp and/or MPSearch Result: 
Result: UNK 

20 

SEQ ID NO: _552^53 ENCODED BY SEQ ID NO: _1 104 

ORF# Start End Direction Codon Length 



25 1 173 397 R 75 codons 

2 443 697 R 85 codons 



ORF translation from 173-397, direction R 

30 

Blastp and/or MPSearch Result: 
Description: 

35 fructose-permease IIBC component (fruA) homolog - Haemophilus 

ORF translation from 443-697, direction R 
40 Blastp and/or MPSearch Result: 
Description: 
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XANTHOMONAS CAMPESTRIS (PV. CAMPESTRIS). PTS SYSTEM, FRUCTOSE; 
SPECIFIC IIBC COMPONENT (EHBC-FRU) (FRUCTOSE- PERMEASE IIBC 
COMPONENT) (PHOSPHOTRANSFERASE ENZYME II, BC COMPONENT) (EC 
2.7.1.69) (EH-FRU). 

5 

SEQ ID NO: _554 ENCODED BY SEQ ID NO: _1 105 

ORF# Start End Direction Codon Length 

1 277 531 F 85codons 
ORF translation from 277-531. direction F 
1 5 Blastp and/or MPSearch Result: 
Result: UNK 



20 SEQIDNO:_555 ENCODED BY SEQ ID NO: _1 106. 

ORF# Start End Direction Codon Length 



1 43 861 R 273codons 

25 

ORF translation from 43-861, direction R 



30 Blastp and/or MPSearch Result: 
Description: 

STREPTOCOCCUS PNEUMONIAE. MALTODEXTRIN TRANSPORT SYSTEM 
PERMEASE PROTEIN MALC. 

35 

SEQ ID NO: _556 ENCODED BY SEQ ID NO: _1 107 

ORF# Start End Direction Codon Length 
40 

1 253 618 R 122codons 
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10 



20 



25 



ORF translation from 253-6 1 8, direction R 
Blastp and/or MPSearch Result: 

Description: 

ESCHERICHIA COLL GLUTAMATE SYNTHASE (NADPH) LARGE CHAIN 
PRECURSOR (EC 1.4.1.13) (GLUTAMATE SYNTHASE ALPHA SUBUNIT) (NADPH- 
GOGAT) (GLTS ALPHA CHAIN). 



SEQ ID NO: .557 ENCODED BY SEQ ID NO: _1 108. 

ORF# Start End Direction Codon Length 



15 1 210 398 R 63codons 



ORF translation from 210-398, direction R 

Blastp and/or MPSearch Result: 
Result: UNK 

SEQ ID NO: _558 ENCODED BY SEQ ID NO: _1 109. 

ORF # Start End Direction Codon Length 



30 1 246 806 F 187 codons 



ORF translation from 246-806, direction F 

35 Blastp and/or MPSearch Result* 

Description: 
ABC transporter 

40 

SEQ ID NO: _559 ENCODED BY SEQ ID NO: _1110. 

ORF # Start End Direction Codon Length 
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1 576 1160 F 195codons 

5 ORF translation from 576- 1 160, direction F 

Blastp and/or MPSearch Result: 
10 Description: 

2 622 38.7 262 1 FABI_ECOLI ENOYL-[ACYL-CARRIER-PR 1.40e-l 10 
SEQ ID NO: _560.561 ENCODED BY SEQ ID NO: _1 111 

15 

ORF# Start End Direction Codon Length 

1 527 811 F 95codons 

2 849 950 F 34codons 

20 

ORF translation from 527-81 1, direction F 
Blastp and/or MPSearch Result: 

25 Result: UNK 

ORF translation from 849-950, direction F 

30 

Blastp and/or MPSearch Result: 
Result: UNK 

35 

SEQ ID NO: _562 ENCODED BY SEQ ID NO: _1 1 12_ 

ORF# Start End Direction Codon Length 
40 1 292 513 R 74codons 

ORF translation from 292-513, direction R 

220 



WO 97/30070 



PCTYUS97/02318 



Blastp and/or MPSearch Result: 
Description: 

5 BACILLUS SUBTIUS. PHOSPHO-2-DEHYDRO-3-DEOXYHEPTONATE ALDOLASE 
(EC 4.1.2.15) (PHOSPHO-2- KETO-3-DEOXYHEPTONATE ALDOLASE) (DAHP 
SYNTHETASE) (3-DEOXY-D-ARABINO- HEPTULOSONATE 7-PHOSPHATE 
SYNTHASE) /CHORISMATE MUTASE (EC 5.4.99.5). 

10 

SEQ ID NO: _563 ENCODED BY SEQ ID NO: _1 1 13_ 

ORF# Start End Direction Codon Length 

15 

1 977 1207 F 77codons 



ORF translation from 977-1207, direction F 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 



SEQ ID NO: 564 ENCODED BY SEQ ID NO: .1114 

30 

ORF# Start End Direction Codon Length 



2 448 678 R 77 codons 

35 

ORF translation from 448-678, direction R 



Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: _565 ENCODED BY SEQ ID NO: .1115. 

5 ORF# Start End Direction Codon Length 
2 121 267 R 49codons 

1 0 ORF translation from 121 -267, direction R 

Blastp and/or MPSearch Result: 
15 Result: UNK 



20 SEQ ID NO: _566 ENCODED BY SEQ ID NO: _1 1 16. 

ORF # Start End Direction Codon Length 



2 440 736 F 99 codons 

25 

ORF translation from 440-736, direction F 
Blastp and/or MPSearch Result: 

30 

Description: 

ESCHERICHIA COLL FERRIC ENTEROBACTIN TRANSPORT ATP-BINDING 
PROTEIN FEPC. 

35 

SEQ ID NO: _567 ENCODED BY SEQ ID NO: _1117 

ORF # Start End Direction Codon Length 
40 1 355 735 F 127 codons 

ORF translation from 355-735, direction F 
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Blastp and/or MPSearch Result: 
5 Result: UNK 



SEQ ID NO. _568 ENCODED BY SEQ ID NO: _1 1 18. 

10 ORF# Start End Direction Codon Length 



1 199 555 R 119codons 
ORF translation from 199-555, direction R 

15 

Blastp and/or MPSearch Result: 
Result: UNK 

20 



SEQ ID NO: _569 ENCODED BY SEQ ID NO: _1 1 19_ 

25 

ORF# Start End Direction Codon Length 
2 914 1282 F 123 codons 

30 

ORF translation from 914-1282, direction F 

Blastp and/or MPSearch Result: 

35 Description: 

beta-ketothiolase=pory(3-bydroxyalkanoate) synthase - Thiocystis 



SEQIDNO:_570 ENCODED BY SEQ ID NO: 1120 

40 

ORF# Start End Direction Codon Length 



2 570 974 F 135 codons 
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ORF translation from 570-974, direction F 

5 

Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL UDP-N-ACETYL-D-MANNOSAMINURONIC ACID 
10 DEHYDROGENASE (EC 1.1.1.-) (UDP-MANNACA DEHYDROGENASE). 

SEQIDNO:_571 ENCODED BY SEQ ID NO: _1121_ 

15 ORF# Start End Direction Codon Length 



1 258 521 F 88codons 

20 ORF translation from 258-521, direction F 

Blastp and/or MPSearch Result: 

Description: 
25 ABC transporter 



SEQ ID NO: _572 ENCODED BY SEQ ID NO: _1 122 

30 

ORF # Start End Direction Codon Length 
1 316 867 R 184codons 

35 

ORF translation from 316-867, direction R 

Blastp and/or MPSearch Result: 

40 Description: 

Potentially pienylated protein 
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SEQ ID NO: _573 ENCODED BY SEQ ID NO: _1 123_ 

ORF# Stan End Direction Codon Length 
2 334 576 R 81 codons 



ORF translation from 334-576, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 



SEQ ID NO: _574 ENCODED BY SEQ ID NO: _1 124. 

20 ORF# Start End Direction Codon Length 



1 85 372 R 96 codons 



25 ORF translation from 85-372, direction R 
Blastp and/or MPSearch Result- 
Description: 

30 BACILLUS SUBTILIS. OLIGOPEPTIDE TRANSPORT ATP-BINDING PROTEIN 
OPPD. 



SEQ ID NO: _575 ENCODED BY SEQ ID NO: 1 1 25 

35 

ORF# Start End Direction Codon Length 
1 319 774 F 152 codons 

40 

ORF translation from 319-774, direction F 
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Description: 

D-3-PHOSPHOGLYCERATE DEHYDROGENASE (EC 1.1.1.95) (PGDH). - BACILLUS 
5 SUBTTLIS. 



SEQ ID NO: _576 ENCODED BY SEQ ID NO: _1 126. 

10 ORF# Start End Direction Codon Length 



1 375 734 F 120codons 



15 ORF translation from 375-734, direction F 
Blastp and/or MPSearch Result: 
Description: 

20 .>GLTB_ECOLI GLUTAMATE SYNTHASE (NADPH) LARGE CHAIN PRECURSOR 
(EC 1.4.1.13) (GLUTAMATE SYNTHASE ALPHA SUB UNIT) (NADPH-GOGAT).- 
ESCHERICHIA COLL Length = 1514 

SEQ ID NO: _577 ENCODED BY SEQ ID NO: _1 127 

25 

ORF# Start End Direction Codon Length 



1 12 533 R 174codons 

30 

ORF translation from 12-533, direction R 



Blastp and/or MPSearch Result: 

35 

Result: UNK 



SEQ ID NO: _578 ENCODED BY SEQ ID NO: _1 128 

40 

ORF# Start End Direction Codon Length 



1 216 524 R 103codons 
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ORF translation from 216-524, direction R 

5 

Blastp and/or MPSearch Result: 
Description: 

gluconate permease - Bacillus licheniformis 

10 

SEQ ID NO: _579_ ENCODED BY SEQ ID NO: _J 129 
ORF# Start End Direction Codon Length 
1 332 430 F 33codons 



ORF translation from 332-430, direction F 

20 

Blastp and/or MPSearch Result: 

Description: 
25 Putative transaldolase 



SEQ ID NO: _580_ ENCODED BY SEQ ID NO: _1 130_ORF # Start End 
Direction Codon Length 

30 

1 186 311 F 42codons 



ORF translation from 186-31 1, direction F 

35 

Blastp and/or MPSearch Result: 
Result: UNK 

40 

SEQ ID NO: _581 ENCODED BY SEQ ID NO: _I 131 
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2 130 363 R 78 codons 

5 

ORF translation from 130-363, direction R 
Blastp and/or MPSearch Result: 
10 Result: UNK 

15 SEQ ID NO: _582_ ENCODED BY SEQ ID NO: .1 132 
ORF# Start End Direction Codon Length 
2 371 469 F 33 codons 

20 

ORF translation from 371-469, direction F 

25 Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _583_ ENCODED BY SEQ ID NO: _J 133 

ORF# Start End Direction Codon Length 

35 

1 170 565 F 132 codons 

ORF translation from 170-565, direction F 

40 

Blastp and/or MPSearch Result: 
Description: 
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lipoate biosynthesis protein A (lipA) homolog - Haemophilus 

5 SEQ ID NO: _584_ ENCODED BY SEQ ID NO: _l 134_ 
ORF # Start End Direction Codon Length 



1 233 457 F 75codons 

10 

ORF translation from 233-457, direction F 



Blastp and/or MPSearch Result: 
Result: UNK 



20 SEQ ID NO: _585_ ENCODED BY SEQ ID NO: _1 135. 
ORF# Start End Direction Codon Length 



1 173 454 R 94codons 

25 

ORF translation from 173-454, direction R 



30 Blastp and/or MPSearch Result: 
Result: UNK 



35 

SEQ ID NO: _586_ ENCODED BY SEQ ID NO: _1 136. 

ORF # Start End Direction Codon Length 
40 

1 58 261 R 68 codons 
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ORF translation from 58-261, direction R 

Blastp and/or MPSeaxch Result: 
Result: UNK 

SEQ ID NO: _587_ ENCODED BY SEQ ID NO: J 137_ 
ORF# Start End Direction Codon Length 
1 143 745 F 201codons 

ORF translation from 143-745, direction F 

Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL BACITRACIN RESISTANCE PROTEIN (PUTATIVE 
UNDECAPRENOL KINASE) (EC 2.7. 1 .66). 

SEQ ID NO: _588_ ENCODED BY SEQ ID NO: _1 138 

ORF # Start End Direction Codon Length 
1 124 411 R 96codons 

ORF translation from 1 24-41 1, direction R 

Blastp and/or MPSearch Result: 
Description: 

STREPTOMYCES HYGROSCOPICUS. ACETYL-HYDROLASE (EC 3.1.-.-). 
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SEQ ID NO: _589_ ENCODED BY SEQ ID NO: _1 139 
5 ORF# Start End Direction Codon Length 



1 288 494 F 69codons 



1 0 ORF translation from 288-494, direction F 



Blastp and/or MPSearch Result: 
15 Result: UNK 



SEQ ID NO: _590_ ENCODED BY SEQ ID NO: _1 140_ 

20 

ORF# Start End Direction Codon Length 
1 37 3% R 120codons 

25 

ORF translation from 37-396. direction R 
Blastp and/or MPSearch Result: 

30 

Description: 

BACTEROroESNOIX)SUS(DICHELOBACTERNODOSUS). PEPTIDE CHAIN 
RELEASE FACTOR 3 (RF-3). 

35 



SEQ ID NO: _591_ ENCODED BY SEQ ID NO: J 141 

ORF # Start End Direction Codon Length 
40 

2 140 283 R 48codons 
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Blastp and/or MPSearch Result: 

5 

Result: UNK 



10 SEQ ID NO: _592,593_ ENCODED BY SEQ ID NO: J 142 
ORF# Start End Direction Codon Length 

1 1 150 R SOcodons 
15 2 164 400 R 79codons 

ORF translation from 1-150, direction R 

20 

Blastp and/or MPSearch Result: 
Result: UNK 

25 

ORF translation from 164-400, direction R 
Blastp and/or MPSearch Result: 

30 

Description: 

ligoendopeptidase F - Lactococcus lactis 

35 

SEQ ID NO: _594,595_ ENCODED BY SEQ ID NO: _1 143_ 
ORF# Start End Direction Codon Length 

40 1 365 808 R 148codons 

2 860 1075 R 72codons 
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Blastp and/or MPSearch Result: 

5 

Result: UNK 



ORF translation from 860-1075, direction R 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _596_ ENCODED BY SEQ ID NO: _1 144. 
20 ORF # Start End Direction Codon Length 



1 37 360 F 108codons 



25 ORF translation from 37-360, direction F 



Blastp and/or MPSearch Result: 

30 Description: 

2,4-dihydroxyhept-2-ene-l J-dioic acid aldolase - Escherichia coli 



SEQ ID NO: _597_ ENCODED BY SEQ ID NO: __1 145 

35 

ORF# Start End Direction Codon Length 
2 367 504 R 46 codons 

40 

ORF translation from 367-504, direction R 
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Blastp and/or MPSearch Result: 



PCTYUS97/02318 



Result: UNK 

5 

SEQ ID NO: _598.599_ ENCODED BY SEQ ID NO: .1 146. 

ORF # Start End Direction Codon Length 
10 

3 1375 1902 F 176codons 

4 1963 2421 F 153codons 



15 ORF translation from 1375-1902, direction F 



Blastp and/or MPSearch Result: 

20 Description: 

glucose-fructose oxidoreductase (EC 1.1.-.-) precursor - Zymomonas 



ORF translation from 1963-2421, direction F 
25 Blastp and/or MPSearch Result: 

Result: UNK 



30 SEQ ID NO: _600_ ENCODED BY SEQ ID NO: _1 147 
ORF # Start End Direction Codon Length 



1 195 317 F 41codons 

35 

ORF translation from 195-317, direction F 
Blastp and/or MPSearch Result: 

40 

Result: UNK 
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SEQ ID NO: .601 ENCODED BY SEQ ID NO: .1 148 
ORF # Start End Direction Codon Length 
1 1 372 F 124codons 



ORF translation from 1-372, direction F 

10 

Blastp and/or MPSearch Result: 
Result: UNK 

15 

SEQ ID NO: _602_ ENCODED BY SEQ ID NO: _1 149 

ORF# Start End Direction Codon Length 

20 

1 166 711 R 182codons 



ORF translation from 166-71 1, direction R 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

SEQ ID NO: _603_ ENCODED BY SEQ ID NO: _1 150. 

ORF# Start End Direction Codon Length 
35 

1 390 917 R 176codons 



ORF translation from 390-917, direction R 
40 Blastp and/or MPSearch Result: 

Result: UNK 
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SEQ ID NO: _604_ ENCODED BY SEQ ID NO: _1 151 

ORF# Start End Direction Codon Length 
1 20 433 F 138codons 



ORF translation from 20-433, direction F 

10 

Blastp and/or MPSearch Result: 
Description: 

15 BACILLUS SUBTTLIS. CITRATE SYNTHASE I REPRESSOR. 



SEQ ID NO: _605_ ENCODED BY SEQ ID NO: _1152 

20 

ORF# Start End Direction Codon Length 
1 52 585 R 178codons 

25 

ORF translation from 52-585, direction R 
Blastp and/or MPSearch Result: 

30 

Description: 

STAATL NCBI gi: 765069NCBI gi: 4737 - Staphylococcus aureus (strain 8325/4) DNA 



35 SEQ ID NO: _606_ ENCODED BY SEQ ID NO: _1 1 53 
ORF # Start End Direction Codon Length 



1 85 315 R 77codons 

40 

ORF translation from 85-315, direction R 
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Blastp and/or MPSearch Result: 

Description: 

BACILLUS SUBTILIS. ARGININE HYDROXIMATE RESISTANCE PROTEIN. 

5 

SEQ ID NO: _607_ ENCODED BY SEQ ID NO: _1 154_ 
ORF# Start End Direction Codon Length 
10 1 256 594 F 113codons 

ORF translation from 256-594, direction F 

15 

Blastp and/or MPSearch Result: 
Result: UNK 



SEQ ID NO: .608. ENCODED BY SEQ ID NO: _1 155. 

25 

ORF# Start End Direction Codon Length 



2 603 1148 R 182codons 

30 

ORF translation from 603-1 148, direction R 



Blastp and/or MPSearch Result: 

35 

Result: UNK 



40 

SEQ ID NO: _609_ ENCODED BY SEQ ID NO: _l 156. 

ORF# Start End Direction Codon Length 
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1 223 354 R 44codons 

5 ORF translation from 223-354, dirccti nR 

MTQ ARPGNHYMLS RLKPNGQFIDRLL V KNGG HGTHN A YR YIVQ * 

Blastp and/or MPSearch Result: 

10 Result: UNK 



SEQ ID NO: _610,61 1,612,613_ ENCODED BY SEQ ID NO: 1 157 

15 

ORF# Start End Direction Codon Length 



1 


679 


1272 


F 


198 codons 


3 


1668 


1844 


F 


59codons 


4 


1885 


2064 


F 


60 codons 


5 


2326 


2832 


F 


169 codons 



ORF translation from 679-1272, direction F 

25 

Blastp and/or MPSearch Result: 
Result: UNK 

30 

ORF translation from 1668-1844, direction F 

35 Blastp and/or MPSearch Result: 
Result: UNK 

40 ORF translation from 1 885-2064, direction F 
Blastp and/or MPSearch Result: 



238 



WO 97/30070 
Result: UNK 



PCT/US97/02318 



5 ORF translation from 2326-2832, direction F 

Blastp and/or MPSearch Result: 

10 Description: 

STAPHYLOCOCCUS AUREUS. AUTOLYSIN (EC 3.5.1.28) (N- 
ACOTYLMURAMOYL-L- ALANINE AMIDASE). 

15 

SEQ ID NO: .614,615. ENCODED BY SEQ ID NO: _1158 

ORF # Start End Direction Codon Length 

20 I 95 925 R 277codons 

2 945 1538 R 198codons 

ORF translation from 95-925, direction R 

25 

Blastp and/or MPSearch Result: 
Description: 

30 MORAXELLA SP. HALOACETATE DEHALOGENASE H-l (EC 3.8.1.3). 

ORF translation from 945-1538, direction R 
35 Blastp and/or MPSearch Result. 
Result: UNK 
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SEQ ID NO: _616_ ENCODED BY SEQ ID NO: _1 159 

ORF # Start End Direction Codon Length 



5 2 189 518 F HOcodons 



ORF translation from 189-518, direction F 
10 Blastp and/or MPSearcb Result: 
Description: 

BACILLUS SUBTILIS. SUCCINATE DEHYDROGENASE IRON-SULFUR PROTEIN 
(EC 1.3.99.1). 

15 



SEQ ID NO: _617_ ENCODED BY SEQ ID NO: _1 160 

20 

ORF# Start End Direction Codon Length 



1 1 138 R 46codons 

25 

ORF translation from 1-138, direction R 



Blastp and/or MPSearch Result: 

30 

Result: UNK 



SEQ ID NO: _618_ ENCODED BY SEQ ID NO: _1 161 

35 

ORF# Start End Direction Codon Length 
1 303 455 F 51codons 

40 

ORF translation from 303-455, direction F 
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Blastp and/or MPSearch Result: 
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Result: UNK 

5 

SEQ ID NO: _619,620_ ENCODED BY SEQ ID NO: _1 162. 
ORF# Start End Direction Codon Length 



10 1 80 457 F 126codons 

2 586 891 F 102codons 



ORF translation from 80-457, direction F 

15 

Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL TYPE I RESTRICTION ENZYME ECOR 124/3 I M PROTEIN 
20 (EC 2.1.1.72). 

ORF translation from 586-891, direction F 

Blastp and/or MPSearch Result: 

25 

Description: 

TYPE I RESTRICTION ENZYME ECOKI SPECIFICITY PROTEIN (S PROTEIN). - 
ESCHERICHIA COLL 

30 



SEQ ID NO: _621_ ENCODED BY SEQ ID NO: .1 163. 
35 ORF# Start End Direction Codon Length 



1 114 635 R 174codons 



40 



ORF translation from 1 14-635, direction R 
Blastp and/or MPSearch Result: 
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5 

SEQ ID NO: _622_ ENCODED BY SEQ ID NO: _1 164. 
ORF# Start End Direction Codon Length 



10 i 178 492 F 105codons 



ORF translation from 178-492, direction F 
1 S Blastp and/or MPSearch Result: 
Description: 

ESCHERICHIA COLL PHOSPHOPENTOMUTASE (EC 5.4.2.7) 
(PHOSPHODEOXYRIBOMUTASE). 

20 

SEQ ID NO: _623_ ENCODED BY SEQ ID NO: _l 165_ 
ORF# Start End Direction Codon Length 



25 1 101 238 R 46 codons 



ORF translation from 101-238, direction R 

30 

Blastp and/or MPSearch Result: 
Description: 

STAPHYLOCOCCUS AUREUS. 6-PHOSPHO-BETA-GALACTOSIDASE (EC 3.2.1.85) 
35 (BETA-D-PHOSPHOGALACTOSIDE GALACTOHYDROLASE). 



242 



WO 97/30070 



PCT/US97/02318 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Black, Michael 
Buraham, Martin 
Hodgson, John 
Knowles, David 
Nicholas, Richard 
Pratt, Julie 
Reichard, Raymond 
Rosenberg , Mart in 
Ward, Judith 

(ii) TITLE OF THE INVENTION: Novel Compounds 

(iii) NUMBER OF SEQUENCES: 1166 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 19-FEB-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/011,888 

(B) FILING DATE: 20 -FEB- 199 6 
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(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Gimmi, Edward R 

(B) REGISTRATION NUMBER: 38,891 

(C) REFERENCE / DOCKET NUMBER: GM50006 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Gin Phe Ser Leu Leu He Tyr He Val Val He Cys Ala Val Met 

15 10 15 

Tyr Phe Leu Met He Arg Pro Gin Gin Lys Arg Ala Lys Gin His Arg 

20 25 30 

Glu Leu He Asn Asn He Gin Ser Gly Gin Arg He Thr Thr He Gly 

35 40 45 

Gly He Lys Gly Thr Val Lys Ala Val Asp Glu Thr Thr Val Val He 

50 55 60 

Thr Val Asn Gly His Gly Thr Glu Leu Thr Phe Glu Lys Pro Ala He 
65 70 75 80 

Lys Gin Val Gly Pro Xaa He Thr Arg Ser Gly He Lys Lys Asp Glu 

85 90 95 

Thr Asn Ser Val Ser Ser Leu Leu Lys Leu Xaa Val Xaa Leu 
100 105 HO 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Ser His lie Thr Ala Tyr Glu lie Leu Ser Glu lie Arg Lys 

15 10 15 

Lys Leu Ala Gin Lys Leu Met Arg Leu Pro Leu Gly Val Val Glu Ser 

20 25 30 

Lys Lys lie Gly Glu Leu Lys Asn lie Phe Xaa Asp Lys Val Glu Thr 

35 40 45 

He Glu Leu Pro Leu Ala His Met He Pro Glu Val Xaa Gly Asn Leu 

50 55 60 

Leu Val Ala Ala Ala He Phe Leu Tyr He Met Xaa Xaa Asp Trp Arg 
65 70 75 80 

He Gly 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Thr Leu Gly Phe Thr Thr Leu Val Met Gin Asp Gly Phe Met 

1 5 10 15 

Xaa Gly Leu He Gly Xaa Ala Thr Xaa Gly Pro Xaa Leu Xaa Xaa Phe 

20 25 30 

Pro Val Tyr Tyr His Arg Tyr Leu He Trp He Asn Asp Gly Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO:4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Phe Met Val lie Lys Thr Leu Gly Gly lie Trp Thr Leu Lys 

15 10 15 

Leu Phe lie Leu Pro Asn His Pro lie lie Lys Ser Gly Leu Tyr Lys 

20 25 30 

lie Thr Lys Thr Pro Lys Leu Leu Leu Lys His His Ser 
35 40 45 

(2) INFORMATION FOR SEQ ID NO:5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gly Glu Asn Lys Asn Asn Leu Ser lie Asn Asp Asp His Ser Asn 

1 5 10 15 

Ala Ala Met Thr His Thr Ser Asp Ala He Ala Ser Ser Asp Phe He 

20 25 30 

He Arg Glu Leu Asp Leu Asn Gin Glu Pro Glu Met Gin Arg Glu Ser 

35 40 45 

Thr Xaa Ser Arg Gin Asp Ala Trp Xaa Gin Leu Lys Arg Xaa Lys Leu 

50 55 60 

Ala Val Val Gly Met He Gly Gly Xaa Xaa His Xaa Pro He Xaa Phe 
65 70 75 80 

Tyr Arg Ser Ser Xaa 
85 



(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

Met Lys Ser Asp Ser Leu Lys Glu Asn lie lie Tyr Gin Gly Leu Tyr 

15 10 15 

Gin Leu lie Arg Thr Met Thr Pro Leu lie Thr lie Pro lie lie Ser 

20 25 30 

Arg Ala Phe Gly Pro Ser Gly Val Gly lie Val Ser Phe Ser Phe Asn 

35 40 45 

lie Val Gin Tyr Phe Leu Met He Ala Ser Val Gly Val Gin Leu Tyr 

50 55 60 

Phe Asn Arg Val He Ala Lys Ser Val Asn Asp Lys Arg Gin Leu Ser 
65 70 75 80 

Gin Gin Phe Trp Asp He Phe Val Ser Lys Leu Phe Leu Ala Leu Thr 

85 90 95 

Val Phe Ala Val Tyr Met Val Val He Thr He Phe He Asp Asp Tyr 

100 105 110 

Tyr Leu He Phe Leu Leu Gin Gly He Tyr He He Gly Ala Ala Leu 

115 120 125 

Asp He Ser Trp Phe Tyr Ala Gly Xaa Glu Lys Phe Lys He Pro Ser 

130 135 140 

Leu Ser Asn He Val Ala Ser Gly He Val Leu Ser Val Val Val He 
145 150 155 160 

Phe Val Lys Asp Gin Ser Asp Leu Ser Leu 
165 170 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Gly Glu Ser Asp Glu Leu Leu 

15 
Asn Leu Lys Gin Val lie Asn Glu 
20 

lie Phe Leu Asn Val Glu Asn His 

35 40 
Asp Leu Ser Asn Ser Ala Pro Leu 

50 55 
Tyr His Asn Phe Asp Gly Phe Gly 
65 70 



Leu Lys Asn lie Leu His Phe Lys 

10 is 
Leu Asp lie Glu Gin Glu Lys Leu 
25 30 
Lys Leu Leu Asn Asn Lys Glu Arg 
45 

He Tyr Lys Thr Leu Ser Ala Leu 
60 

Leu Asn Asn Phe 

75 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Thr Lys Glu Tyr Ala Thr Leu 

1 5 
Val Lys Pro Phe Thr Ser Tyr Pro 
20 

Tyr Ser Gly Gly His Val Asp Asn 

35 40 
Asn Lys Thr Gly His Val Glu Ala 

50 55 
Val Thr Ser Xaa Glu Asn He Leu 
65 70 
Pro Xaa 



Ala Gly Gly Cys Xaa Trp Cys Met 

10 15 
Gly He Lys Ser Val Val Ser Gly 
25 30 
Pro Thr Tyr Glu Gin Val Cys Thr 
45 

Xaa Gin He Thr Cys Asp Pro Glu 
60 

Asp He Tyr Phe Lys Thr Cys Xaa 
75 80 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 98 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Ser Asp Thr lie Asn Asn Lys Glu Asp Leu Arg Glu lie Ala Glu 

15 10 15 

Phe Gin Glu Glu Asp Ala Val Phe Asn Lys Val Leu Xaa Asn Val Ser 

20 25 30 

He Lys He Met His Thr Asp He Ala Asn Asn Arg Asp Arg He Asp 

35 40 45 

He Val Ser Asp He Pro Asp Asn Leu He Phe Tyr He Tyr Asp Ser 

50 55 60 

Leu Ser Val Gly Phe He His Trp He Lys Asp Val Lys He Xaa Cys 
65 70 75 80 

Pro Val Xaa Asp He Asp Asn He Phe His Arg Leu He Asn He Asn 
85 90 95 

He Xaa 



(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Tyr Ala Leu His Met Tyr Leu Tyr Val Gly He Trp Asp Asn 

1 5 10 15 

Gin Leu Leu Glu Met He Cys Phe Asp Gly Tyr Asn Gin Asn Arg Ser 

20 25 30 

Ala Pro Pro Glu Glu Leu He Pro Gin Phe Asp Tyr Val Lys Glu He 

35 40 45 

Ser Glu Gin Phe Gly Phe Val Asn He Gly Val Lys Asn Tyr Glu Ala 
50 55 60 
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Asp Asp Val lie Gly Thr Leu Ala Gin Gin Tyr Ser Thr Asp Xaa Asp 
65 70 75 80 

Val Tyr lie He Thr Gly Asp Lys Asp Leu Leu Gin Cys He Asn Asp 

85 90 95 

Asn Val Glu Val Trp Leu He Lys Lys Gly Phe Asn He Tyr Asn Arg 

100 105 no 

Tyr Thr Leu His Arg Phe Asn Glu Glu Tyr Ala Leu Glu Pro Gin Gin 

115 120 125 

Leu He Asp He Lys Ala Phe Met Gly Asp Thr Ala Asp Gly Tyr Ala 

"0 135 140 

Gly Val Lys Gly He Gly 
145 150 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met Val Thr Xaa Leu Val He Pro He Tyr Xaa Gin Xaa Asp Tyr Val 

1 5 10 15 

Xaa Thr Thr Tyr Asp Arg Trp Val Val Ser Glu He Asp Asn His Gin 

20 25 30 

He He Trp Val He Leu Asn Tyr Glu Phe Asn Glu Ala Arg Pro Thr 

35 40 45 

He Gly Gin Ser Asp Glu Asp Glu Lys Ser Glu 
50 55 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met lie Asp Xaa Phe Met Gin Val Leu Lys Leu lie Lys Glu Lys Arg 

15 10 15 

Thr Asn Asn Val Val Lys Lys Ser Asp Trp Asp Lys Gly Asp Leu Tyr 

20 25 30 

Lys Thr Leu Xaa His Asp Lys Leu Pro Lys Gin Leu Lys Val His lie 

35 40 45 

Lys Glu Asp Lys Tyr Ser Xaa Val Gly Lys Val Ala Thr Gly Asn Tyr 

50 55 60 

Ser Lys Xaa Pro Trp He Xaa He Tyr Pro Pro Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Xaa Pro Arg Asp Lys Asn Ala Ala Lys Gin Arg Ala Leu Thr Leu 

15 10 15 

Ser Ser Glu Leu Asn Lys Tyr He Thr Ser Asn Glu Phe Asn Thr Gly 

20 25 30 

Arg Phe Tyr Tyr Ala Glu Asn Lys Asp Ser Ser Tyr Asp Leu Lys Asn 

35 40 45 

Asp Tyr Pro Ser Gly Tyr Ser His Gly Ser He Arg Xaa Lys Tyr Tyr 

50 55 60 

Asp Xaa Asn Glu Gly Xaa Thr Glu Glu Asp Met Leu Glu Asp Leu Xaa 
65 70 75 80 

Lys Xaa Xaa Xaa Glu Leu He Xaa 
85 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Phe Asp Phe Gly Val Leu Lys Arg Pro His He Phe Tyr Ala Tyr 

15 10 15 

Asp Leu Asp Lys Tyr Gly Asp Glu Leu Arg Gly Phe Tyr Met Asp Tyr 

20 25 30 

Lys Lys Glu Leu Pro Gly Pro He Val Val Asn His Thr Ala Leu He 

35 40 45 

Asp Ala Leu Lys Xaa He Asp Glu Thr Xaa Xaa Glu Tyr He Glu Ala 

5 ° 55 6 o 

Arg Ala Val Phe Tyr His Lys Phe Cys Ser Leu Glu Asp Gly Gin Ala 
65 7 ° 75 80 

Ser Gin Arg He Cys Gin Pro He Phe Lys 
85 90 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Met Phe Gly Phe He Gly Met Leu He Val Gly Gly Leu He Gly Trp 

1 5 io 15 

Ala Ala Gly Ala He Met Gly Lys Asp He Pro Gly Gly He Leu Gly 

20 25 30 

Asn He He Ala Gly He He Gly Ser Xaa Gly 
35 40 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Asn Asn Gly Phe Phe Asn Ser Asp Phe Asp Ser lie Phe Arg Arg 

15 10 15 

Met Met Gin Asp Met Gin Gly Ser Asn Gin Val Gly Asn Lys Lys Tyr 

20 25 30 

Tyr lie Asn Gly Lys Glu Val Ser Pro Glu Glu Leu Ala Gin Leu Thr 

35 40 45 

Gin Gin Gly Ser Asn Gin Ser Ala Glu Gin Ser Ala Gin Ala Phe Ser 

50 55 60 

Thr Lys Gin His Lys Asp Asn Lys Gly Asn Lys Val Ala Thr Ala lie 
65 70 75 80 

lie 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Met lie Arg Phe Lys Met Arg Ala Val lie Lys Gin Gin Ala Leu Lys 

15 10 15 

Ala lie Gin Gin Glu His His Pro Val Lys Val Phe Val Ala Pro Thr 

20 25 30 

He Lys Phe Met Glu Trp Arg Val Ala He Gin Thr Asp Ala His Asp 

35 40 45 

Tyr Val Gly Lys Ala Tyr Gly Arg Asn Val Xaa Phe Ser Asp Xaa Val 

50 55 60 

Glu Arg Ser Asn He He Asn Arg Leu His Phe Met Glu Ser Gin Arg 
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80 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Tyr He His Lys Trp Lys Asp Asp He Met Thr Ser He Lys Thr 

1 5 10 15 

He Thr Pro Lys Asp Phe He Phe Arg Val Leu Ser Gly Val Ala lie 

20 25 30 

Gly He Val Ala Gly Leu Xaa Pro Asn Ala He Leu Gly Glu He Phe 

35 40 45 

Lys Tyr Phe Met Asp Tyr His Pro He Phe Lys Thr Leu Leu Gly Val 

50 55 60 

Val Gly Ala He Gin Phe Thr Val Pro Ala Leu He Gly Ala Leu Gly 
65 70 75 80 

Cys His Glu He 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Lys Ala Thr Thr He lie Gly He Ala Gly Gly Ser Gly Ser Gly 

15 10 15 

Lys Thr Thr Val Thr Asn Glu He Met Lys Asn Leu Glu Gly His Ser 
20 25 30 
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Val Ala Leu Leu Ala Gin Asp Tyr Tyr Tyr Lys Xaa Ser Lys Ser Thr . 
35 .40 45 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 92 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Xaa lie Xaa Pro Xaa Leu Asp Ala Val Leu Ser Ser Asp Thr Thr 

15 10 15 

Thr Leu Arg lie lie Lys Gin Tyr Glu Gin Ala Val Xaa Ala Tyr Ala 

20 25 30 

Asp Asp Gin Ser Asp Lys Leu Phe Lys Arg Met lie Asp Ala Gin Asp 

35 40 45 

Ala Met Asp Gin His Asp Ala Trp Asp Tyr Asn Ala Glu lie Lys Thr 

50 55 60 

lie Leu Ser Lys Leu Gly lie His Asp Thr Thr Lys Tyr He Lys Glu 
65 70 75 80 

Leu Ser Gly Gly Gin Gin Lys Arg Val Val Leu Ala 
85 90 



(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Met Met Ser Gin Phe Tyr Leu Asn Lys Met Ala Ser He Asp His His 

15 10 15 

Pro He Pro Phe Arg Asn Leu Phe He Glu Asp Ser Glu Gin Leu Asn 
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20 25 30 

Ser Xaa Asp Asn Trp Leu Xaa Ser Thr Lys Phe Met Leu Pro Lys Trp 

35 40 45 

Leu Tyr Lys lie Ala Lys Gin Arg Ala Asp Asn Lys Gin Leu Gin Asn 

50 55 60 

Phe Gly Leu Tyr Thr Lys Gin Pro Asn Val Leu Lys Asp His lie Val 
65 70 75 so 

Phe lie Gly Asp His His Gin Tyr He Gly Asn Ser Lys Tyr Leu Phe 

85 90 95 

Thr Tyr Phe Xaa Lys His Asn Pro Met Thr Ala Cys Tyr Leu Val Thr 

100 105 no 

Asp Asp Arg Arg Gly Thr Thr Phe His Xaa Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Met Xaa Xaa Ser Ala Cys Met Pro Gly He Xaa Xaa He Asp Asp Xaa 

15 10 15 

Xaa Pro Ser Gin Ala Leu He Gly Asn Xaa Asp Glu His Leu Lys Ala 

20 25 30 

He Xaa Glu Ser Phe Asp Val Val He His Xaa Arg Gly Gin Glu Val 

35 40 45 

Ala Val Lys Gly Xaa Thr Asn Arg Lys Arg Arg Glu Ser Gly He Ser 

50 55 60 

He Asn Gin Phe Ala Xaa Gly Tyr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: protein 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Gly Leu Leu Lys Gly Thr Ala Ser lie Asn Glu Leu Leu Gin 

15 10 15 

His Gly Asp Leu Gly lie Ala Thr Leu Thr Gly Xaa Lys Arg Val Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Leu Leu lie Val Thr Ala Cys 

1 5 
lie Xaa Xaa Ala Leu Xaa Xaa Asp 
20 

Leu Thr Met Trp Xaa Gly Trp Arg 

35 40 
Lys lie Thr Ala Ser lie lie Leu 
50 55 



Gly Pro Asn Arg Ser Xaa Glu Asp 

10 15 
Asn Xaa Xaa Asp Lys Xaa Asn Gin 
25 30 
Gin Ser Lys Trp Arg Phe Xaa Xaa 
45 

Lys Lys Leu Ala Ser Xaa 
60 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
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Met Ala Lys Lys He Val Ser Asp 

1 5 
Leu Val Arg Ala Asp Phe Asn Val 
20 

Asn Asp Asn Arg He Xaa Gin Ala 

35 40 
Glu Gin Gly Gly Lys He Val Leu 

50 55 
Glu Glu Ser Asp Lys Ala Lys Leu 
65 70 



Leu Asp Leu Lys Gly Lys Thr Val 

10 15 
Pro Leu Lys Asp Gly Glu He Thr 
25 30 
Leu Pro Thr He Gin Tyr He He 
45 

Phe Xaa His Leu Gly Lys Val Lys 
60 

Thr Leu Arg Pro Val Ala 
75 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met Leu Gly Xaa Leu Leu He Phe Ser He Leu Gly Met Leu Val Val 

1 5 10 15 

Gin Tyr Ala Tyr Met Ala Ser He Asn Thr Gly Asn Ala Ala He Ala 

20 25 30 

Thr Leu Leu Gin Tyr Xaa Ala Pro Val Tyr Xaa He He Xaa Xaa Xaa 

35 40 45 

His Lys Arg Arg Xaa Glu Asn 
50 55 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Met Lys Lys Arg Arg Lys Cys Met Leu Gin Val Thr Asp Val Ser Leu 

1 5 10 15 

Arg Xaa Gly Asp Arg Lys Leu Phe Glu Asp Val Asn lie Lys Phe Thr 

20 25 30 

Glu Gly Asn Cys Tyr Gly Leu lie Gly Ala Asn Gly Ala Gly Lys Ser 

35 40 45 

Thr Phe Leu Xaa lie Leu Ser Gly Glu Leu Asp Ser Gin Thr Gly Gin 

50 55 60 

Xaa Phe Xaa Arg Val Lys Met Asn Xaa 
65 70 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Glu Gin lie Asn lie Gin Phe Pro Asp Gly Asn Lys Lys Ala Phe 

15 10 15 

Asp Lys Gly Thr Thr Thr Glu Asp He Ala Gin Ser He Ser Pro Gly 

20 25 30 

Leu Arg Lys Lys Ala Val Xaa Gly Lys He 
35 40 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i> SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 38 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
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Met lie Asp Lys Asp Asp Val His Xaa Glu Thr Lys Asp Val Gly Trp 

15 10 is 

Gin His lie Xaa Trp Xaa Met Xaa Xaa Xaa Gin Xaa Xaa Asn Xaa Asn 

20 25 30 

Phe Arg Glu Thr Phe Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Asn Met Asn Xaa Ser Gly Asp Asn Lys Leu Ser Lys Ser Ser Leu 

1 5 10 15 

lie Gly Leu Val lie Gly Xaa Met He Gly Gly Gly Ala Xaa Asn He 

20 25 30 

Met Ser Asp Met Gly Gly Lys Ala Gly Gly Leu Ala He He He Gly 

35 40 45 

Trp He He Thr Ala He Gly Met Xaa Xaa Leu Ala Leu Val Ser Gin 

50 55 60 

Asn Leu Xaa Asn Glu Arg Pro Glu Leu Asp Gly Gly He Tyr Ser Tyr 
65 70 75 80 

Ala Gin Ala Gly Phe Gly Asp Xaa Val Gly Xaa He Lys Arg Leu Glu 

85 90 95 

He Leu Xaa Leu Lys Arg Leu 
100 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Asn Leu Lys Gin Ser lie Glu Glu lie lie Asn Gin Pro Glu Tyr 

1 5 10 15 

Glu Pro Met Ser Val Ser Asp Phe Gin Asp Ala Leu Gly Leu Ser Ser 

20 25 30 

Ala Asp Ser Phe Arg Asp Leu lie Xaa Val Xaa Val Glu Leu Glu Xaa 

35 40 45 

Ser Gly Leu He Glu Arg Thr Lys Xaa Xaa Arg Tyr Gin Xaa Lys His 

50 55 60 

Ser Xaa Xaa Gly Xaa Ser Xaa Leu He Lys Gly Thr Leu Xaa Gin Asp 
65 70 75 80 

Lys Lys Gly Phe Xaa He Leu Lys Thr 
85 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Ala Met Tyr Leu He Arg Asn Asn Phe Lys Ala Xaa Xaa Pro Phe 

1 5 10 15 

Leu Xaa Glu Glu He Gly Leu Ser Thr Leu Glu Leu Gly Tyr He Gly 

20 25 30 

Leu Ala Val Ser He Thr Tyr Gly Leu Gly Lys Thr Leu Leu Gly Tyr 

35 40 45 

Xaa Val Asp Gly Arg Asn Thr Lys Arg He He Ser Xaa Leu Leu He 

50 55 60 

Leu Ser Ala He Thr Val Leu He Met Gly Phe Gly Phe Lys Leu Leu 
65 70 75 80 

Trp Phe Leu 



(2) INFORMATION FOR SEQ ID NO: 33: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



Met Asp Met Lys lie Xaa Xaa Leu Phe Pro Glu Xaa Gly Glu Gly Gin 

1 5 10 is 

Glu Asn Gin Leu Xaa Asn Thr Xaa Lys Xaa lie Gly Leu Lys Thr Phe 

20 25 30 

Xaa Glu Glu Arg Gly Thr 
35 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Asn Ala Arg Pro Glu He Gin Ala Leu Thr Ser Asp Val Thr 

1 5 10 15 

Ala Tyr Thr Asn Glu Glu Asp Gly Ala Pro Lys Tyr Leu Ala Lys His 

20 25 30 

Phe Leu Ala Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Tyr Gin Asp Leu lie Val Asn Asn Arg Lys Thr Glu lie Asp Tyr 

1 5 10 15 

lie Asn Gly Ala Val Ala Thr Leu Gly Lys Gin Arg His lie Glu Ala 

20 25 30 

Pro Val Asn Arg Phe He Thr Asp Leu lie His Thr Lys Glu Ser Gin 

35 < 40 45 

Arg His Ala Gin Asp 
50 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36: 

Met Glu Gly He Ser His Glu Pro Xaa Ser Phe Ala Gly His Asn Lys 

1 5 10 15 

Leu Ser Lys Leu Val Val Leu Tyr Asp Ser Xaa Asp lie Ser Leu Asp 

20 25 30 

Gly Glu Leu Asn Lys Ala Phe Ser Glu Asn Thr Xaa Ala Arg Phe Glu 

35 40 45 

Ala Tyr Gly Trp Asn Tyr Leu Leu Val Lys Arg Trp 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Met Thr Lys Glu Met His Xaa Ser 

1 5 
He Gly Met He Met Xaa Thr He 
20 

Gin Tip Phe Glu Asp Asn Arg Asn 

35 40 
Ala Leu Lys Leu Xaa Asp Ser Leu 

50 55 
Thr He Val Met Xaa Leu Tyr Gin 
65 70 
Xaa Ser Ser He Trp Xaa Asp Ser 
85 



Gly Asn Val Thr Leu Phe Lys Gin 

10 15 
Ala Ser Val Lys His Val He Glu 
25 30 
Lys Ser Phe Xaa Thr Tyr Glu Ala 
45 

Glu Asn Thr Pro Ala Ala Arg Ala 
60 

Met Phe His Val Leu Xaa Ser Phe 

75 80 
Glu Ala Gly Leu Ser Asn 
90 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Asp He Ser Pro Gly Gly Val He Val Xaa Leu Leu Val He Leu 

1.5 10 15 

Leu Met lie Thr Met Xaa Tyr Gin Lys Met Arg Met Lys Phe Lys Lys 

20 25 30 

Gly Ala Asn He Asn Glu Tyr Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Pro Thr Lys Gly Asp Val Tyr Leu Asn Ser Glu Ser lie Phe Glu 

15 10 15 

Asn Ser Thr Leu Lys Lys Arg Asp Phe Ser His Thr Xaa Lys Ser Leu 
20 25 30 

Phe Xaa 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Thr Lys Gly Lys Glu Val Glu Arg Asp Gly Phe Gin lie Arg Arg 

15 10 15 

Thr Thr Val Glu Asn Gly He Pro Cys Leu Thr Ser Leu Asp Thr Ala 

20 25 30 

Asn Ala Leu Thr Asn Val He Glu Ser Met Thr Phe Thr Met Arg Gin 
35 40 45 

Met 



(2) INFORMATION FOR SEQ ID NO: 41: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Lys Thr He Glu Val Asp Asp Glu Leu Tyr Ser Tyr He Ala Ser 

15 10 15 

His Thr Lys His He Gly Glu Ser Ala Ser Asp He Leu Arg Arg Met 

20 25 30 

Leu Lys Phe Ser Ala Ala Ser Gin Pro Ala Ala Pro Val Thr Lys Glu 

35 40 45 

Val Arg Val Ala Ser Pro Ala He Val Glu Ala Lys Pro Val Lys Thr 

50 55 60 

He Lys Asp Lys Val Arg Ala Met Arg Glu Leu Leu Leu Ser Asp Glu 
65 70 75 80 

Tyr Ala Glu Gin Lys Arg Ala Gly Asn Arg Phe Met Leu Leu Leu Ser 

85 90 95 

Ser Leu Tyr Phe Ser 
100 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Phe Val Phe Pro His Gly Ala Gin Asp Arg His Thr Gin Ala Tyr 

15 10 15 

Asp Tyr Leu Val Asp Glu Ala Glu Phe Lys Phe He Ala Gly Val Gly 

20 25 30 

Pro His Asn Phe Thr Asp He Ser Ala Xaa Asn Val Tyr Gin Asp Arg 

35 40 45 

Val Ala He Asp Gly Leu Asn Leu Phe Glu Phe Lys Tyr Lys Leu Gin 

50 55 60 

Thr Val Phe 
65 

(2) INFORMATION FOR SEQ ID NO: 43: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Met Val Ser He Arg His Arg Ala 

1 5 
Asn Lys Leu Asp Arg Tyr Gin Phe 
20 

Xaa Lys Arg Tyr Asn Gin Phe Leu 

35 40 
Glu Glu Tyr He Ala Ala Ser Asn 

50 55 
Thr Glu Asn Asp Lys Gin Phe Lys 
65 70 
Asp Gly Tyr Leu He Val Arg Asp 
85 

He Ser Ala Asp He Asp Phe 
100 



Thr Ser He Gin Leu His Asp Xaa 

10 15 
Leu Glu Arg Leu Leu Gin Glu He 
25 30 
Thr Leu Pro Phe Ser Glu He Arg 
45 

He Trp Asn Arg Thr Leu Leu Phe 
60 

Gly Gin Ala He Asp Leu Asp Tyr 

75 80 
Glu Ala Gly Glu Ser His Arg Leu 
90 95 



(2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Met Gly He Ala Ala Leu Phe Thr Glu Ala Leu Val Gly Ala Asn Tyr 

1 5 10 15 

Ala He Ala Val Val Phe He Thr He Gin Val He Leu Met Asn Gly 

20 25 30 

Leu Ala Ser Gin Asn Leu Thr He Asn He Ala Xaa Pro Arg Val He 
35 40 45 
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Asp Val Ala He Gly He Val He Ala He He Gly Leu Phe Gly Pro 

50 55 . 60 

Trp Thr Thr Tyr Arg He Pro Tyr Leu Val Pro 

65 70 75 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met He His Val Ala Leu Val Ser Asp Gly Phe Phe Pro Met Gly Asp 

1 5 10 15 

Thr Val Glu Leu Ala Ala Gin His Gly He Lys Ala He He Gin Pro 

20 25 30 

Gly Gly Ser He Lys Asp Gin Asp Ser He Asp Met Ala Asn Lys His 

35 40 45 

Gly He Ala Met Val Val Thr Gly Thr Arg His Phe Lys His 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Leu Ser Tyr Gly Gly Leu Leu Lys Glu Lys His Lys Xaa Leu Asn 

15 10 15 

Leu Asp Asp Ala Glu Asp Gly Asn Leu He Asn Thr Xaa Asp Glu Asp 

20 25 30 

Lys Thr Thr Asp Glu Glu Glu Lys Ala His Ser Xaa Thr Xaa Xaa Trp 
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35 40 45 

Xaa Xaa Glu Xaa Gin Asn Tyr Tyr Leu Gin Asn Leu Arg Ser Xaa Lys 

50 55 60 

Tyr Glu His 
65 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gin Ala Leu Lys Thr Lys Ser Asn lie Gly Glu Met Phe Asn lie 

15 10 15 

Gin Glu Lys Glu Asn Gly Xaa lie Ala He Ser Gly Arg Glu Leu His 

20 25 30 * 

Gin Ala Leu Glu Val Xaa Thr Arg Tyr Asp Lys Trp Ser Glu Arg Met 

35 40 45 

Thr Glu Tyr Gly Leu Glu Xaa Gly He Gly Phe Tyr Phe Ala Ser Trp 

50 55 60 

Xaa Xaa Tyr Thr Gly Gin Xaa Lys Gly Ala Val Leu He Glu Gin Ser 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Val Gin Tyr Ser Met Ser He Leu Thr Phe Gly Asp Val Asn Met 
1 5 10 15 
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Asn Lys Pro Phe lie Ala lie Glu 
20 

Ser Leu Ala His Lys Leu Ser Gin 

35 40 
Glu lie lie Thr Glu Asn Pro Phe 
50 55 
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Gly Pro He Gly Val Gly Lys Ser 
25 30 
Thr Leu Asp Phe Tyr Glu Glu Lys 
45 

Leu Ser Asp Phe Leu 
60 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



Met Arg He Leu Val Glu He Ala Tyr Xaa Gly Asn Asn Phe Leu Gly 

15 10 15 

Phe Xaa He Gin Gin Asn Gly Arg Thr Val Gin Gin Gin Phe Glu Lys 

20 25 30 

Leu Leu Gin Arg Met His Lys Arg His Val Arg He His Pro Ser Ser 

35 40 45 

Arg Thr Asp Arg Gly Val His Ala He Gin Gin Tyr Phe His Phe Asp 

50 55 60 

Thr Glu Leu Asn He Pro Met Ser Gin Trp Gin Tyr Ala Met Asn Arg 
65 70 75 80 

Thr Leu Pro Asp Asp He Tyr Val Asn Asn Val Val Thr Val Asp Asp 

85 90 95 

Asp Phe His Cys Arg Tyr Asp Cys Val Gly Lys Arg Tyr Arg 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Arg Lys Glu Ser Arg Val Gin Ala Phe Gin Thr Leu Phe Gin 

15 10 15 

Leu Glu Met Lys Asp Ser Asp Leu Thr lie Asn Glu Ala lie Ser Phe 

20 25 30 

lie Lys Asp Asp Asn Pro Asp Leu Asp Phe Glu Phe lie His Trp Leu 

35 40 45 

Val Ser Gly Val Lys Asp His Glu Pro Val Leu Asp Glu Thr lie Ser 

50 55 60 

Pro Tyr Leu Lys Asp Trp Thr lie Gly Arg Leu Leu Lys Thr Gly Arg 
65 70 75 80 

He Phe 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Met Lys Arg Asn He Leu Ala Val He Val Pro Ala Leu Leu Val 

15 10 15 

Ala Gly Thr Ala Asn Ala Ala Glu He Tyr Asn Lys Asp Gly Asn Lys 

20 25 30 

Val Asp Leu Tyr Gly Lys Ala Val Gly Leu His Xaa Phe Ser Lys Gly 

35 40 45 

Asn Gly Glu Asn Ser Tyr Gly Gly Asn Gly Asp Met Thr Tyr Ala Arg 

50 55 60 

Leu Gly Phe Lys Gly Glu Thr Gin Xaa Asn Ser Xaa Leu Asp Arg Leu 
65 70 75 80 

Trp Ser Val Gly He Xaa Leu Pro Gly 
85 

(2) INFORMATION FOR SEQ ID NO: 52: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met His Val Ala Glu Phe Lys Arg Met Asn Ala Asn He Asn Val Glu 

1 5 10 is 

Gly Arg Ser Ala Lys Leu Glu Gly Lys Ser Gin Leu Gin Gly Ala Gin 

20 25 30 

Val Xaa Ser Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Asp Lys He Val Leu Lys Gly Gly Asn Lys Leu Thr Gly Glu Val 

1 5 10 15 

Lys Val Glu Gly Ala Lys Asn Ala Ser He Thr Asn He Asp Lys His 

20 25 30 

Leu Leu Leu Ala Ser Asp Asn Pro Ser Lys Leu Gly 
35 40 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Pro Lys lie Xaa Lys Gin Ala He Xaa Thr He Leu Ala Thr Arg 

15 10 15 

He Glu Arg His Xaa Ala Gin Leu Leu Ala Asp Leu He Thr Lys He 

20 25 30 

Glu Thr Asp His Asn Lys lie Phe Asn Leu Val Lys Ser Ala Leu Xaa 

35 40 45 

Gly Lys Ala Asp Asp Leu Leu Asn Leu Xaa Lys Arg Leu Asn Gin Thr 

50 55 60 

Lys Lys Asp He Asp 
65 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Gly Phe Ala Arg Thr Val Ala Asp Arg Val He Phe Met Asp Arg 

1 5 10 15 

Gly Glu He Val Glu Gin Ala Ala Pro Asp Glu Phe Phe Ala His Pro 

20 25 30 

Lys Ser Glu Arg Thr Arg Ala Phe Leu Ser Gin Val He His 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
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(ii) MOLECULE TYPE: protein 



Met lie Xaa Xaa Ala Ser Leu Ala 

1 5 
Ala Asp Met He Val Ala Ser Asp 
20 

Lys Gin Ala Leu Xaa Asn Asn Xaa 

35 40 
Xaa Xaa Gin Ser Lys Glu Lys Xaa 

50 55 
Tyr His Pro Gly Leu 
65 



He Leu Asp Asp He Asp Xaa Leu 

10 15 
Xaa Tyr Ala Ser Phe Glu Gin Ala 
25 30 
Glu Ala His Leu Leu Tyr Gin Ser 
45 

Asp Glu Val Val Arg Phe Gly Lys 
60 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



Met Phe Val Glu Arg Val Asn Xaa Asp Gly Xaa He Leu Val Ser Glu 

15 10 15 

Xaa Xaa Tyr Ser Xaa Ala Pro Gly He Leu Thr Tyr Arg Ser Val Pro 

20 25 30 

Ala Tyr Gin Val Asn Asn Tyr Arg Tyr He His 
35 40 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Met Val Glu Gly Ala Ser He Asp Lys Ser Ala His Ser Asn Asp He 

15 10 15 

Thr Gly Val Met Ser Glu Met Glu Gly Phe Glu Lys Ala Phe Asp Asp 

20 25 30 

Ala He Gin Tyr Ala Lys Lys His Lys Asp Thr Leu Val Val Ala Thr 

35 40 45 

Ala Asp His Ser Thr Gly Gly Leu Thr He Gly Lys Asp Lys Gly Tyr 

50 55 60 

Glu Trp Asn Pro Gin Pro He Lys Ser Met Lys His Ser Gly Ser 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



Met Lys His Tyr Xaa Xaa Xaa Xaa He Asn Asn Arg Lys Met Ser Lys 

1 5 10 15 

Asp He Gin Arg Ala Leu Leu Ser Ser Glu Asp Val Leu Lys He Met 

20 25 30 

Asn Lys Leu Asp Ser Met Leu Val Asn Lys He Asn He Leu Tyr Ser 

35 40 45 

Asp Asn Glu Gly Asp Gly Thr Met Thr Phe Val Thr Gly Thr Glu Tyr 

50 55 60 

Phe Lys Trp Tyr Phe Pro Glu Tyr Pro Asp Xaa He Xaa Thr Leu Lys 
65 70 75 80 

He Val Asp Val Asn Lys His Thr Phe Thr Asp Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



Met Phe Val Arg Thr Ala Gly Cys Asp Tyr Arg Cys Ser Trp Cys Asp 

1 5 10 15 

Ser Ala Phe Thr Trp Asp Gly Ser Ala Lys Gly Asp lie Lys Thr His 
20 25 30 

Asp Arg 



(2) INFORMATION FOR SEQ ID NO: 61: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



Met Cys Lys Val Leu Lys lie Pro 

1 5 
Lys Arg Xaa Asp Asn Lys lie Thr 
20 

Ala Ala lie Asn lie Phe Asn Ser 

35 40 
Arg He Lys Asn His Leu Asn Asp 

50 55 
Lys He Gly Arg Leu 
65 



Arg Ser Thr Tyr Tyr Asp Ser He 

10 15 
Lys Asp Asp Ser His Val Glu Arg 
25 30 
Asn Arg Lys Val Phe Ser Thr Arg 
45 

Lys Gly Leu Thr Val Ser Gly Gin 
60 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62; 

Met Ser His Arg His Thr Leu lie Ser His Ser Phe Lys Phe Ser Arg 

15 10 15 

Leu Pro Gin Asp lie Asn Phe Phe Ser Trp Xaa Gin His Val Lys Asp 

20 25 30 

Ser Asp Lys Thr Asp Glu Leu Thr Tyr Ser Glu Ser Leu Xaa Pro Glu 

35 40 45 

Gly His Pro Thr His Pro Leu Thr Pro Thr Lys Leu Pro Xaa Thr Met 

50 55 60 

Glu Glu Val Arg Ala Tyr Ala Pro Glu Phe Asp Xaa Arg Asn Pro Phe 
65 70 75 80 

Ala Thr Leu 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Gly lie Leu Ser Xaa Gin Pro Tyr lie Phe Ser Ala Ser Xaa Lys 

1 5 10 15 

Glu Asn Xaa Xaa Met Phe Xaa Asp lie Glu Asn Asn Xaa lie Glu Glu 

20 25 30 

Val Leu Xaa Glu Val Gly Leu Leu Asp Lys Val Gin Ser Phe Thr Lys 

35 40 45 

Gly He Asn Thr He He Gly Glu Gly Gly Glu Met Leu Ser Gly Gly 

50 55 60 

Gin Met Arg Arg He Glu Leu Cys Arg Leu Leu Val Met Lys Pro Asp 
65 70 75 80 

Leu Val He Phe Asp Glu Pro Ala Thr Gly Leu Asp He Gin Thr Glu 
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85 



90 



95 



His Met lie Gin Asn Val Leu. Phe Gin His Phe Lys Asp Thr Thr Met 

100 105 110 

He Val He Ala His Arg Asp Xaa Thr He Arg His Leu Gin Arg Arg 
115 120 125 

Leu 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
{A} LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met He Ala Ala Phe Lys Val Ser Lys Asp Ser Lys Leu Ala Ser Leu 

15 10 15 

Leu He He Thr Ser Xaa Gin Ala Val Xaa lie Trp Asn Xaa Gly He 

20 25 30 

Lys Thr Ala Ala Ala Gin Asn He Xaa Ala Asn 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Ser Glu Thr He Phe Gly Lys He Leu Thr Gly Glu He Pro Ser 

15 10 15 

Phe Lys Val Tyr Glu Asp Asp Tyr Val Tyr Ala Phe Leu Asp He Ser 



35 



40 



20 



25 



30 
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Gin Val Thr Lys Gly His Thr Leu 

35 40 
Asn lie Phe Glu Thr Asp Glu Glu 

50 55 
Leu Pro Lys Val Ala Asn Ala lie 
65 70 
Leu Asn lie lie Gin Asn Asn Gly 
85 

Ser Tyr Ser Xaa Xaa Thr 
100 
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Leu lie Pro Lys Lys Ala Ser Ala - 
45 

Thr Met Lys His lie Gly Ala Ala 
60 

Lys Arg Ala Phe Xaa Pro Xaa Gly 

75 80 
Glu Phe Gly Asp Gin Ser Cys lie 
90 95 



(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Leu lie Leu Ala Val He Leu Ala Gly Ser Ala He Ser Val Val 

1 5 10 15 

Gly Gly He Ser Phe Leu Gly Leu He Ala Pro His He Ala Ser Gin 

20 25 30 

Leu He Gly Asn Lys Thr Leu His Val He He Met Ser Gly Leu He 

35 40 45 

Gly Ala He Leu Leu Thr Phe Gly Asp Gly Leu Ala Arg Gly He His 

50 55 60 

Pro Pro Leu Asp He Pro Val Gly Val He He Ala He He Gly Ala 
€5 70 75 80 

Pro Tyr Phe Leu He Leu Leu Arg Arg Met Lys 
85 90 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met Leu Ser Pro Leu Leu lie He Phe Phe He Val Leu Ser He Leu 

1 5 10 is 

Glu Glu Arg Lys Arg Thr Lys Lys Lys Gin Leu Glu Lys Glu Lys Ala 

20 25 30 

Asn Thr Leu Asn Gin Asn Thr Asn Asp Thr Glu Ser Ser Asn Gin Glu 

35 40 45 

Pro Ser Leu Gin Gin Thr Lys Glu Gin Lys Asp Asn Lys Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 140 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Met Asp Arg Met Tyr Glu Gin Asn Gin Xaa Pro His Asn Asn Glu Ala 

1 5 10 15 

Glu Gin Ser Val Leu Gly Ser Xaa He He Asp Pro Glu Leu He Asn 

20 25 30 

Thr Thr Gin Glu Val Xaa Leu Pro Glu Ser Phe Tyr Arg Gly Ala His 

35 40 45 

Gin His He Phe Arg Ala Met Met His Leu Asn Glu Asp Asn Lys Glu 

50 55 60 

He Asp Val Val Thr Leu Met Asp Gin Leu Ser Thr Glu Gly Thr Leu 
65 70 75 80 

Asn Glu Ala Gly Gly Pro Gin Tyr Leu Ala Glu Leu Ser Thr Asn Val 

85 90 95 

Pro Thr Thr Arg Asn Val Gin Tyr Tyr Thr Asp He Xaa Ser Xaa Ala 

100 105 110 

Cys He Lys Xaa Val Gly Leu He Gin Thr Ala Asp Ser He Ala Asn 

115 120 125 

Asp Gly Tyr Xaa Asp Glu Leu Gly Leu Asp Ala He 
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130 135 140 



(2) INFORMATION FOR SEQ ID NO: 69: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 118 amino acids 
<B) TYPE: amino acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

Met Asn Lys Pro Tyr lie Lys Met Ala Gin Leu Phe His Val Pro Thr 

15 10 15 

Xaa Thr lie Leu lie Arg His Leu Thr Pro Lys lie lie Pro Ma lie 

20 25 30 

lie Val Leu Met Val Val Asp Phe Gly Lys lie lie Leu Tyr He Ser 

35 40 45 

Ser Leu Ser Phe He Gly Leu Gly Ala Gin Pro Pro Thr Pro Glu Tip 

50 55 60 

Gly Ala Met Xaa Gin Gin Gly Arg Asp Phe He Ser Ser His Pro He 
65 70 75 80 

Met Leu He Ala Pro Ala Ser Val lie Ala He Thr He Leu He Phe 

85 90 95 

Asn Leu Thr Gly Asp Ala Leu Arg Asp Arg Leu Leu Lys Xaa Arg Gly 

100 105 110 

Gly Tyr Asp Glu Ser His 
115 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 70: 
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Met Thr Xaa Gly Lys Arg Leu Tyr Ser Asn Gin Tyr Lys Asn Ser Pro 

1 5 10 15 

Ser Xaa Ser Leu Thr Xaa Xaa His Lys lie Lys lie Asn Glu Gly Asp 
20 25 30 

Glu Xaa Phe 

35 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Asn Gin Ser Val Xaa Leu Leu Lys His Leu Thr Asp Val Asn Gly 

1 5 10 15 

He Ala Gly Tyr Glu Met Gin Val Xaa Glu Ala Met Arg Xaa Tyr He 

20 25 30 

Glu Leu Ser Gly Xaa Asn He Gly Asp Asn Leu Gly Gly Asn Leu Gly 

35 40 45 

Xaa Glu Lys Cys 
50 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



Met Arg Phe Val Gly Met Asn Leu Xaa He Xaa He Asn Asn Thr Thr 

1 5 io 15 

Val He Ala Thr Gly Tyr Thr Met Leu Asn Thr Gly He Asp He Glu 
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20 

lie Thr Ala Phe Lys 
35 

Lys Thr Asn Thr Asn 
50 

Lys Tyr Ser Glu Thr 
65 

Ala Ser Ala Thr Met 
85 

Pro Val Glu Ala lie 
100 

Leu Asp Phe Asn 
115 



25 

Pro Trp .Leu Ala lie lie 
40 

Ala Leu Tyr Asp Thr Leu 
55 

Ala Thr lie Ser Pro His 
70 75 
Met Pro Asn Lys Tyr Phe 
90 

Val Val Asn Lys Tyr Val 
105 
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30 

Lys Leu lie Lys Met 
45 

Gly Thr lie Leu Trp 
60 

Ala Val Val Arg His 
80 

Pro Asn Glu Pro Asn 
95 

Pro Met Phe Arg Ser 
110 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i)* SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met lie Phe Ala Leu Gly Val Xaa lie Gly Leu Ala Gly Gly Asp Gly 

1 5 10 15 

Val Ala Ala lie Ala Ala Phe Val Gly Tyr He He Met Asn Lys Thr 

20 25 30 

Met Gly Asp Phe Leu Gin Val Thr Pro Lys Asn Val Xaa Glu Pro Thr 

35 40 45 

Ser Gly Xaa Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Met Arg Leu Ser Thr Phe Thr Phe Met Thr Lys Arg Met Asn Leu Pro 

15 10 15 

Glu Val Asp Leu Glu Lys Val lie Ala Pro Trp Lys Glu He Leu Ala 

20 25 30 

Thr Asn Arg Glu He Pro Asn Leu Asp Asn Gin Met Cys He Gly Gly 

35 40 45 

Leu Asp Phe Ala Asn lie Arg Asp Phe Xaa Ser Val Gly Leu Leu Phe 

50 55 60 

Arg Lys Asn Asp Asp Tyr He Trp Leu Gly His Ser Xaa Val Arg Gin 
65 70 75 80 

Gly Phe Leu Asp Asp Val Lys Leu Glu Pro Pro He Lys Glu Trp Gly 

85 90 95 

Lys Met Gly Leu Leu Thr Xaa Val Asp Asp Asp Val He Glu He Glu 

100 105 HO 

Tyr He Val Gly Xaa Gly Phe Xaa Xaa Gin Lys Ala Arg Glu Lys Tyr 

115 120 125 

Gly He Glu Lys Val He Gly 
130 135 



{2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



Met Asp Lys Asp Phe Gin Glu Lys Leu Lys Lys Ala Phe He Asp He 

15 10 15 

Ala Lys Ser Lys Glu Gly His Lys He He Ser Glu Val Tyr Ser His 

20 25 30 

Glu Gly Tyr Thr Glu Thr Lys Asp Ser Asn Phe Asp He Val Arg Glu 

35 40 45 

Tyr Glu Lys Leu Val Lys Asp Met Lys 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Met Ser Gin lie Glu Phe Lys Asn Val Ser Lys Val Tyr Pro Asn Gly 

15 10 15 

His Val Gly Leu Lys Asn lie Asn Leu Asn lie Glu Lys Gly Glu Phe 

20 25 30 

Ala Val lie Val Gly Leu Ser Gly Ala Gly Lys Ser Thr Leu Leu Arg 

35 40 45 

Ser Cys Lys Ser Xaa Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Met Tyr Gly Trp Lys Pro Pro Pro Gly Tyr Asp Ser Asp Gin Phe Asp 

15 10 15 

Gin Phe Leu Val Gin Glu Lys Ser lie Leu Val Ala Pro Gly Lys Pro 

20 25 30 

Phe Gly Glu Asn Gly Asn Arg Tyr Val Arg lie Ser Leu Ala Leu Asp 

35 40 45 

Asp Gin Lys Leu Asp Glu Ala Ala He Arg Leu Thr Glu Leu Ala Tyr 

50 55 50 

Leu Tyr Glu 
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(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



Met Asn Val Lys Gly Ser Tyr He Val Tyr Glu Pro Phe Val His Pro 

15 10 is 

Glu Thr Asp Lys Tyr Arg Leu Val Tyr Gin Gly Gly Asn Tyr Asn Tyr 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 79: 

Met Gin Leu Leu Gin Lys Asn Pro Met Ala He Gly Val Trp Ala Gly 

1 5 10 15 

Leu Cys Gin Met He Lys Pro Thr Val He Asp Trp Asp He Ser Glu 

20 25 30 

Tyr Thr Pro Lys Pro Ala Trp Met Gin Ala Thr Lys Ala Arg Ala Tyr 

35 40 45 

Val Pro Arg He Leu Val Glu Lys Leu Leu He Xaa He Asp Asp Met 

50 55 60 

Leu Glu Gly He Glu He Tyr Asp Xaa Xaa Glu Ser Arg 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 80: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Met Thr Gin His Glu Pro Asp Leu Pro Glu Ala Glu Lys Leu Tyx Leu 

15 10 15 

Thr Leu His Leu Leu Ser Thr Ser Val Gin Trp Thr Asp Leu Gin Glu 

20 25 30 

Ser Asp Asn lie Ser Asn Leu Thr Met Ala He Ala Gin Met He His 

35 40 45 

His Cys Glu Gin He Thr Phe lie Asn He Glu Asp Lys Glu Lys Leu 

50 55 60 

Ser Gin Gin Leu Leu Leu His Leu Thr Pro Ala Phe Tyr Arg He Lys 
65 70 75 80 

Tyr Asn Leu Thr Asp Pro 
85 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Met Thr Lys Lys Met Gly Leu Leu Val Met Ala Tyr Gly Thr Pro Tyr 

1 5 10 15 

Lys Glu Ser Asp He Glu Pro Tyr Tyr Thr Asp lie Arg His Gly Lys 

20 25 30 

Arg Pro Ser Glu Glu Glu Leu Gin Asp Xaa Xaa Asp Arg Tyr Xaa Phe 

35 40 45 

He Gly Gly Lys Leu He Asp Thr Val Asp Leu Glu Gly Gly Ser Gly 
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60 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Met Thr Leu Asp Lys Leu Asp Leu Pro lie Val Gly Lys Pro Thr Ser 

15 10 is 

Tyr Lys Thr Leu Pro Asn Arg Tyr Lys Asp Val Pro Glu lie Gly Gin 

20 25 30 

Pro Met Glu Pro Asn Val Glu Ala Val Lys Lys Leu Lys Pro Thr His 

35 40 45 

Val Leu Ser Val Ser Thr lie Lys Asp Glu Met Gin Pro Phe Tyr Lys 

50 55 60 

Gin Leu Asn Met Lys Gly Tyr Phe Tyr Asp Phe Asp Ser Leu Lys Gly 
65 70 75 80 

Met Gin Lys Ser He Thr Gin Leu Gly Asp Gin Phe Asn Arg Lys Ala 

85 90 95 

Gin Ala Lys Glu Leu Xaa Asp His Leu Xaa Ser Val Lys Gin Lys He 

100 105 no 

Xaa Asn Lys Ala Gly Lys Gin Lys Lys His Pro Xaa Val Leu He Leu 

H5 120 125 

Met Gly Val Pro Gly Arg Tyr Leu Val Ala Thr Asp Lys Ser Tyr He 
130 135 i4 0 

Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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Thr Gin Phe Ala Leu 
65 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Met Met Met Lys Phe Xaa His Asn Leu Asp Met Xaa Leu Glu Leu Phe 

15 10 15 

Asn Lys Ala Xaa Xaa Asn Gin Lys Asn Lys Xaa Trp Asn Gly Leu Ala 

20 25 30 

Asn Ala Cys Arg Asn Pro Tyr Val Arg His lie Ala His Pro Thr Gly 

35 40 45 

Arg lie lie Gly Arg Arg Asp Gly Tyr Lys Pro Asn lie Glu Gin Leu 

50 55 60 

Met Ala Leu Ala Glu Glu Thr Asn Thr Val Leu Glu lie Asn Ala Asn 
65 70 75 80 

Pro His Arg Leu Asp Leu Glu Arg 
85 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Met Ser Lys Thr Ala lie He Phe Pro Gly Gin Gly Ala Gin Lys Val 

1 5 10 is 

Gly Met Ala Gin Asp Leu Phe Asn Asn Asn Asp Gin Ala Thr Glu He 

20 25 30 

Leu Thr Ser Ala Ala Lys Thr Leu Asp Phe Asp He Leu Glu Thr Met 

35 40 45 

Phe Thr Asp Glu Glu Gly Lys Leu Gly Glu Thr Glu Asn Thr Gin Pro 

50 55 60 

Ala Leu Leu Thr His Trp Ser Gly He He Ser Ser Ala Lys Asn Leu 
65 70 75 80 

Gly Ser Leu He Leu Leu Trp Gly He Ser Leu Gly Trp Asn Phe Gin 
85 90 95 
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Ser Leu Val Gly Gly Xaa Arg Xaa Xaa Ser lie Xaa Arg Met Gin Xaa 

100 105 110 

Xaa Leu Val Glu Thr Xaa Xaa Xaa 
115 120 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Ala Arg Gly Leu Lys Phe Lys lie Met Phe Asn Phe Gly Leu Pro 

15 10 15 

Leu Val lie Ala Leu Ser His Ala Tyr Phe Thr Ser Leu Ala Tyr Met 

20 25 30 

Lys Leu Met Gly Thr Xaa Asn Gin lie Pro Val Phe He Val Met Gly 

35 40 45 

Leu Tyr He Cys Met Tyr Ala Val Phe Ala Val Thr Ala Tyr Asn His 

50 55 60 

Ser Lys Arg Thr He Arg His Ser He 
65 70 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Leu Lys Ser Lys Met Lys Leu Asn Tyr Arg Gly Asp Asn Met Asn 

15 10 15 

Phe Glu Glu Lys Leu Ser Gin Met Tyr Asn Glu He Ala Asn Glu He 
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20 25 30 

Ser Gly Met He Pro Val Glu Trp Glu Gin Val Phe Thr He Ala Tyr 

35 40 45 

Val Thr Asp Gin Ala Gly Glu Val He Phe Asn Tyr Thr Lys Pro Gly 

50 55 60 

Ser Asp Glu Leu Asn Tyr Tyr Thr Tyr He Pro Arg Glu Tyr Tyr Val 
65 70 75 80 

Ser Glu Lys Val Phe Tyr Asp Leu Trp Thr Asp Leu Tyr Arg Leu Xaa 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Met He Pro Val Xaa Trp Xaa Lys Val Tyr Ala Met Ala Tyr Val Asn 

15 10 15 

Glu Xaa Ser Gly Glu Val Phe Tyr Asn Tyr Thr Glu Pro Arg Ser Asp 

20 25 30 

Glu Leu Phe Tyr Tyr Thr Ser Val Leu Asn Lys Tyr Asn He Ser Arg 

35 40 45 

Ser Glu Phe Met Asp Ser Val Tyr Glu Leu Tyr Lys Xaa Phe Asp Lys 

50 55 60 

Leu Arg Asp Leu Phe Lys Glu Glu Gly Leu Glu Pro Trp Thr Ser Cys 
65 70 75 80 

Glu Phe Asp Phe Thr Arg Asp Gly Lys Leu Asn 
85 90 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Val Gin His Leu Leu Asn Glu Gly Leu Ser Arg Phe Glu Asp Lys 

1 5 10 15 

Phe Val Gly Val Tyr Leu Glu Val Asp Asn Xaa Asn Xaa Glu Ala Val 

20 25 30 

Ala Tyr Tyr Lys Glu Gin Gly Phe Thr lie Leu Arg Ser Tyr Gin Pro 

35 40 45 

Glu Met Tyr Gly Glu Lys Leu Asp Leu Ala Leu Met Tyr Lys Ala Phe 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Asp Gin His He Asn He Leu Gly Ala Ser Ser Asp His Leu Met 

1 5 10 15 

Leu Asp Leu Asn Gly Gin Gly His Tyr Gin Val Gly Asp His He Ser 

20 25 30 

Phe Ser Leu Asn Tyr Glu Ala Leu Ser His Ser Met Tyr Met Lys Asn 

35 40 45 

Leu His Lys Val Tyr He Asp Asp Ser Lys He Asp Thr Leu Leu Gin 

50 55 60 

Asn Phe Asp Val Lys Ser Pro Ala Met Val Asn Gin Tyr 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 90: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 43 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Leu Tyr Gin Asp Ala lie Thr Leu Glu Ala Glu lie Leu Glu lie 

15 10 15 

Lys Pro Arg Val Asn Thr His Thr His Glu Ser Phe Leu Gin Ala lie 

20 25 30 

Val Asp lie Gly Tyr Xaa Arg Tyr Thr Ser Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Met Ser lie His Thr Glu Leu Ser Thr lie His Xaa lie Asn Glu Val 

15 10 15 

Ala Glu Val Leu Gly Lys Lys His Lys lie Leu Leu Met Val Asp Trp 

20 25 30 

Lys Asp Val Arg Glu Gly Val Leu Thr Tyr Asp Val Leu Asp Tyr Xaa 

35 40 45 

Lys Glu lie He His Leu Xaa Asn He His Phe Val Gly Leu Ala Phe 

50 55 60 

Asn Phe Met Cys Phe Lys Ser Asp Ala Pro Ser Gly Xaa Asp He Phe 
65 70 75 80 

Met He Asn Arg Phe Val Ser Ala Val Xaa Arg Glu He Gly Tyr Arg 

85 90 95 

Xaa Xaa He He Ser Gly Trp 
100 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 71 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



Met Gly Gly lie Gin He He Pro Glu Thr Asn Gin Val He Ser Pro 

15 10 15 

Arg Phe Gly Thr Leu Thr Asn Met He Ala He Gly Gin Met Thr Asn 

20 25 30 

Gly Val Asn Lys Leu Arg Asn Gly Val Lys Met He Val Glu Gin Val 

35 40 45 

Ala His Thr Val Ser Gin Leu Tyr Asp Ala Leu Glu Ser Asn Glu Gin 

50 55 60 

Gin Gin Arg Ser Tyr Asn Gin 
65 70 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



Met Asp Asp Asn Asp He Xaa Arg Leu Lys Asn He Ser Asn Arg Leu 

1 5 io is 

Arg Ser Lys He He Val Lys Asp Gin Pro Glu Xaa Met Val Ser Thr 

20 25 30 

Leu Gin Phe Xaa Val He Ala Pro Gly Val 
35 40 



(2) INFORMATION FOR SEQ ID NO: 94: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/DS97/02318 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Met Lys Asp Val Met Ala lie Gly Asp Asn Leu Asn Asp Leu Ser Met 

15 10 15 

Leu Glu Lys Val Gly Tyr Pro Val Ala Met Glu Asn Gly Ala Glu Glu 

20 25 30 

Val Lys Lys lie Ala Lys Tyr Val Thr Asp Thr Asn Glu Asn Ser Gly 

35 40 45 

Val Gly Lys Ala lie Met Lys Leu Leu Arg Glu Gin Gin Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Met Lys Asp Val Met Pro He Gly Asp Asn Leu Asn Asp Leu Ser Met 

15 10 15 

Leu Glu Lys Val Gly Tyr Pro Val Ala Met Glu Asn Gly Ala Glu Glu 

20 25 30 

Val Lys Lys He Ala Lys Tyr Val Thr Asp Thr Asn Glu Asn Ser Gly 

35 40 45 

Val Gly Lys Ala He Met Lys Leu Leu Arg Glu Gin Gin Val 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Met Gly Arg Gin Asp lie Ala Val Ala Lys Gly Ala Asp Arg Pro Leu 

15 10 15 

lie Lys Pro Ala Ala Phe Ala Ser Glu lie His Gly Glu Ser Gly Leu 

20 25 30 

Asp Gly Pro Lys Leu Pro Ser Thr Pro Ser Arg Gin Ala Val Ala Met 
35 40 45 

Pro Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Met Ser Glu Val Ser Ala lie lie Ala Glu His Cys Leu Phe Asp Leu 

15 10 15 

Asp Ala Pro lie Met Arg Leu Ala Ala Pro Asp Val Pro Ser Met Pro 

20 25 30 

Phe Ser Pro Val 
35 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Met Leu Gly lie Asp lie Asn Thr Gin Thr Arg Ala Lys Leu lie Gin 

15 10 15 

Asn lie Leu Asn Gin Phe Asn Leu Lys Leu Val Asp Xaa Glu He Thr 

20 25 30 

Ala Asp Val Lys Asn Glu Ser Phe Ala Gin Ser Lys His Asn Leu He 

35 40 45 

Gin Gly He Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Met Gly Phe Gly He Pro Ser Ser He Gly Ala Lys Leu Ala Asn Pro 

15 10 15 

Asp Lys Thr Val Val Cys Phe Val Gly Asp Gly Gly Phe Gin Met Pro 

20 25 30 

Asn Gin Glu Met Ala Leu Leu Pro Glu Tyr Gly Leu Asp Val Lys He 

35 40 45 

Val Leu He Asn Asn Gly Thr Leu Gly Met Val Lys Gin Trp Gin Asp 

50 55 60 

Lys Phe Phe Asn Gin Arg Phe Ser His Ser Val Phe Asn Gly Gin Pro 
65 70 75 80 

Asp Phe Met Lys Met Ala Glu Ala Tyr Gly Val Lys Gly Phe Leu He 

85 90 95 

Asp Lys Pro Glu Gin Leu Glu Glu Gin Leu Asp Ala Ala Phe Ala Tyr 

100 105 HO 

Gin Gly Pro Val Leu He Glu Val Arg He Ser Pro Thr Glu Ala Val 

115 120 125 

Thr Pro Met Val Pro Ser Gly Lys Ser Asn His Glu Met Glu Gly Leu 
130 135 140 
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(2) INFORMATION FOR SEQ ID NO: 100: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



Met Lys Ala He Ser Glu Ser Asp Thr Val He Leu Ala Tyr Gly Ala 

15 10 15 

Tyr Ala Lys Arg Pro Val Val He Asp Arg Val Glu Gin Val Met Glu 

20 25 30 

Met Leu Lys Pro His Lys Lys Lys Val Lys Lys Leu He Asn Pro Val 

35 40 45 

Thr Asn Glu He Met His Pro Leu Asn Pro Lys Ala Arg Gin Lys Trp 

50 55 60 

He Leu Lys Ser 
65 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



Met Asn He Asn Ser Ala Gin Thr Glu Glu Met Gly Glu Thr Tyr Leu 

15 10 15 

Glu Val Gin Arg He Tyr Phe Leu Lys Asp Phe Gin Gly Gly Gly Arg 

20 25 30 

Gly Ser Gin Leu He Glu Leu Ala Glu Lys He Ala Pro Arg Thr 
35 40 • 45 



(2) INFORMATION FOR SEQ ID NO: 102: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Met Ser Leu He Met Gly Leu Leu Ser Glu Lys Axg Leu He Phe Lys 

15 10 15 

Phe Ser He Gly Val Leu Val Trp Gly He Gly Leu Ser He He Asn 

20 25 30 

Val Phe Pro Ser Val Ala He Leu Tyr He Gly Ala Thr Leu Val Gly 

35 40 45 

Leu Gly Gin Ser He Glu Gly Leu Thr Arg Ser Val Xaa Xaa Gin He 

50 55 60 

Lys Asn Ala Lys Xaa Phe Gly Trp Gly Lys Gly He 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Met Tyr Leu Leu Lys His Xaa Pro Glu Thr Lys Ala Xaa Pro He Lys 

15 10 15 

Gly Met Lys Ala Glu Ala Lys Lys Cys Asp Val He Gly Leu Asp He 

20 25 30 

Xaa Val Gly Thr Met Leu Ser Leu Asn Val He He Thr Gin Thr Ser 

35 40 45 

His Val Gly Leu Val Ser Gin Arg 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 104: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Met Ser Phe Asp Pro Glu Met Val Lys Glu Ala lie Gin Glu Tyr Tyr 

1 5 10 is 

Pro Asn Phe Thr Leu Asp Tyr Asp Val Asp Pro lie Arg Gin Gly lie 

20 25 30 

Ala Asn Ser Trp Pro Asp Ser lie Asp Thr Ser Cys Ser Arg Gly Glu 

35 40 45 

Trp Gly Phe Asp Pro Lys Tyr Asp Leu Ala Ser Met Thr Lys Leu Met 

50 55 60 

Leu Glu Ala lie Glu Gin Lys Asp Thr Val Lys Asn Asn Asn 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Met Ala Val Lys Val Ala lie Asn Gly Phe Gly Arg He Gly Arg Leu 

1 5 10 15 

Ala Phe Arg Arg He Gin Glu Val Glu Gly Leu Glu Val Val Ala Val 

20 25 30 

Asn Asp Leu Thr Asp Asp Asp Met Leu Ala His Leu Leu Lys Tyr Asp 

35 40 45 

Thr Met Gin Gly Arg Phe Thr Gly Glu Val Glu Val Val Asp Gly Gly 
50 55 60 
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Phe Arg Val Asn Gly Lys Glu Val 
65 70 
Ser Lys Leu Pro Trp Lys Asp Leu 
85 

Thr Gly Phe Tyr Thr Asp Lys Asp 
100 

Gly Ala Lys Lys Val Leu lie Ser 
115 120 
Thr lie Arg lie Gin His Tyr Thr 
130 135 



PCT/US97/02318 

Lys Ser Phe Ser Glu Pro Asp Ala 

75 80 
Asn lie Asp Val Val Leu Glu Cys 

90 95 
Lys Ala Gin Ala His lie Glu Ala 
105 HO 
Ala Xaa Ala Thr Gly Asp Leu Lys 
125 

Thr Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 106: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Met Arg Thr lie Ser lie Asn Val Phe Leu Met Leu Val Gin Phe Leu 

1 5 10 15 

Xaa Asn lie Gin Val Thr Tyr Thr His Ala Gin Leu Lys Gly Gly Asn 

20 25 30 

Lys Glu Pro Tyr Arg He Gly Leu Lys Leu Ser Asn Gly Gly Trp Val 

35 40 45 

Tyr val Gin Gly Leu Thr His Xaa Glu Val Asn Glu His Asp Glu Phe 

50 55 60 

Leu He Ala Gly Phe Xaa Tyr Xaa Gly Xaa Leu Ala Ala Ala Leu Xaa 
65 70 75 80 

He Ser Gly Ala Thr Xaa Xaa Phe He Glu Glu Ala Ser Thr Leu Thr 

85 90 95 

Gly Arg Lys Asp Met Phe Phe Ser Thr He Ser Thr Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



Met lie Lys Glu Val Ala Val Lys Arg 
1 5 



Tyr Asn 
10 



Arg He Gly Ala Glu 
15 



Gly Met Thr Ser Glu Ala Val Asp Gly Arg Ser Asn Ala Tyr Glu Leu 

20 25 30 

Asn Asp Phe Lys Glu Tyr Glu Ala He He Asp Asn Tyr Phe Asn Ala 

35 40 45 

Arg Thr Arg Thr Lys Lys Gly Arg Ala Val Phe Phe 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Met Arg Val Xaa He He Xaa Met Gly Pro Ala Gly Val Ser Val Leu 

1 5 10 15 

Arg Glu Leu Val Lys Xaa Pro Lys Phe Asn Gin Leu Asp He Asp Leu 

20 25 30 

Tyr Asp Asp Lys Val Xaa Met Gly Gin Gly Val Pro Phe Gin Asn Asp 

35 40 45 

Ser Ser Glu Leu Leu He Asn Met Pro Ser Xaa Xaa Met Ser Leu Asn 

50 55 60 

Leu Asp Asp Glu Thr Glu Phe Trp Lys Trp Tyr Xaa Gin Gin Xaa Asp 
65 70 75 80 

Phe Asn Phe Asp Glu Pro Ala Tyr Leu Pro Arg Phe Val Phe Gly His 

85 90 95 

Tyr Met Lys Ser Tyr Leu Ser Met Phe Thr Lys Lys Tyr Pro Asn He 

100 105 110 

Ser Thr Asn Tyr Asn Lys Val Gin Glu He Tyr Thr Asn Ser Asn He 



50 



55 



60 
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115 



120 



125 



Asp Glu Xaa Asn Leu Thr Tyr Tyr lie Cys Thr Thr Asn Ser Glu Gin 

130 135 140 

Ser Trp Gin Ala Tyr Asp Tyr Val Phe Leu Thr Cys Gly Thr Phe Ala 
145 150 155 160 

Tyr His Asp Pro Tyr Asn Leu Lys Gly Lys Lys Gly Tyr lie Ala Thr 

165 170 175 

Pro Tyr Pro Thr Tyr Asn Thr Leu Asp Glu Val Asn Glu Leu Asp Asp 

180 185 190 

He Ala He He Gly Thr Gly Leu Ala Ser Leu Asp Val Val Arg Tyr 

195 200 205 

Val Ala Ala His His Pro Lys Leu Pro He Thr Met Thr Ser Arg Ser 

210 215 220 

Ala His Leu Pro Ser Val Arig Gly Thr Met He Asp Val Thr Phe Lys 
225 230 235 240 

Tyr Leu Thr Lys Asp Lys Leu Asn Asp lie Lys Lys His His Phe Gly 

245 250 255 

Asn Ala Pro Leu Asp Thr He Xaa Ser Phe He Phe Lys Arg Met Cys 

260 265 270 

Leu Asn Met Thr Leu He Phe Lys Lys Leu Val His Arg Arg Thr Arg 

275 280 285 

Lys Ser Tyr Cys Arg Phe Glu Val Leu He 
290 295 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Met He Glu His Leu Xaa Glu Asn Leu Asn Trp He Trp Asn Ser Leu 

1 5 10 15 

Ser He Glu Asp Gin His Gin Phe Asn Pro Lys He Xaa Lys Asp Asp 

20 25 30 

Ser He Lys Ser Lys Ser Asn Xaa Ser Lys Xaa Cys 



35 



40 
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(2) INFORMATION FOR SEQ ID NO: 110: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



Met Ser Tyr Ser His Lys Glu Leu lie Glu Ala His Phe Gly Leu Gin 

15 10 15 

Asn His Val Phe Thr Leu His Glu Tyr Val Lys Glu Ala Gly Glu Val 

20 25 30 

lie Asp Pro Tyr Gly Gly Thr Lys Glu Met Tyr Val His Thr Tyr Glu 

35 40 45 

Glu Leu Val Ser Leu lie Leu Lys Leu Lys Asp lie lie Cys 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 111: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



,<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



Met Pro Asn His Gin Phe Glu Ser Arg Gly lie Phe Ala Val Asn Asn 

15 10 15 

Gin Gly Val Ser Asn Tyr Val Glu Asp Leu Val Glu Glu His His Leu 

20 25 30 

Ala Glu Xaa Asp Leu lie Ala Thr lie Tyr 
35 40 



(2) INFORMATION FOR SEQ ID NO: 112: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



PCT/US97/02318 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Met Tyr Met Arg Lys lie lie Gly Leu Leu Leu Val Ser Thr Leu Ala 

15 10 15 

Leu Thr Ala Cys Gly Glu Lys Glu Asn Pro Xaa Lys Glu Glu Asn Lys 

20 25 30 

Xaa Ser His Thr His Gin His Xaa Xaa Xaa Xaa His Gin Xaa Arg Asn 

35 40 45 

Lys Glu Asn Xaa Lys Lys Ala Asp Asp Lys lie His Leu He He Ala 

50 55 60 

Tyr Lys He He Gin Thr He Lys Thr Ser Gin Asn He Asn Leu He 
65 70 75 80 

He Ser Asp Pro Ser Tyr Asn Thr Pro Gly Asn Tyr 
85 90 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Met Asn Glu His Tyr Val Val Glu Lys He Thr Tyr Asp Arg Ala Asn 

15 10 15 

Ala Phe Lys Leu Asn Gin Glu Leu Lys Asn Tyr Gly Phe Glu Thr Glu 

20 25 30 

Glu Thr Arg Gin Gly Ala Leu Thr Leu Ser Pro Ala Leu Lys Asp Leu 

35 40 45 

Lys Glu Met Phe Leu Asp Gly Lys He He Phe Asn Asn Asn Pro Leu 

50 55 60 

Met Lys Trp Tyr He Asn Asn Val Gin Leu Lys Leu Asp Arg Asn Gly 
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65 70 75 80 

Asn Trp Leu Pro Ser Lys Gin. Ser Arg Tyr Arg Xaa He Asp Gly Phe 

85 90 95 

Ala Ala Phe Ser Asn Thr Tyr Thr Asp He Met Xaa Lys Xaa Val Ser 

100 105 HO 

Asp Lys Gly Glu Gly Xaa Xaa Xaa Phe He Ser Xaa Lys Asp Xaa Met 
115 120 125 

Arg 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Met Ser He Met He Cys Lys Glu Gly Leu Phe Met He Met Gly Asn 

15 10 15 

Leu Arg Phe Gin Gin Glu Tyr Phe Arg He Tyr Lys Asn Asn Thr Glu 

20 25 30 

Ser Thr Thr His Arg Asn Ala Tyr Trp Val Lys Leu Ala Lys Asn Val 

35 40 45 

Glu Ala Thr Lys Met Met Tyr Ala Leu Ser Thr He Val Gin Gin His 

50 55 60 

Ala Ser He Arg His Phe Phe Asp Val Thr Thr Asp Asp Asn Leu Thr 
65 70 75 80 

Met He Leu His Glu Phe lieu Pro Phe He Glu He Lys Gin Val Pro 

85 90 95 

Ser Ser Ser Ala Asn Tyr Asp Leu Glu Ala Phe Phe Lys Gin Glu Leu 

100 105 110 

Ser Thr Tyr His Phe Asn Asp Ser Pro Leu Phe Lys Val Lys Leu Phe 

115 120 125 

Gin Phe Ala Asp Ala Ala Tyr He Leu Leu Asp Phe His Val Ser He 

130 135 140 

Phe Asp Asp Ser Gin He Asp He Phe Leu Asp Asp Leu Cys Asn Ala 
145 150 155 160 

Tyr Arg Gly Asn Thr Val He Asn Tyr Thr Arg His His Ala His He 
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165 170 175 

Lys 



<2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



Met Glu Ala Thr lie Glu Val Val Thr Asp His Lys Val He Asp Trp 

15 10 15 

Leu Trp Glu Thr Gin Asp Lys Ser Phe Phe Ser Ser Xaa Glu Asp Pro 

20 25 30 

Glu Leu Cys Val Leu Lys Val Thr Pro Gin Ser Val Lys Leu Met Asn 

35 40 45 

Asp Lys Ser Leu Asp Thr Pro He Lys He Asp Leu 

50 55 60 

(2) INFORMATION FOR SEQ ID NO: 116: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



Met Asp Leu Tyr Thr Ala Lys Gin Phe Ala Tyr Gly Lys Ala Lys Asp 

1 5 10 15 

Asn Lys Val Ala Ala Val Trp Lys Leu Asp Leu Lys Gly His Lys Pro 

20 25 30 

Ala Leu Glu Gly Pro Glu Glu Thr Leu 
35 40 
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(2) INFORMATION FOR SEQ ID NO; 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 117: 

Met Ser Phe Tyr Val Val Leu lie lie lie lie Val Ala Leu lie Gly 

15 10 15 

lie Leu Val Leu Asn Gin Arg Tyr Ser Asn Ser Lys lie Asp Thr Glu 

20 25 30 

Val Tyr Ala Arg Lys Gin Leu lie Lys Lys Asn Lys Ala Leu Ser Ala 

35 40 45 

Glu Asn Ala Glu Leu Arg Ser Gin Met Leu Ser Xaa Asn Asn Asp Val 

50 55 60 

Gly Thr Pro Met Gin Tyr Lys Lys Cys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Met Ala Leu Thr Ala Gly lie Val Gly Leu Pro Asn Val Gly Lys Ser 

15 10 15 

Thr Leu Phe Asn Ala lie Thr Lys Ala Gly Ala Leu Gly Xaa Asn Tyr 

20 25 30 

Pro Xaa Ala Thr Xaa Asp Pro Asn Val Gly lie Val Glu Val Xaa Asp 

35 40 45 

Ala Arg Leu Leu Lys Leu Glu Glu Met Val Gin Pro Lys Lys Thr Leu 

308 



WO 97/30070 PCT/US97/02318 

50 55 60 

Pro Thr Thr Phe Xaa Phe Thr Asp Phe Gly Gly lie Gly Lys Gly Ala 

65 70 75 80 

Ser 



(2) INFORMATION FOR SEQ ID NO:119: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



Met Ala Val Phe Lys Val Phe Tyr 

1 5 
Val Arg Glu Asn Thr Gin Ser Leu 
20 

Gin Val Arg Arg Tyr Leu Lys Asp 

35 40 
Thr Lys Leu Glu Gly Ala His Leu 

50 55 
His Phe Asn Val Glu He Ala Lys 
65 70 



Gin His Asn Arg Asp Glu Val He 

10 15 
Tyr Val Glu Ala Gin Thr Glu Glu 
25 30 
Arg Asn Phe Asn He Glu Phe He 
45 

Asp Tyr Glu Lys Glu Asn Ser Glu 
60 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



Met Gly Leu Leu Asp Ala Val Ser Lys Gly He He Ser Met Glu Asp 
15 10 15 
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He Ala Met He Ala Leu Leu He Gly Gly Leu Val Gly He He Gin _ 

20 25 30 

His Asn Gly Gly He Glu Trp Leu Leu Gin Phe Val Arg Ser Lys Val 

35 40 45 

Lys Ser Lys Arg Gly Ala Glu Leu Gly He Ala Ser Leu Val Ser Val 

50 55 60 

Ala Asp He Ala Thr Ala Asn Asn Thr He Ser He Ser Met Ser Gly 
€5 70 75 80 

Pro Leu Ala Lys His He Ala Asp Glu Tyr Asp Val Asp Ser Arg Lys 

85 90 95 

Ser Ala Ser He Leu Asp He Phe Gly Arg Leu Leu Pro Arg Xaa Xaa 
100 105 HO 

Thr He 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Met Leu He Asn Tyr His Leu Asp Leu Gly Asn Gly Leu Tyr Leu Phe 

15 10 15 

He Asp Arg Lys Asn Lys Val Gly Glu Ala He Lys Asn Ala Lys Asn 

20 25 30 

Tyr Leu Asp Arg Trp Lys Pro Lys Tyr Glu Lys Lys He He Ala Gin 

35 40 45 

He Asp He Lys Val Lys Glu Asn Arg Leu Met Asp Leu Asp Asp His 

50 55 60 

Asp Asn Gin Tyr Glu Phe Asn Ser Phe Xaa Xaa Glu Asn Glu Glu Thr 
65 70 75 80 

He Tyr Asn Glu Leu Asp Lys Xaa He Asn Asn Asn Thr Lys Ser Arg 

85 90 95 

Gly Asn He Asp Gly Leu He He Glu He Met He Arg Glu Tyr Tyr 
100 105 HO 

Leu Gly Asn 
115 
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(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Met Thr Met lie Glu Tyr Asn His Gin Arg He Asp Lys He He Tyr 

15 10 15 

Phe His Ser Pro Gin Ala Glu Asn Lys Lys Ser Ser Val Ala He He 

20 25 30 

Met Asn Pro Thr Thr Gly Asn His Phe Gin Ala Phe Tyr He Met He 

35 40 45 

Asn Ala He Lys Tyr Pro Tyr Pro Asp Ser Asn Lys Lys Phe Gin Met 

50 55 60 

He Asn Asp Cys Ala Glu- Lys Phe Asp He Pro He Leu Gly He Asp 
65 70 75 80 

Val Gin Pro Pro Gin Ala Phe His Asp Leu Ser Val He Leu 
85 90 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Met Leu He He Asp Thr Xaa Gin Gin Xaa Ser Val Arg Lys Leu Trp 

15 10 15 

Lys Gly Val Lys His Val He Leu Xaa Ser Pro Thr Phe Gly Ser Gin 

20 25 30 

Met lie Val Glu Gin Phe Met Ser Lys Phe Ser Gin Asp He Glu Val 
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35 40 45 

He Ser Phe Ser Thr Tyr Leu .Gly Asp Thr Arg He Val Asp Lys Glu 

50 55 60 

Ala Pro Asn His Val Leu Thr Thr Gly Val Lys Lys Lys Leu Tyr Met 
65 70 75 80 

Gly Ser Thr His Ser Xaa Ser Thr Met Cys Gin Arg He Ser Ala Leu 

85 90 95 

Ala Glu Gin Leu Lys He His He Arg He Arg Trp Val Ala Thr Ala 
100 105 HO 

Cys 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Met Ser Lys Leu Leu Met He Gly Thr Gly Pro Val Ala He Gin Leu 

15 10 15 

Ala Asn He Cys Tyr Leu Lys Ser Asp Tyr Glu He Asp Met Val Gly 

20 25 30 

Arg Ala Ser Thr Ser Glu Lys Ser Lys Arg Leu Tyr Gin Ala Tyr Lys 

35 40 45 

Lys Glu Lys Gin Phe Glu Val Lys He Gin Asn Glu Ala His Gin His 

50 55 60 

Leu Glu Gly Lys Phe Gly Asn 
65 70 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE: protein 



PCT/US97/02318 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



Met Phe 



Trp 



Glu 



Gin Leu 



Asn 



Phe Arg 



His lie Asp Val Pro Tyr Asn 
10 15 



1 



5 



Gin lie Asn Glu lie Xaa Thr lie lie Asp Arg Phe He Asn Xaa Lys 

20 25 30 

Asn Lys Ala Xaa Tyr He Glu Pro Arg Phe Asn Asn Pro Thr Gly Arg 

35 40 45 

Ser Leu Thr Asn Glu Gin Lys Lys Asn Xaa He Thr Tyr Xaa Glu Arg 

50 55 60 

His Asn He Pro He He Xaa Asp Asp He Phe Arg Asp He Xaa Phe 
65 70 75 80 

Ser Asp Pro Thr Pro Ala He Lys Thr Tyr Asp Lys Leu Gly Lys Val 

85 90 95 

He His He Ser Ser Phe Ser Lys Thr He Ala Pro Ala He Arg lie 

100 105 110 

Gly Gly Leu Xaa Leu Leu Lys Lys 



(2) INFORMATION FOR SEQ ID NO:126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Met Thr Arg Ser Ala Leu Lys Pro Phe Lys Asn Lys Arg Val Met Val 

15 10 15 

Thr Gly Arg He Gin Arg Val Xaa Phe Lys Asn Tyr Leu Asp Arg His 

20 25 30 

Ser Thr Phe Xaa Pro Asn Val Arg He Leu Leu Lys Asp Val Phe Val 

35 40 45 

Ser Gly Val Ser He Asp His Leu Trp Leu Tyr Glu Thr Asn Lys Tyr 

50 55 60 

Tyr Ala Leu Ala Met Glu Leu He His Gin Arg Val Lys Phe Ser Ala 
fi 5 70 75 80 



115 



120 
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Asn 



Val Gly Thr He Phe Thr Lys Leu Asn Arg Asn 



85 



90 



(2) INFORMATION FOR SEQ ID NO: 127: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 



(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 7: 

Met Gly Ala Glu Asn He He Met Ala Phe He Leu Thr Arg Trp Ala 

1 5 10 15 

Trp Phe Cys Arg Val He Arg Thr Ser Val Met Gin Tyr Thr Ala Ser 

20 25 30 

Asp His Val Arg Phe Ala Lys Thr He Gly Met Asn Asp Met Lys He 

35 40 45 

He His Lys His He Met Pro Leu Thr Leu Ala Asp He Ala He He 

50 55 60 

Ser Ser Ser Ser Met Cys Ser Met He Leu Gin He Ser Gly Phe Ser 
65 70 75 80 

Phe Leu Gly Leu Gly Val Lys Ala Pro Thr Ala Glu Trp Gly Met Met 

85 90 95 

Leu Asn Glu Ala Arg Lys Val Met Phe Thr His Pro Glu Met Met Phe 

100 105 110 

Xaa Pro Gly He Ala He Gly He He Val Met Ala Phe Asn Phe Leu 

115 120 125 

Ser Asp Ala Leu Gin Asn xaa Tyr Trp He Pro Arg He Ser Phe Leu 

130 135 140 

Lys He Asn Phe Arg Xaa Leu 
145 150 

(2) INFORMATION FOR SEQ ID NO: 128: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

Met lie GIu Ser Asp Ser Xaa Xaa Asn Ala Xaa Leu Leu Ala GIu Ser 

15 10 15 

Xaa His Ala Gly Val Glu Thr Ala Tyr Lys Ala Val Met Lys Pro Xaa 

20 25 30 

Glu Gly Thr lie Leu Thr Gly Gly Lys Asp Ala Ala Gin Ala Ala He 

35 40 45 

Glu Lys Ala Asn Asn Thr Glu Asp Cys He Arg He Asn Gly Xaa His 

50 55 60 

Tyr Trp 
65 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Met Leu Arg He Ala He Ala Lys Gly Arg Leu Met Asp Ser Leu He 

15 10 15 

Asn Tyr Leu Asp Val He Glu Tyr Thr Thr Leu Ser Glu Thr Leu Lys 

20 25 30 

Asn Arg Glu Arg Gin Leu Leu Leu Ser Val Asp Asn He Glu Cys He 

35 40 45 

Leu Xaa Lys Gly Ser Asp Val Pro He Tyr Val Glu Gin Gly Met Ala 

50 55 60 

Asp He Gly He Val Gly Ser Asp He Leu Asp Glu Arg Gin Tyr Asn 
65 70 75 80 

Val Asn Asn Leu Leu Asn Met Pro Phe Gly Ala Cys His Xaa Ala Val 

85 90 95 

Ala Ala Lys Pro Glu Thr Thr Asn Tyr Arg Lys He Ala Thr Ser Tyr 

100 105 HO 

Val His Thr Ala Glu Thr Tyr Phe Lys Ser Lys Gly He Asp Val Glu 
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115 120 125 

Leu lie Lys Leu Asn Gly Ser Val Glu Leu Ala Cys Val Val Asp Met 

130 135 140 

Val Asp Gly He Val Asp He Ala Gin Thr Gly Thr Thr Leu Lys Ala 
145 150 155 160 

Asn Gly Leu Val Glu Lys Gin His He Ser Asp He Asn Ala Arg Leu 

165 170 175 

He Thr Asn Lys Ala Ala Tyr Phe 
180 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

Met Asn Asn Met Pro Thr Val Leu He Asp Ala He Ala He Gly Gin 

15 10 15 

Ser Asn Val His Gly Met Leu Lys Glu Gly Leu He Tyr Ala Xaa Val 

20 25 30 

He Gly Ser Asp Leu Gly Pro Xaa Asn Tyr Thr Asp Xaa Leu Phe Ser 

35 40 45 

Tyr He Xaa Val Val Thr Arg Leu Asn Thr Lys Arg Cys 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 131: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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Met Met Arg Val Ala Leu Ser lie Gin Gin Ser Arg lie Glu Ala Gin 

1 5 10 15 

Arg Ser Asn Xaa Lys Ala Val lie Ser Gin Ser Asp Val Leu Ala Arg 

20 25 30 

Met Thr lie Glu Glu His Glu Gin Asp Gly Glu Lys Ala Tyr Gin Tyr 

35 40 45 

Val Leu Asn Ala Glu Pro Ser Asn Leu Phe Glu Thr Lys lie Arg His 

50 55 60 

Ser Ser Asn lie Lys lie Asn Asn Xaa Arg Phe lie Asn Pro Ser lie 
65 70 75 80 

Asn Asp Val Val Gin Ala He Arg Asn Gly Asn 
85 90 



(2) INFORMATION FOR SEQ ID NO: 132: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



Met Met Ala Gin Gly He Gly Arg Leu Thr Gly Lys Pro Gly Val Val 

15 10 15 

Leu Val Thr Ser Xaa Pro Gly Val Ser Asn Leu Thr Thr Gly Leu Leu 

20 25 30 

Thr Ala Thr Ser Glu Gly Asp Pro Val Leu Ala Leu Gly Gly Gin Val 

35 40 45 

Lys Arg Asn Asp Leu Leu Arg Leu Ala His Gin Ser He Asp Asn Ala 

50 55 60 

Ala Leu Leu Lys Tyr Ser Ser Lys Tyr Ser Glu Glu Val Gin Asp Pro 
65 70 75 80 

Glu Ser Leu Ser Glu Val Met Thr Thr Cys Asn Ser Asn Cys Tyr Xaa 

85 90 95 

Arg Xaa Xaa Trp Arg Lys Xaa His 
100 



(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Met Met He Gly Glu Asp Phe Ser His Tyr Leu Xaa Val Arg Pro Gly 

15 10 15 

Ala Phe Phe Leu Thr Gly Cys Gly Asn Glu Ser Lys Gly lie Thr Ala 

20 25 30 

Pro His His Asn Pro Lys Phe Asp He Asp Xaa Lys Ser Leu Lys Tyr 

35 40 45 

Ala Val Ala Val Phe Leu Lys He He Glu Leu Glu Gin Val Phe Lys 

50 55 60 

Xaa Xaa 
65 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



Met val Leu Gly Val Ser Xaa Phe Leu Glu Thr Xaa He Asn Asn Leu 

15 10 15 

Ser Glu Glu Leu Ser Val He Asn Thr Ser Lys Leu Glu Ser He Thr 

20 25 30 

His Tyr His Phe Asp Xaa Val Asp Leu Leu He Thr Xaa His Asp He 

35 40 45 

Pro Lys Gin Thr Leu Asn He Leu Pro Lys His Leu Thr Thr He Lys 

50 55 60 

Val Ala Pro Leu Phe Ser Glu Asp Asp Arg His Lys He Arg His Val 
65 ™ 75 80 

Val Lys Gin Lys Gin Asn Pro Val Gin Ala His His His Met Asp Thr 

318 



WO 97/30070 PCIYUS97/02318 

85 90 95 

Val Asn Phe Leu Val Gly Asn Thr Glu Gin Lys Ser Arg His Thr Val 

100 105 110 

Gin lie Leu Glu Glu Ala Gin Lys lie Leu Gin Ala His His Ala lie 

115 120 125 

Gly Arg Arg He Xaa Arg He Ser Phe Arg Ala 
130 135 

(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Leu He Asp Trp Arg Met Ala Ser Ala Leu Leu Val Thr He Pro 

1 5 10 15 

He Ser He Phe Ala Phe Lys Lys Val Met Ser Gly Phe Asn Glu Thr 

20 25 30 

Tyr Ala Glu Gin Met Lys Ser Asn Asn Tyr Met Asn Ser Ala He Val 

35 40 45 

Glu Phe He Glu Gly He Glu Val He Lys Thr Phe Asn Gin Ser Gin 

50 55 60 

Ser Ser Tyr 
65 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
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Met Ala lie lie Arg Gin Arg Tyr Gin Glu Gin Leu Asp Asp Leu lie 

1 5 10 15 

Lys Glu Leu Arg Arg Leu Gly Ala Asn Val Tyr Val Ser lie Glu Asn 

20 25 30 

Gly lie Lys Ser Leu Ser lie Asp Asp Arg Gly Phe Xaa Arg Gin Thr 

35 40 45 

Val Lys Asn Asp Lys His lie Asn Gin Leu Asn Tyr Asp lie Asn Glu 

50 55 . 60 

Arg Val lie Met Leu lie Thr Lys Gin Gin Xaa lie Ala Ser Asp Leu 
65 70 75 80 

Arg Met Met lie Ser Ser Xaa Lys Asn Arg Leu Arg Phe Arg Lys Asn 
85 90 95 

Arg Arg 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Met Lys Glu Arg Gly His lie Glu Gin Leu Trp Arg Glu Glu Lys Tyr 

15 10 15 

His Val Leu Leu His Ser Gin Gin Ser Tyr Gin Met lie Arg Asn Ala 

20 25 30 

Leu Lys Thr Asp Leu Ser Leu His Gin Val Gin Gin Met He Asp Asp 

35 40 45 

Ala Leu Leu Xaa Xaa Pro Ser He Gly Ser Val Cys Asn Ala Phe Ala 

50 55 60 

His Met Trp Gly Tyr Phe Lys Lys Cys Ala Asn Glu Glu Glu Arg Lys 
65 70 75 80 

His Ser Xaa Leu Leu Thr Ala Asp Phe He Asn Gly Lys He Asp Thr 

85 90 95 

Gin Thr Leu Leu Asp Phe Leu Ala Glu Leu Ala Asn Lys Tyr Asp Val 

100 105 110 

Gin Tyr Leu Leu Gin Ser Arg Val Leu Asn Thr Lys Arg Lys Arg 
115 120 125 
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(2) INFORMATION FOR SEQ ID NO: 138: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

( C ) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



Met Asn lie Asp Gly Leu Asp Ala 

1 5 
Lys Asn Asn lie Asp Asp Asp Val 
20 

Lys Glu Tyr Val Val Arg Ala Lys 

35 40 
Lys Gly Tyr Trp Thr Gly Asn Leu 
50 55 



Leu Leu Asn Gin Phe His Asp Met 

10 15 
Asp Asp Thr Leu Gin Glu Lys Ala 
25 30 
Leu Lys Ala Arg Glu Val Met Asn 
45 

Ser Arg Asn lie Arg Tyr 
60 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



Met Val Gly Val Xaa Gly Lys He 

1 5 
Pro Gly Asp He Arg Asn Val He 
20 

Val Leu Ser Gin Leu Phe Gin Tyr 

35 40 
Gly His Ser Leu Gly Asn Leu Leu 

50 55 . 

Asn Asp Phe Gly His Ala He Lys 



Arg Asp Glu Met Val He Pro Ala 

10 15 
Ala Ala Leu Ser Asp Ser Glu Ser 
25 30 
Arg Phe Glu Glu Asn Gin He Ser 
45 

He Ala Gly Met Thr Xaa He Thr 
60 

Ala Leu Ser Lys He Leu Asn He 
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65 70 75 80 

Lys Gly Arg Val lie Pro Ser Thr Asn Thr Lys Trp Cys Asn 
85 90 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 



Met Gly Leu Lys Gly Gin Gly Val Leu Leu Val Asn His Xaa Gin Asn 

15 10 15 

Val Asp Pro Ser Xaa His Gly Gly Gly Gin Glu Tyr Gly Val Arg Ser 

20 25 30 

Gly Thr Val Asn Val Gin Met lie Leu Gin Trp Xaa Xaa Arg 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Met Asp Glu Gin Gin Gin Leu Thr Asn Ala Tyr His Ser Asn Lys Leu 

15 10 15 

Ser His Ala Tyr Leu Phe Glu Gly Asp Asp Ala Gin Thr Met Lys Gin 

20 25 30 

Val Ala lie Asn Phe Ala Lys Leu lie Leu Cys Gin Thr Asp Ser Gin 

35 40 45 

Cys Glu Thr Lys Val Ser Thr Tyr Asn His Pro Asp Phe Met 
50 55 60 
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(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



Met Gly Pro Asn Gly Ala Gly Lys 

1 5 
Gly Glu Phe Asn Ala Thr Gly Thr 
20 

Gin Gin Gin Xaa Gin Gin lie Thr 

35 40 
Asp Leu Asp Phe Pro lie Ser Val 

50 55 
Tyr Lys Glu lie Gly Trp Phe Arg 
65 70 



Ser Ser Leu lie Lys Ser Leu lie 

10 15 
Lys Xaa Xaa Tyr Asn Lys Pro lie 
25 30 
Tyr He Pro Gin Lys Ala His He 
45 

Glu Gin Val He Leu Ser Gly Cys 
60 

Arg Pro Asn Lys Ser Ala Arg Asp 
75 80 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



Met Asp Asp Leu Arg Lys He Asn Gin Glu Leu Gly He Thr He Leu 

1 5 10 15 

He Asn Leu His Phe Val Asp Leu Ala Lys Glu Tyr Gly Thr Arg He 

20 25 30 

He Gly Leu Arg Asp Gly Glu Val Gly Leu 
35 40 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 



Met Asp Ala Tyr Lys Glu Val Arg 

1 5 
His Asn Ala Thr Val Ser Asn Ala 
20 

Asn Ala Ala Val Asp Ala Ala Gin 

35 40 
Val Val Lys Ser Gin Gin Glu Val 

50 55 
Asp Lys lie Asn Ala lie Gin Thr 
65 70 
Asp Thr Glu Val Glu Asn Ala Tyr 
85 



Gin Ser Ala Thr Ala Arg Lys Ala 

10 15 
Thr Asp Glu Glu Val Xaa Glu Ala 
25 30 
Thr Glu Gly Leu His Asp lie Gin 
45 

Ala Asp Thr Lys Ala Lys Val Leu 
60 

Gin Ala Lys Val Xaa Pro Ala Ala 

75 80 
Xaa Tyr Thr 
90 



(2) INFORMATION FOR SEQ ID NO: 145: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



Met Asp Xaa Lys Xaa Leu His His Xaa Ser Pro Leu Pro Met Phe Gly 

15 10 15 

Thr Cys Ala Gly Leu lie Val Leu Xaa Gin Asp lie Xaa Gly Glu Glu 

20 25 30 

Gly Tyr Leu Asn Lys Leu Xaa lie Pro Val His Arg Asn Ser Phe Gly 
35 40 45 
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Arg Gin Val Xaa Xaa Phe Glu Thr Glu Leu Asp He Lys Gly He Xaa 

50 55 60 

Thr Asp He Val Gly Val Phe He Arg Ala Pro His He Glu Lys Val 
65 70 75 80 

Gly Gin Gly Val Xaa He Leu Cys Lys Val Asn Gly Lys He Val Ala 

85 90 95 

Val Gin Gin Gly Lys 
100 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Met Glu Leu Lys Xaa Leu Xaa Gin Asn Ser Thr Leu Pro Met Phe Gly 

1 5 10 15 

Thr Cys Ala Gly Val He Val Leu Ala Gin Asp He Xaa Xaa Glu Xaa 

20 25 30 

Gly Tyr Xaa Asn Lys Leu Asn He Thr Val Xaa Arg Xaa Ser Phe Gly 

35 40 45 

Arg Xaa Val Asp Ser Phe Glu Thr Xaa Leu Asp He Lys Gly He Ala 

50 55 60 

Thr Asp He Xaa Gly Val Phe He Arg Ala Pro His He Glu Lys Val 
65 70 75 80 

Gly Xaa Gly Val Asp lie Leu Cys Xaa Xaa Asn Glu Lys He Val Xaa 

85 90 95 

Val Xaa Gin Gly Lys Tyr Leu Gly Val Ser Phe His Pro Glu Leu Thr 

100 105 HO 

Asp Asp Tyr Arg Val Thr Asp Tyr Phe He Asn His He Val Xaa Lys 
115 120 125 

Ala 



(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 



PCT/US97/02318 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Met Lys Asn Val Pro Leu Ser Xaa Xaa Xaa Asn Val Xaa Xaa Met lie 

15 10 15 

Asp Lys Gin Glu Phe Met Xaa Val Ser Gly Met Asn Arg Met Ala Tyr 

20 25 30 

Asn Asp Gin Tyr lie lie Gly Gin Arg Gly Asp Glu Phe lie Leu Tyr 

35 40 45 

Lys Phe Gly Asp Glu Ser Met Arg Val Tyr Asn Thr Glu Phe Glu Met 

50 55 60 

Gin Gin Asp Leu Asn Glu Leu Gly Gin Asn Leu Gin Leu Lys Pro Glu 
65 70 75 80 

Asn Ala Tyr Gin 



(2) INFORMATION FOR SEQ ID NO: 148: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Met lie Xaa Ala Leu Tyr Xaa Trp Ser Ser Xaa Xaa Thr Tyr lie Ala 

15 10 15 

Leu Gly Asn Met Met Xaa Thr Ala Ala Xaa Leu Gly lie Asp Ser Tyr 

20 25 30 

Pro Met Glu Gly Phe Ser Leu Asp Thr Val Thr Asp Xaa Leu Ala Xaa 

35 40 45 

Xaa Gly He Leu Asp Thr Glu Gin Phe Gly Leu Ser Val Met Val Ala 

50 55 60 

Phe Gly Tyr Arg Gin Gin Asp Pro Pro Lys Asn Lys Thr Arg Gin Ala 
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65 70 75 80 

Tyr Glu Asp Val Asn Glu Gly Val Gly Pro Xaa Glu 
85 90 

(2) INFORMATION FOR SEQ ID NO:149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Met Leu Leu Ala Ser Phe Val Leu lie Thr Leu Gly Gin Ser Val Glu 

1 5 10 15 

Gin Leu Arg Ala Ala lie lie Tyr Val Val Leu Asn lie He Gly Ser 

20 25 30 

Trp Leu Phe Leu Leu Gly He Gly Leu Leu Tyr Lys Thr Val Gly Thr 

35 40 45 

Leu Asn Phe Ser His He Ala Met Arg Leu Asn Asp Met Gly Asp Asn 

50 55 60 

Arg Thr Val Thr Met He Ser Leu He Phe Leu Val Ala Phe Ser Ala 
65 70 75 80 

Lys Ala Ala Leu Val Leu Phe Met Trp Leu Pro Lys Ala Tyr Ala Val 

85 90 95 

Leu Asn Thr Glu Leu Ala Ala Leu Phe Ala Ala Xaa Met Thr Xaa Val 

ioo ios 110 

Gly Ala Tyr Ala Leu He Arg lie Leu His Phe Thr He Trp He Lys 

115 120 125 

His Asn Asp Leu He His Pro Leu Leu Ala Thr Met Ala Ala Tyr 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE: protein 



PCTYUS97/023I8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



Met Gin Pro Ser Met Gin Tyr Phe Asn Glu Trp Trp Lys Gin Leu Phe 

15 10 15 

Gly Glu Ser Glu Gly Lys Asp Phe Lys Gly lie Tyr Pro Ser Ser Ala 

20 25 30 

Asn Tyr Thr Thr Asp Leu His Ser Leu Gly Gin Tyr Val Gin Glu Gly 

35 40 45 

Arg Arg Phe Leu Phe Glu Thr Xaa Val Lys Val Asn His Pro Lys Tyr 

50 55 60 

Asp lie Thr He Glu Lys Asp Ser Asp Asp Leu Asp Gly Leu Asn Tyr 
65 70 75 80 

Leu Ala Gly Lys Thr He Asp Asp Gly 
85 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



Met Thr Leu He Xaa Leu Ala Met 

1 5 
Leu He Lys Asn Ala Val Asp Ala 
20 

Arg He Asn Thr Tyr Xaa Met Glu 

35 40 
Ser Asn Asn His Ala Gin Cys Met 
50 55 



He He He Glu Tyr Ser Phe Leu 

10 15 
Gly Val He Asn Asn Ala Thr Thr 
25 30 
Ser Leu Ala Thr Val Thr Thr Glu 
45 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Met Phe Tyr Gly Leu Tyr Glu Glu Met Xaa His Leu Leu Tyr Cys Ser 

15 10 15 

Ser Ala Gly His Glu Pro Gly Tyr lie Tyr Arg Ala Val Lys Glu Glu 

20 25 30 

Phe Xaa Glu lie Ser Val Arg Gly Arg Val Leu Gly lie Ser Ser His 

35 40 45 

Thr Arg Tyr Gin Gin Gin Glu lie Pro lie Tyr Leu Asp Asp Leu lie 

50 55 60 

lie lie Leu Thr Asp Gly Val Thr Glu Ala Arg Asn Ser Glu Gly Thr 
65 70 75 80 

Phe lie Asp Lys His Lys Leu Leu Glu Tyr lie Lys Lys His Lys His 

85 90 95 

Leu His Pro Gin Asp lie Val Xaa lie lie Tyr Glu Ala lie Leu Arg 

100 105 110 

Leu Gin Asn Pro Asn Lys Lys Asp Asp Met Thr lie Leu lie lie Lys 

115 120 125 

Arg Val Asn 
130 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 179 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Met Arg Asp Glu He Asp Pro Val Arg Arg Thr Gly He Lys Gin Leu 

1 5 10 ' 15 

Met Asn Lys Gin Tyr Gin Gin Asp His Leu Val Ala Ser Ser Asp Lys 

20 25 30 

Gly Lys Glu Lys Leu Arg Asn Ser Asp Thr Val Ala Val Leu Leu Pro 
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35 40 45 

Ala Xaa Thr Phe Tyr Leu Gly Lys Glu Asp Tyr Ala Ala Ala Arg Gly 

50 55 60 

Met Leu Asp Asn Asn Gly Ala lie Ala Leu Ala Thr Asp Tyr Asn Pro 
65 70 75 80 

Gly Ser Ser Val Thr Asn Asn Leu Gin Leu Val Met Ala lie Ala Ala 

85 90 95 

Leu Lys Leu Lys Leu Ser Pro Ser Glu Val Trp Asn Ala Val Thr Val 

100 105 110 

Asn Ala Ala Lys Ala lie Asp lie Asn Ala Gly Thr lie Asn Thr Gly 

115 120 125 

Asp Lys Ala Asn Leu Val lie Trp Asp Ala Pro Asn His Glu Tyr lie 

130 135 140 

Pro Tyr His Xaa Gly Tyr Gin Ser Arg Arg Lys Lys Leu Leu Lys Thr 
145 150 155 160 

Val Arg Xaa Ser Xaa lie Thr His Tyr Leu Leu Lys His Lys lie lie 
165 170 175 

Arg Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 155 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Met Thr lie Leu lie Asp Thr Tyr Gin Gin Tyr Leu Xaa Xaa Phe Glu 

1 5 10 15 

Ala His Xaa Xaa Arg Val Leu Gin Asp Asp Gin Tyr lie Glu Ala Leu 

20 25 30 

Glu Thr Leu Met Asp Xaa Tyr Ser Glu Phe lie Leu Asn Pro lie Tyr 

35 40 45 

Glu Gin Gin Phe Asn Ala Trp Arg Asp Val Glu Glu Lys Ala Gin Leu 

50 55 60 

lie Lys Ser Leu Gin Tyr lie Thr Ala Gin Cys Val Lys Gin Val Glu 
65 .70 75 80 

Val lie Arg Ala Arg Arg Leu Leu Asp Gly Gin Ala Ser Thr Thr Gly 
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85 90 95 

Tyr Phe Asp Asn lie Glu His Cys lie Asp Glu Glu Cys Gly Gin Cys 

100 105 110 

Ser He Thr Ser Asn Asp Lys Leu Leu Leu Xaa Gly Xaa Gly Ala Tyr 

115 120 125 

Pro Met Thr Leu Xaa Gin Val Ala Lys Glu Thr Gly Ala Ser Val He 

130 135 140 

Gly He Asp He Asp Pro Thr Ser Arg Xaa Thr 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



Met Lys Leu Asn Asn Tyr Ser Leu 

1 5 
Xaa Asn Cys Asp Leu Asn Phe Tyr 
20 

Gly Lys Asn Gly Val Gly Lys Ser 

35 40 
Asn Asn Ser Gly Asn He Pro Lys 

50 55 
Asn He Lys Xaa His Gin He Phe 
65 70 



Lys Val Lys Asn Lys Gin Leu Xaa 

10 15 
Leu Gly Gin He Asn His He Xaa 
25 30 
Leu Leu Ala Lys Asp Phe Leu Leu 
45 

Ser He Ser Gin Asn Gly Asn Leu 
60 

Pro Asn Gly Tyr 
75 



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
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Met Val Gin Leu His Gin Leu Phe 

1 5 
Xaa Glu Tyr Met Xaa Pro Gly Gly 
20 

Xaa lie Asn Xaa His Gly lie Asn 

35 40 
Xaa Xaa Xaa Glu Ser Thr Ser Gly 
50 55 



Pro Lys His Glu Val Phe Xaa Arg 

10 15 
Arg Met Lys Asp Arg Pro Ala Lys 
25 30 
His Gly Leu lie Thr Xaa Asn Thr 
45 



(2) INFORMATION FOR SEQ ID NO: 157: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



Met Glu Gly Gly lie Ser Tyr lie Val Arg Xaa Ser Lys lie Asp Pro 

15 10 15 

Val Asn lie Val Gly lie Gly lie Asp Phe Thr Ser Ser Thr lie lie 

20 25 30 

Phe Thr Asp Glu Asn Leu Asn Pro Val His Asn Leu Lys Gin Phe Lys 

35 40 45 

Asn Asn Pro His Ala Tyr Val Lys Leu Trp Lys His His Gly Ala Tyr 

50 55 60 

Lys Glu Ala Glu Lys Leu Tyr Gin Pro Ala lie Glu Asn Asn Asn Xaa 
65 70 75 80 

Trp Xaa Gly His Tyr Gly Tyr Asn Val Ser Ser Glu Trp Met lie Pro 

85 90 95 

Lys lie Met Glu Val Met Asn Arg Ala Pro Asp He Met Glu Lys Thr 

100 105 110 

Ala Tyr He Met Glu Ala Gly Asp Xaa He Val Xaa Xaa Leu Thr Asn 

115 120 125 

Lys Asn Val Arg Ser Asn Cys Gly Leu Gly Phe Gin Ser He Val Gly 

130 135 140 

Lys Lys Glu Thr Gly Ser Ser He Met Asp Leu Phe Gly 
145 150 155 
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(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Met Thr He Asn Leu Ser Glu Thr Phe Ala Asn Ala Lys Asn Glu Phe 

15 10 15 

He Asn Ala Val Asn Asn Gly Glu Pro Gin Glu Arg Gin Asn Glu Leu 

20 25 30 

Xaa Gly Asp Met He Asn Xaa Leu Phe Gly Gly Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:159: 

Met Leu Asp Glu Xaa Xaa Xaa Asn Leu Asp Asp Tyr Met Arg Tyr Leu 

15 10 15 

He Thr Lys Lys Glu Xaa Leu Ser Lys Leu He Asp Ser Leu Met Leu 

20 25 30 

Thr Leu Glu Asn Lys Tyr He Asp He Ala Glu Ala Phe Xaa He Gin 

35 40 45 

Cys Ala Arg Glu He Asn Asn Gin Glu He Glu Asn He Xaa Ser Glu 

50 55 60 

Leu Asn Lys Val Glu Ala Tyr Tyr Ala Gin He Glu Thr Gin He Gin 

65 70 75 80 

Gin Thr Ser Thr Glu Lys He Ala Thr Glu Lys Thr Ser Tyr Leu He 
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85 90 95 

Asn Tyr Met Asn Ala Val Ala . 
100 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Met Xaa Arg lie Glu Gin Ala Glu Val Arg Met Glu Glu lie Asp Val 

1 5 10 15 

Leu Met lie Glu Ala Ser Ala Asp Tyr Gly Lys He Lys Glu Leu Xaa 

20 25 30 

Glu Glu Lys Xaa Xaa Leu Xaa He Pro He 
35 40 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Met Glu Tyr He Tyr Cys Ala Arg Pro Asp Ser Thr He Ala Gly Lys 

15 10 15 

Asn Val His Ala Xaa Arg Lys Ala Ser Gly Lys Lys Leu Ala Gin Glu 

20 25 30 

Ser Pro Val Asn Ala Asp Met Val He Gly Val Pro Xaa Ser Ser Leu 

35 40 45 

Ser Ala Xaa Ser Gly Leu Cys 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Met Pro Cys lie Asn Leu Glu Phe Gly Thr Gly Leu Gly Thr Lys Gly 

15 10 15 

Xaa Gly Gly Ser Xaa Gin Gin Tyr Tyr Pro Leu Xaa Asp Glu lie Asp 

20 25 30 

Phe Lys Xaa lie 
35 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

Met Ala Glu Leu lie His His Phe Asn Gin Tyr Asp Cys Val Glu Glu 

15 10 15 

Thr Glu Leu Phe Asn lie Val Gin Gin Val Val Ala His Ala lie Asn 

20 25 30 

Pro Thr Leu Pro His Xaa Asn Glu Leu Lys Asp lie Leu Phe Gly Pro 

35 40 45 

Thr lie Thr Val Lys Ala Leu Leu Asn Met Arg Met Glu Asn Lys Val 

50 55 60 

Lys Gin Tyr Leu Asn lie Glu Leu Asp Asn Pro lie Lys Lys Glu Val 
65 70 75 80 
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(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Met Ala Xaa Leu lie His His Phe Asn Gin Tyr Asp Cys Val Val Glu 

1 5 10 15 

Thr Glu Leu Phe Asn lie Val Gin Gin Val Val Ala His Ala lie Asn 

20 25 30 

Pro Thr Leu Pro His Ala His Glu Xaa Lys Asp lie Leu Phe Gly Pro 

35 40 45 

Thr lie Thr Val Asn Ser Val Val Lys Tyr Glu Asn Gly Lys 
50 55 60 

(2) INFORMATION FOR SEQ ID NO:165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Ala Pro Xaa Thr Lys Xaa Asn Xaa Lys Lys Glu Arg lie Val Gly 

5 10 15 

Ala Gly Gly Gly Met Val Ala Val Thr Val Thr Gly His Lys Glu 

20 25 30 

Val Asp Val Glu lie Lys Glu Glu Xaa Val Asp Pro Asp Asp lie 

35 40 45 

Met Leu Pro Lys Thr 
50 



(2) INFORMATION FOR SEQ ID NO: 166: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 166: 

Met Leu Lys Glu lie His Glu Gin Pro Ala Val Met Arg Arg lie He 

15 10 15 

Gin Glu Tyr Gin Asp Ala Glu Gly Xaa Leu Xaa He Gly Ser Lys Thr 

20 25 30 

Xaa Ser Met Xaa Leu Xaa Lys Gin Asp Arg He Tyr Gly Tyr Cys Ser 

35 40 45 

Arg Val Gin Ser Tyr Pro Cys Arg Val 
50 55 



(2) INFORMATION FOR SEQ ID NO: 167 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 : 

Met He Ala Val Asn Trp Asn Thr Gin Glu Asp Met Thr Asn Met Phe 

15 10 15 

Trp Arg Gin Asn He Ser Gin Met Trp Val Glu Thr Glu Phe Lys Val 

20 25 30 

Ser Lys Asp He Ala Arg Trp Lys Xaa Leu Ser Glu Ala Glu Pro Asp 

35 40 45 

Pro Phe Lys Lys Ala Leu Ala Gly Leu Ala Gly Leu Asp Thr His Xaa 

50 55 60 

Gly Asp Asp Gly Met Pro Leu Val Met Leu His Thr Thr Asp Leu Arg 
65 70 75 80 

Lys Lys Ala Val Tyr Ser Phe Met Xaa Met Met Glu Gin He His Ala 
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85 



90 



95 



Arg Lys Ala Tyr Xaa His He Xaa Thr Thr Leu Leu Pro Ser 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 168 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Met Val Ser Asp Gin Gin Asp Phe Met Tyr Thr Leu Arg Met Met He 

1 5 10 15 

Val Thr Leu Gly Gly His Val Leu Asp Phe Glu Val Glu Asp Val Lys 

20 25 30 

Val Glu He Gin Asn He Glu Thr Phe Glu Gin His Phe Pro He Gin 

35 40 45 

Asp Lys Asp Phe Thr Lys Ala Asn He Asp Tyr Lys Asp He Ala Asp 

50 55 60 

Glu He Arg Glu Ala Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

Met Pro Pro Thr Asn Thr Ser Thr He He Pro Met Cys His Pro Leu 

15 10 15 

Pro Leu Thr Xaa He Asp Val His Phe Ser Trp Asp Glu Thr Asn Ala 



20 



25 



30 
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Pro Leu Tyr Thr Leu Asn He Gin Thr Thr Val Ser Thr Thr Ala Lys 

35 40 45 

Thr Gly Val Glu Met Glu Ala Leu Thr Ala Ala Ser Ala Thr Ala Leu 

50 55 60 

Thr He Tyr Asp Met Thr Lys Ala Val Asp Lys Gly Met He He Gly 
65 70 75 80 

Glu Thr Tyr Leu Glu Ser Lys Ser Gly Gly Lys Ser Gly Asp Phe Gin 

85 90 95 

Arg Gin Ser Gly Xaa 
100 

(2) INFORMATION FOR SEQ ID NO: 170: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:170: 

Met Thr Lys Phe He Cys Val Thr Gly Gly Val Val Ser Ser Leu Gly 

1 5 10 15 

Lys Gly He Thr Ala Ser Ser Leu Gly Arg Leu Leu Lys Asp Arg Gly 

20 25 30 

Leu Asn Val Thr Xaa Gin Xaa Phe Xaa Pro Tyr Leu Asn Val Asp Pro 

35 40 45 

Gly Thr Met Ser Pro Tyr Gin His Gly Glu Val Phe Arg Asn Gly 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
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Met Pro Pro Glu Glu Trp He Cys Thr Met Val Asn Ser Met Arg He 

15 10 15 

Gly Leu Pro Ala Ser Leu Asp Lys Val Gly Glu Val Leu Arg Leu Gin 

20 25 30 

Ser Gin Lys Asp Lys Ala Gly Lys Asn Leu He Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 94 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Met Thr Lys Glu Gin Gin Leu Xaa Glu Arg He He Ala Ala Xaa Gly 

1 5 10 15 

Gly Met Asp Asn He Asp Ser Val Met Asn Cys Met Thr Arg Val Arg 

20 25 30 

He Lys Val Leu Asp Glu Asn Lys Val Asp Asp Gin Glu Leu Arg His 

35 40 45 

He Asp Gly Val Met Gly Val He His Asp Glu Arg He Gin Val Val 

50 55 60 

Val Gly Pro Gly Thr Val Asn Lys Val Ala Asn His Met Ala Xaa Leu 
65 70 75 80 

Ser Gly Val Asn Leu Gly Asp Pro He Pro Gin His His Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
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Met Lys Gin Lys Xaa Leu Pro Lys Val Arg Met Ser Glu lie Gly Phe 

15 10 15 

lie Leu Gin Ala Thr Asn Leu Val Pro Phe Leu Thr Val Lys Gin Gin 

20 25 30 

Phe Thr Leu Leu Lys Lys Lys Asn Lys Asn Val Met Ser Asn Glu Asp 

35 40 45 

Tyr Gin Gin Leu Met Ser Gin Leu Gly Leu Thr Ser Leu Leu Asn Lys 

50 55 60 

Leu Pro Ser Glu lie Ser Gly Gly Gin Lys Gin Arg Val Ala He Ala 
65 70 75 80 

Gin Ser Val He Tyr Glu Ser Val Asp Tyr Phe Ser Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO: 174: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

Met Asn Met His He Leu Tyr Xaa Xaa Arg Xaa Lys His Asn Leu Glu 

1 5 10 15 

He Asp Xaa Leu Ala Gin Gin Leu Xaa Glu hyd^ Tyr Gly Thr Lys Tyr 

20 25 30 

Glu Ala His Xaa Met Gly Glu Trp Glu Asn His His His Asp Pro Lys 

35 40 45 

Phe Xaa Asp Ala Met His Leu Xaa Asp Phe Cys Gly Cys Thr He 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

Met Ala Phe Lys Asp Gly Thr lie Asn Pro Arg Lys Ser Asn Gin Leu 

15 10 15 

Lys Asp Tyr Val Phe He Asp Asp Gly Trp Ala Lys His Gly Thr Tyr 

20 25 30 

Cys Val Val Arg Arg He Gin He His He Glu Thr Trp Asp Arg Thr 

35 40 45 

Ala Leu Glu Glu Gin Glu Ala Thr Phe Gly Ser Gly Asn Asp He Val 

50 55 60 

Gly Ala Pro Leu Thr Gly Gly Lys Glu Phe Gly 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

Met Asp Ser Phe Val Asn He Asp Gly Thr Asp Asn Leu Leu Val Leu 

15 10 15 

Lys Thr Leu Pro Gly Asn Ala Gin Ser He Gly Ala lie Leu Asp Gin 

20 25 30 

He Asn Trp Glu Glu Val Leu Gly Thr He Cys Gly Asp Asp Thr Cys 

35 40 45 

Leu He He Cys Arg Ser Lys Glu Ala Ser Asp Glu He Lys Ser Arg 

50 55 60 

He Phe Asn Leu Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 177: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 112 amino acids 

342 



WO 97/30070 PCT/US97/02318 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



Met Val Asp Asp Ala Leu His Thr Glu Met lie Val Asp Gly Thr His 

15 10 15 

Ser His Pro Ala Ser Val Ala lie Ala Tyr Arg Met Lys Gly Asn Glu 

20 25 30 

Arg Xaa Tyr Leu lie Thr Asp Ala Met Arg Ala Lys Gly Met Pro Glu 

35 40 45 

Gly Glu Tyr Asp Xaa Gly Gly Gin Lys Val Thr Val Xaa Ser Gin Gin 

50 55 60 

Ala Arg Leu Ala Asn Gly Ala Leu Ala Gly Ser lie Leu Lys Met Asn 
65 70 75 80 

His Gly Leu Arg Asn Leu lie Ser Phe Thr Gly Asp Thr Leu Xaa His 

85 90 95 

Leu Trp Arg Val Thr Ser Leu Asn Gin Ala He Ala Leu Gly He Arg 
100 105 HO 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

Met Ala Pro Tyr Thr Asn Lys Gly Tyr Ala Val Lys Glu Leu Cys Ser 

15 10 15 

Tyr Leu Gin He Asn Met Asp Asp Val Tyr Thr He Gly Asp Arg Lys 

20 25 30 

Leu He Tyr Leu Cys Leu He Pro 
35 40 



(2) INFORMATION FOR SEQ ID NO: 179: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

Met Val Lys Xaa Xaa lie Tyr Pro Arg Asn Glu lie His Phe Ser Tyr 

1 5 10 15 

Thr His He Ala Phe Thr He Asp Asp Ser Glu Phe Lys Tyr Trp His 

20 25 30 

Gin Arg Xaa Lys Asp Asn Asn Val Asn He Leu Gly Gly Arg Val Arg 

35 40 45 

Asp He Arg Asp Arg Gin Ser He Tyr Phe Thr Asp Pro Asp Gly His 

50 55 60 

Lys Leu Glu Leu His Thr Gly Thr Leu Glu Asn Arg Leu Asn Tyr Tyr 
65 70 75 80 

Lys Glu Ala Lys Pro His Met Thr Phe Xaa Lys 
85 90 



(2) INFORMATION FOR SEQ ID NO: 180: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

Met Leu Val Gin Ser Gly Lys Pro Val Ala Val Phe Lys Thr His Glu 

1 5 10 15 

Glu Ala Pro Arg Val Leu He Ser Asn Ser Val Leu Val Pro Glu Trp 

20 25 30 

Ala Asn Trp Asp His Phe Asn Glu Leu Asp Lys Lys Gly Leu lie Met 

35 40 45 

Tyr Gly Gin Met Thr Ala Gly Ser Trp He Tyr He Gly Xaa Ser Arg 
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50 55 
Tyr Cys Ala Arg Tyr Leu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Met His His Asp His Pro Asn Lys Leu Lys Ser Asp Glu Lys He Leu 

15 10 15 

Leu Asp Gin Met Val Ser His Phe Pro Asn Phe Glu Asp Asp Phe Lys 

20 25 30 

Asn Ala Ala Gin Gly Ala Trp Val Lys Asn Ala Thr Asp Glu Leu Lys 

35 40 45 

Asp He Ser Asn Asp Leu Glu Lys He Gin Asp He Lys Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

Met Ser Leu Ala Glu Arg Xaa Lys Ala Xaa Lys Ala Phe Arg His Xaa 

15 10 15 

Val Lys Asn Met Asp Asp Val Arg Met Pro Phe Ser Ser Val His Asp 

20 25 30 

Ala Pro Ser Lys He He 
35 
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(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 



Met Lys Pro Asn Ser Lys Ser Asn Lys Lys lie Met Lys Asn Tyr Asn 

15 10 15 

Trp Glu Tyr Phe Lys Ala Gin lie Asn Gin Lys Leu Ser Glu Pro Glu 

20 25 30 

Thr Lys Lys lie Tyr Ser Gin Arg Lys He Asp Val Glu Leu Val Phe 

35 40 45 

Gly He Tyr Glu Gly Tyr Phe Gly Phe His Ser Asn Val Ser Ser Arg 
50 55 60 

Asn 
65 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



Met Trp Asn Phe He Lys Cys Val Phe Lys Phe Val Phe Ser Leu Val 

15 10 15 

Ala He Thr Thr Leu Val Ala Gly Val Gly Val Val Ala Phe Ala Tyr 

20 25 30 

He Phe Lys Lys Asp Phe Glu Asp He Glu Arg Lys Thr Lys Glu He 

35 40 45 

He Ser Asp He Glu Ser Lys Asn Asn 
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(2) INFORMATION FOR .SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

Met Glu lie Asp Met Leu Gly Asp Ala Tyr Glu Phe Leu lie Gly Arg 

1 5 10 15 

Phe Ala Ala Thr Ala Gly Lys Lys Ala Gly Glu Phe Tyr Thr Pro Gin 

20 25 30 

Gin Val Ser Lys lie Leu Ala Lys lie Val Thr Asp Gly Lys Asp Lys 

35 40 45 

Leu Arg His Val Tyr Asp Pro Thr Cys Gly Ser Gly Ser Leu Leu Leu 

50 55 60 

Arg Val Gly Lys Glu Thr Gin Val Tyr Arg Tyr Phe Gly Gin Glu Arg 
65 70 75 80 

Asn Asn Thr Thr Tyr Asn Leu Ala Arg Met Asn Met Leu Leu His Asp 

85 90 95 

Val Arg Tyr Glu Asn Phe Glu lie Arg Asn Asp Asp Thr Leu Glu Asn 

100 105 110 

Pro Ala Phe Leu Gly Asn Thr Phe Asp Ala Val lie Ala Asn Pro Pro 

115 120 125 

Tyr Arg Cys Glu Met Asp Ser Arg Phe Lys lie 
130 135 

(2) INFORMATION FOR SEQ ID NO: 186: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:186: 
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Met Tyr Leu Lys Asp Thr Arg He Glu He Val Gly Val Asn Lys Lys 

15 10 15 

Asp Pro Leu Gin Tyr Ala Glu Ala He Asp Lys Leu Val Ser Ser Gly 

20 25 30 

Ser Phe Thr Arg Asn Glu Val Arg He Met Leu Gly Glu Glu Pro Ser 

35 40 45 

Asp Asn Pro Glu Leu Asp Glu Tyr Leu He Thr Lys Asn Tyr Glu Lys 

50 55 60 

Ala Asn Xaa Gly Glu Asn Asp Glu Lys Glu Lys Asp Glu Asn Thr Leu 
65 70 75 80 

Lys Gly Gly Asp Glu Asp Glu Ser Gly Asp 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

Met Tyr Leu He Glu Pro He Arg Asn Gly Glu Tyr He Thr Asp Gly 

1 5 10 15 

Ala He Ala Leu Ala Met Gin Val Tyr Val Asn Gin His He Phe Leu 

20 25 30 

Asp Glu Asp He Leu Phe Pro Tyr Tyr Cys Asp Pro Lys Val Glu He 

35 40 45 

Gly Arg Phe Gin Asn Thr Ala He Glu Val Asn Gin Asp Tyr lie Asp 

50 55 60 

Lys His Ser He Gin Val Val Arg Arg Asp Thr Gly Gly Gly Ala Val 

65 70 75 80 

Tyr Val Asp Lys Gly Ala Xaa Asn Met Cys Cys He Leu Glu Gin Asp 

85 90 95 

Thr Ser He Tyr Gly Asp Phe His Arg Phe Tyr Gin Pro Ala He Lys 



85 



90 



100 



105 



110 



Ala Leu 



His 



Thr 



Leu 



Gly 



Ala 



Thr 



Asp 



Val 



He 



Gin Ser Gly Arg Asn 
125 



115 



120 
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Asp Leu Thr Leu Asn Gly Lys Lys Val Ser Gly Asp Xaa Xaa Val Glu 

130 135 140 

Arg Cys 
145 

(2) INFORMATION FOR SEQ ID NO: 188: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Met Tyr Val Xaa Asp Asp Ala Leu Asn Asp Val Xaa Pro Ala Arg Xaa 

15 10 15 

Met Gly Met Val Ser Val Trp Tyr Lys Gin Glu Asp Ala Glu lie Glu 

20 25 30 

Pro Leu Glu Glu Glu Val Asp Phe Thr lie Thr Thr Val Glu Glu Leu 

35 40 45 

Leu Thr lie Leu Pro He Lys Asn Asp Asn Xaa Arg Arg Lys Leu Trp 

50 55 60 

He Tyr Leu Leu Glu Lys Asp Gly Thr Ser He His Tyr Ser Thr Leu 
65 70 75 80 

Gly Glu Gly Tyr Pro He Val Leu He His Thr Val Leu Asp Asn Xaa 

85 90 95 

Ser Val Phe Asn Xaa Leu Ala Ala Gin Leu Pro Gin Ser Phe Gin Val 

100 105 HO 

Val Leu He Asp Leu Pro Trp Thr Trp Leu Phe Leu He Asn Leu Val 

115 120 125 

Thr Leu Lys 
130 

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

Met Gly Phe Gin Xaa Xaa Leu Ser lie. Lys Thr Xaa His Gly Gly Ala 

1 5 10 15 

lie Val Gin Ser Tyr Pro Leu Leu Met Arg Gin Glu Lys Ser Asn Asp 

20 25 30 

Trp Asp Phe Lys Gin He Xaa Asp Glu He Leu Glu Asn Leu Asn Asn 

35 40 45 

Glu Ser He Asp Thr He Gly Val Val Leu Arg Asn 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

Met Gin Ser Tyr Pro Ser Thr Lys Phe Val Gin Phe Tyr Lys Asp Asp 

15 10 15 

Val Ser Pro Ser Met Val Asp Gly Asn Gly Arg Leu Lys Ser Asp Thr 

20 25 30 

Asn Val He Ser He Glu Gly Gly Lys Tyr Asp Glu Asn Lys 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 191: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 
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Met Leu Lys Leu Arg Glu Ala Thr Asn Glu Tyr lie Gly Ala Asn Leu 

15 10 15 

Asp Pro Ser His Leu Trp Trp Gin Gly lie Asp Pro lie Ala Ala lie 

20 25 30 

Arg lie Leu Gly Gin Ala Asn Ala lie His His Phe His Ala Lys Asp 

35 40 45 

Thr Tyr lie Asn Gin Glu Asn Val Asn Met Tyr Gly Leu Thr Asp Met 

50 55 60 

Gin Pro Tyr Gly Asn Val Ala Thr Arg Ala Trp Thr Phe Arg Thr Val 
65 70 75 80 

Gly Tyr Gly His Ser Pro Tyr Val Trp Ala Asp lie lie Ser Gin Leu 

85 90 95 

lie lie Asn Gly Tyr Asp Tyr Val Leu Ser lie Glu His Glu Asp Pro 

100 105 110 

lie Met Ser Val Glu Glu Gly Phe Gin Lys Ala Xaa Gin Thr Leu Lys 

115 120 125 

Ser Val Asn lie Tyr Asp Lys Pro Ala Asp Met Trp Trp Ala 
130 135 140 



<2> INFORMATION FOR SEQ ID NO: 192: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STKANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 



Met Val Gin Met His Leu lie Gly 

1 5 
Phe Ser Gly Leu Val lie Gly Val 
20 

Gly Ala He Val Leu Phe Gly Phe 

35 40 
Thr He Ala Thr Val He He Val 

50 55 
Gly Thr Leu He Asn Lys Phe Asn 
65 70 



Val Val Asn Ala Cys He Val Thr 

10 15 
Lys Gly Met Pro Thr Pro He Ala 
25 30 
Asn Asn Ala Val Thr Ser He Val 
45 

Ser He Val Leu Ala Tyr He He 
60 

Leu Met Asn He Asn Phe Lys Met 
75 80 
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Pro Ser Lys Lys Asn His lie Lys Glu Ser Val 
85 90 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

Met Leu lie Leu Ser Lys Trp Gin Gly Val lie Leu Gin Asp Asp lie 

15 10 15 

Ala Gin Gly Leu Asn Thr Ser Tyr Glu Thr Ala Glu Lys Val Lys His 

20 25 30 

Gin Tyr Gly His Ala Phe Tyr Asp Ser Ala Ser Asp Gin Asp He Phe 

35 40 45 

Thr Val Glu Gin Val Asp Ser Asp Glu Thr Val Gin Tyr Thr His Lys 

50 55 60 

Asp Leu Ser Asp Phe He Glu Ala Arg Val Glu Glu He Phe Phe Glu 
65 70 75 80 

Val Phe Asp Val Leu Gin Asp Leu Gly Leu Thr Gin Val Asn Gly Gly 

85 90 95 

Phe He Val Thr Gly Gly Ser Thr Asn Leu Leu Gly Val Lys Glu Leu 

100 105 110 

Leu Ser Asp Met Val Ser Glu Lys Val Xaa 
115 120 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
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Met lie Phe His Gin Val Glu Val Asn Gin Tyr Ala Val Phe Gin Tyr 

1 5 10 15 

Pro Pro Tyr Asp lie Thr Leu Ser Thr Lys lie Ala Glu Val lie Lys 

20 25 30 

Glu Tyr Asp Leu Asp Leu Leu His Met His Tyr Ala Val Pro His Ala 

35 40 45 

lie Cys Gly He Leu Ala Arg Glu Met Ser Gly Lys Asp He Lys He 

50 55 60 

Met Thr Thr Leu His Gly Tyr 
65 70 

(2) INFORMATION FOR SEQ ID NO: 195: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

Met Xaa Xaa Thr Pro Thr Phe Xaa Phe Asn Asp Tyr His He Met He 

15 10 15 

Val His Pro Trp Xaa Leu Asp Asp Val Leu His Ser Asp Tyr Gin Xaa 

20 25 30 

Glu Val Asp Glu Glu Xaa He He Glu Ala Thr Xaa Xaa Leu Asp Tyr 

35 40 45 

Tyr Ala Gly Leu Ser Phe Arg Thr Leu Val Pro Lys Tyr Pro Ala Met 

50 55 60 

Ser Pro His He Asn Leu Ser Thr Thr Val His He Thr Gly 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Met Thr His Tyr His Asn Val Ser Arg Lys Gin Gin Glu Val Xaa Leu 

15 10 15 

Xaa Gly Gin lie Xaa Pro Xaa Asn Thr Tyr His Asn Lys Glu lie Tyr 

20 25 30 

Ala Phe Ala Gly Tyr Leu Asn Xaa 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 : ^ 

Met Gin Lys Pro Phe Tyr Thr Asn Val Leu lie Ala Lys Leu Gin Ala 

15 10 15 

lie Tyr Arg Arg Val Tyr Glu Phe Thr Ala Glu Glu Lys Arg Thr Leu 

20 25 30 

Thr Trp Gin Asp Ala Val Val Asp Leu Ser Gin Asp Ser lie His Lys 

35 40 45 

Gly Val Asp Thr lie Phe Leu Ser Lys Thr Glu Met Xaa lie Leu Glu 

50 55 60 

lie Leu lie Ser Lys Lys Asn Gin lie Gly Ser Arg Xaa Thr lie lie 
65 70 75 80 

Thr Xaa Leu Trp Asp Asp Glu Ala Phe Val Ser Asp Asn Thr Leu Thr 

85 90 95 

Val Asn Val Asn Arg Leu Arg Lys Lys Leu Ser Glu lie Ser Met Asp 

100 105 110 

Ser Ala lie Glu Thr Thr Val Gly Lys Gly Tyr Met Ala His Glu 



35 



40 



115 



120 



125 



(2) INFORMATION FOR SEQ ID 



NO: 198: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Met Thr Leu Lys Asp lie Ala His Val Gly Lys Phe Gly Cys Ala Asn 

15 10 15 

Cys Tyr Ala Thr Phe Lys Asp Asp lie lie Asp He Val Arg Arg Val 

20 25 30 

Gin Gly Gly Gin Phe Glu His Val Gly Lys Thr Pro His Ser Ser His 

35 40 45 

Lys Lys He Ala Leu Lys Arg Lys He Glu Glu Lys Asn Glu Tyr Leu 

50 55 60 

Lys Lys Leu He Glu He Gin Asp Phe Glu Glu Ala Ala He Val Arg 
65 70 75 80 

Asp Glu He Lys Ala Leu Lys Ala Glu Ser Glu Val Gin His Asp Asp 
85 90 95 

Ala 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C } STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

Met Phe Ala Tyr Asp Gly Trp He His Val Gly Asn Val Ala Gly Glu 

15 10 15 

Leu Lys Asn Pro Lys Arg Asp Leu Pro Leu Ala He Ser Val Gly He 

20 25 30 

Gly Cys He Met Ala Val Tyr Leu Leu He Asn Ala Thr Phe Leu Leu 

35 40 45 

Thr Leu Pro He Glu Leu Leu Ala Gly Asn Leu Asn Ala Ala Ser Arg 
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60 



(2) INFORMATION FOR SEQ ID NO: 200: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 



Met Lys Lys Leu Ala Asn Tyr Leu Trp Val Glu Lys Val Gly Asp Leu 

15 10 15 

Tyr Val Phe Ser Met Thr Pro Glu Leu Gin Asp Asp He Gly Thr Val 

20 25 30 

Gly Tyr Val Glu Phe Val Ser Pro Asp Glu Val Lys Val Asp Asp Glu 

35 40 45 

He Val Ser He Glu Ala Ser Lys Thr Val He Asp Val Gin Thr Pro 

50 55 60 

Leu Ser Gly Thr He He Glu Arg Asn Thr Lys Ala Glu Glu Glu Pro 
65 70 75 80 

Thr He Leu Asn Ser Glu Asn Pro Glu Glu Asn Trp Leu Phe Lys Leu 

85 90 95 

Asp Asp Val Asp Lys Glu Ala Phe Leu Ala Leu Pro Glu Ala 
100 105 HO 



(2) INFORMATION FOR SEQ ID NO:201: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



356 



WO 97/30070 

Met lie Gin Val His Gin Ala Arg lie 

1 5 
Val Lys Ala Gin Leu Asp Asp Phe lie 

20 25 
Val Leu Val Ala Pro Leu lie Pro Gly 

35 40 
Leu Lys Leu Leu Ala Glu lie Tyr Leu 
50 55 
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lie Ala Gly Thr Gin Glu Gin 
10 15 
Ala Thr Phe Glu Val Asp Glu 
30 

lie Glu Gin Arg Cys Lys Thr 
45 



(2) Information for seq id no : 202: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



Met lie Gin Val His Gin Ala Arg lie He Ala Gly Thr Gin Xaa Gin 

15 10 15 

Gly Lys Ala Gin Leu Asp Asp Phe He Ala Xaa Xaa Xaa Val Asp Glu 

20 25 30 

Val Leu Val Ala Pro Leu He Pro Gly He Val Gin Arg Cys Lys Thr 

35 40 45 

Leu Lys Leu Leu Xaa Xaa He Tyr Leu 
50 55 

(2) INFORMATION FOR SEQ ID NO: 203: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



Met 



Phe Asp Ser lie Arg Glu Thr He Asp Tyr Ala Ala Glu Asn Asn 
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1 



10 



15 



Met Ser Phe Ala Asp lie Met Val Lys Glu Glu Met Glu Leu Ser Gly 

20 25 30 

Lys Ser Arg Asp Glu Val Arg Ala Gin Met Lys Gin Asn Leu Asp Val 

35 40 45 

Met Arg Asp Ala Val lie Lys Gly Thr Thr Gly Asp Gly Val Glu Val 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

Met Tyr Thr Gly Ser Asp Val Arg Thr Val Gly Asp Leu Gly Ser lie 

15 10 15 

Lys Gin Pro Leu Pro His Phe Leu lie Pro Asn Val Pro Phe Asn Leu 

20 25 30 

Glu Thr Leu His lie lie Phe Pro Tyr Ser Leu Ser Met Ala lie Val 

35 40 45 

Gly Leu Val Glu Ser Leu Leu Thr Ala Lys lie Val Asp Asp Ala Thr 

50 55 60 

Asp Thr Tyr Ser Ser Lys Asn Arg Glu Ser Arg Gly Gin Gly lie Ala 
65 70 75 80 

Asn Met lie Thr Gly Leu Phe Gly Gly Met Gly Gly Cys Ala Met lie 

85 90 95 

Gly Gin Ser Val lie Asn Val Lys Ser Gly Ala His Ser Arg Leu Ser 

100 105 110 

Thr Phe Thr Ala Gly lie Val Leu He Phe Met He He Val Leu Gly 

115 120 125 

Gly Leu Val Val Gin Val Pro Met Pro He Leu Ala Gly He Met Val 

130 135 140 

Met Val Ser Val Gly Thr Phe Asp Trp Asn Ser Phe Lys Tyr He Lys 
145 150 155 160 

Lys Ala Pro Xaa Asn Arg Cys Ser Cys Tyr Ala His Tyr Gly Asp Tyr 

165 170 175 

Arg Phe Asn Asp Ser 



50 



55 



60 
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(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

Met Glu Gly Glu Thr Val Gly lie Asn Asp Pro Asn lie Glu lie Tyr 

15 10 15 

Ser Glu Thr Leu Asp Leu Tyr His Thr Leu Val Pro Leu Asn Gin Glu 

20 25 30 

Leu His Lys Leu Arg Leu Lys Thr Gin Asn Leu Xaa Asn Glu Asn Tyr 

35 40 45 

Asn lie Asn Asp Val 
50 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 121 amino acids 
(B> TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

Met Gin Xaa Xaa Asp Xaa Val Gly Phe Gly Lys Gly Tyr Met Ala Pro 

15 10 15 

Leu Tyr Gly Ser Lys Lys Asn Xaa Val Tyr Asp Asn Lys Xaa Ser Asp 

20 25 30 

Lys Asn Ser Val Gly Leu Pro Asn Ser Arg Ala Leu lie Trp Gin Gin 

35 40 45 

Phe Asp Tyr Asn Asn Ser Arg Glu Arg Xaa Asp Xaa Arg Pro Leu Val 
50 55 60 
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Met Thr Gin Gin Asp Gly Glu lie Pro Thr Leu Asp Pro lie Arg Ser 
fi 5 70 75 80 

lie Ala Pro Xaa Val Tyr Ser He Asn Met Asn Met Tyr Thr Arg Leu 

85 90 95 

Leu Leu Leu Asp Glu Asn Asp His Leu Thr Thr Lys Gly Ser Leu Ser 

100 105 no 

Arg Asp Tyr Ala Val Asn Lys Arg Gin 
115 120 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

Met Thr Thr Val Tyr Tyr Asp Gin Asp Val Lys Thr Asp Ala Leu Gin 

1 5 10 15 

Gly Lys Lys He Ala Val Val Gly Tyr Gly Ser Gin Gly His Ala His 

20 25 30 

Ala Gin Asn Leu Xaa Asp Lys Trp He Leu Met 
35 40 

(2) INFORMATION FOR SEQ ID NO:208: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

Met Cys Leu Arg Glu Asp Xaa Val Thr Asp Ala Asn He Val His His 

15 10 15 

lie He Tyr Val Asp Glu Asp Phe Asn Lys Ala Leu Asp Leu Asp Asn 
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20 25 30 

Leu Met Ser Val Cys Tyr Ser Cys His Asn Lys lie His Ala Asn Asp 

35 40 45 

Asn Asp Lys Ser Asn Leu Lys Lys lie Arg Val Leu Lys lie 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

Met Lys Gin Ala Arg lie Leu Phe Asp Glu Ser Lys Ala Met lie Lys 

1 5 10 15 

Ala Ser Pro Xaa Leu Arg Glu Asn Phe Arg Pro Leu Arg Asp Glu lie 

20 25 30 

His Tyr Asp Ala Leu Tyr Leu Lys lie Cys Thr Gly Phe Arg Gin 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 109 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

Met Lys Asn Ser Tyr Gin Ala Gin Lys Val He Glu Glu Val He Lys 

1 5 10 is 

Glu Lys Pro Lys Ala Arg Trp Leu Phe Leu Thr Leu Pro Thr Lys Asn 

20 25 30 

Ala He Asp Gly Asp Thr Leu Glu Gin Ser Leu Lys His Leu Thr Lys 
35 40 45 
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Ala Phe Asp Arg Leu Ser Arg Tyr Lys Lys Val Lys Gin Asn Leu Val 

50 55 60 

Gly Phe Met Arg Ser Thr Glu Val Thr Ala Asn Lys Asn Asp Gly Ser 
65 70 75 80 

Tyr Xaa Gin His Met His Val Leu Val Cys Val Glu Asn Ala Tyr Phe 

85 90 95 

Arg Lys Lys Glu Asn Tyr Leu Thr Gin Arg Arg Met Gly 
100 105 



(2) INFORMATION FOR SEQ ID NO: 211: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

Met Tyr Gly Glu Leu Leu Lys Leu Lys Asn Val Glu Val Arg Leu Thr 

15 10 15 

Asp lie Ala Gly Phe His Ala Lys Gly Tyr lie Phe Glu His Lys Asp 

20 25 30 

Tyr Ser Ser Met Val lie Gly Ser Ser Asn Leu Thr Ser Asn Ala Leu 

35 40 45 

Lys Val Asn Ser Glu His Asn Val Leu Leu Ser Thr Met Lys Asn Gly 

50 55 60 

Asp Leu Val Asp Ser Val Lys Asn Glu Phe Glu Leu Leu Trp Gin Lys 
65 70 75 80 

Ser Thr Pro Leu Thr Glu Gin Trp lie Asn Ser Tyr Lys Glu Ser Xaa 

85 90 95 

Glu Tyr Arg Ser Leu Glu Lys Leu Ala Glu Val Glu Gin Thr Gin Met 

100 105 110 

Leu Leu Ala Asp Lys Val Lys Lys Ser Val Glu lie Val Pro Asn Leu 

115 120 125 

Met Gin Ala Glu Ala Leu Arg Ser Leu Lys Ala lie Arg Asp Lys Xaa 

130 135 140 

Lys Asp Lys Ala Leu lie lie Ser Cys Asn Trp Val Gin Gly Lys Thr 
145 150 155 160 

lie Leu Cys Ala Leu Asp Cys 
165 
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(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



Met Tyr Gin Gin Ser Ser Phe Lys Glu Asn Leu He His Trp Phe Asp 

15 10 15 

Glu Asn Gin Arg Glu Met Pro Trp Arg Gin Thr Thr Asn Pro Tyr Tyr 

20 25 30 

He Trp Leu Ser Glu Val Met Leu Gin Gin Thr Gin Val Lys Thr Val 

35 40 45 

He Asp Tyr Tyr His Arg Phe Gly Xaa Arg Phe Pro Thr Val Glu Val 

50 55 60 

Leu Ser Gin Ala Ser Glu Asp Glu Val Leu Lys Tyr Trp Glu Gly Leu 
65 70 75 80 

Gly Tyr Tyr Ser Arg Ala Arg Asn Phe His Thr Ala He Lys Glu Val 

85 90 95 

Xaa Asp Lys Tyr Glu Gly Leu Val Pro Lys Asp Pro Asp Gin Phe Lys 

100 105 110 

Ala Leu Lys Gly Val Gly Pro Tyr Thr Gin Ala Ala Val Met Ser He 

115 120 125 

Ala Tyr Asn Val Pro Leu Ala Thr Val Asp Gly Asn Val Phe Arg Val 

130 135 140 

Trp Ser Arg Leu Asn Asp Asp Tyr Arg Asp He Lys Leu Gin Ser Thr 
145 150 155 160 

Arg Lys Ser Tyr Glu Gin Glu Leu Leu Pro Tyr Val Thr Thr Glu Ala 

165 170 175 

Gly Thr Phe Asn Gin Ala Met Met Glu Leu Gly Ala Leu He Cys Xaa 

180 185 190 

Pro Lys Asn Pro Leu Cys Leu Phe Xaa Pro Val Gin Glu Asn Cys Glu 

195 200 205 

Ala Phe Asp Lys Gly Pro Phe Glu Lys Leu Pro Val Lys Ser Lys Asn 

210 215 220 

Val Ser Lys Xaa Val He Glu Gin Ser Val Xaa Leu He Arg Asn Asn 
225 230 235 240 
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Gin Gly Gin Tyr Leu Leu Gin Lys Arg. Arg Glu Xaa Leu Xaa Tyr Gly 

245 250 255 

Met Trp Gin Xaa Pro Met Xaa Asp Ser Glu His Xaa Arg Arg Lys Met 

260 265 270 

Xaa Glu Lys lie Gly His Asp lie Xaa Pro Xaa Glu Thr Pro lie Xaa 

275 280 285 * 

Glu Leu Thr His Gin Phe Thr His Leu Thr Trp Lys lie Lys Val Tyr 

290 295 300 

Ala Ala Ser Gly Ala lie Asn lie Xaa Thr Leu Pro Asp Asp Met Xaa 
305 310 315 320 

Trp Val 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

Met Ser Asn Thr Asp Lys Tyr Leu Arg Asp lie Ala Arg Glu Leu Lys 

15 10 15 

Gly lie Arg Lys Glu Leu Gin Lys Arg Asn Glu Thr Val lie lie Asp 

20 25 30 

Ala Asn Leu Asp Ser Val Arg Ser Ala Val Leu Xaa Asn Lys Xaa Lys 

35 40 45 

Pro Lys Tyr Asn Glu Pro Leu Phe 
50 55 

(2) INFORMATION FOR SEQ ID NO: 214: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

Met Lys His Phe Glu Ala Gly Ala Asp Gin Ser lie Val Gin Gly Phe 

15 10 15 

Gin Tyr Gin Gin Thr Val Thr Arg Val Asp Asn Pro Thr lie Leu Leu 

20 25 30 

Lys Leu Leu Arg Asn Leu Leu Gin Leu Thr Pro lie Ser Tyr 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

Met Thr Val Phe Val Met Gin Leu Gin Ser Asn Leu Asn Ser lie Glu 

15 10 15 

Glu Leu lie Ser Gin Ser Arg Trp Ser Tyr Arg Xaa Xaa Pro Arg Thr 

20 25 30 

Val Asn Tyr Xaa Tyr Asn Gin Asp Lys Leu Met His Xaa Leu Gly Asp 

35 40 45 

lie Xaa Ala Gin Tyr Gly Xaa Gin His Asp Thr Gly Leu Xaa Pro His 

50 55 60 

Glu Trp Xaa Tyr His lie Ala Pro Arg Gly Xaa Ala Asp lie Ala Gin 
65 70 75 80 

His Asn Arg Asp Gly Gin Pro lie Tyr Val Ser Leu Ser Tyr Ser Tyr 

85 90 95 

Pro Tyr lie Val Cys Val Val Asp Lys Xaa Pro Gly Gly lie Asp lie 

100 105 110 

Glu Lys He Ser Xaa Arg Leu Xaa Trp Arg Asn Arg 



35 



40 



45 



115 



120 



(2) INFORMATION FOR SEQ ID 



NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 98 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Met Asn Ala Leu Leu Asn His Met Asn Thr Glu Gin Xaa Glu Ala Val 

1 5 10 15 

Lys Thr Thr Glu Gly Pro Leu Xaa Xaa Met Ala Gly Ala Gly Ser Gly 

20 25 30 

Lys Thr Arg Val Leu Thr His Arg He Ala Tyr Leu Leu Asp Glu Lys 

35 40 45 

Asp Val Ser Pro Tyr Asn Val Leu Ala He Thr Phe Thr Asn Lys Ala 

50 55 60 

Ala Arg Glu Met Lys Glu Arg Val Gin Lys Leu Val Gly Asp Gin Ala 
65 70 75 80 

Glu Val Met Arg Met Ser Thr Phe His Ser Met Cys Val Arg He Xaa 
85 1 90 95 

Thr Ser 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

Met Lys He Ala He Val Gly Ser Gly Asn Gly Ala Val Thr Ala Ala 

1 5 10 15 

Val Asp Met Val Ser Lys Gly His Asp Val Lys Leu Tyr Cys Arg Asn 

20 25 30 

Gin Ser He Ser Lys Phe Gin Asn Ala He Glu Lys Gly Gly Phe Val 

35 40 45 

Ala Arg Lys Met Thr Leu Leu Lys Asp Cys Gly He Ser Leu Arg Thr 
50 55 60 
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Ser Val Val Thr Phe Gly Thr Glu Ser 
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(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

Met Lys lie Ala lie Val Gly Ser Gly Asn Gly Ala Val Thr Ala Ala 

1 5 10 15 

Val Asp Met Val Ser Lys Gly His Asp Val Lys Leu Tyr Cys Arg Asn 

20 25 30 

Gin Ser lie Ser Lys Phe Gin Asn Ala lie Glu Lys Gly Gly Phe Asp 

35 40 45 

Phe Asn Asn Glu Gly Asp Glu Arg Phe Val Lys Phe Thr Asp lie Ser 

50 55 60 

Asp Asp Met Glu Tyr Val Leu Lys Asp Ala Glu lie Xaa Gin Val lie 
65 70 75 80 

He Pro Ser Ser Tyr He Glu Tyr Tyr Ala Asp Gly Asn Gly Arg Ala 
85 90 95 

Cys Asn 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 : 

Met Asn Pro Leu Leu He He Gly Lys Arg Ser Tyr Ser Leu Tyr Leu 
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1 5 10 15 

Trp His Tyr Pro lie lie Val Phe Val Asn Ser Tyr Tyr Xaa Gin Gly 

20 25 30 

Gin He Pro Val Tyr Val Tyr He He Glu He Val Leu Thr Ala Leu 

35 40 45 

Met Ala Glu He Ser Tyr Arg Phe He Glu Thr Pro He Arg Lys Lys 

50 55 60 

Gly Phe Lys Ala Cys Ala Phe Xaa Pro Lys Lys Lys Gly Gin Cys Ala 
65 70 75 80 

Arg Thr Xaa Leu Xaa He Leu Leu Leu Ala Pro Ser Phe Val Cys Ser 

85 90 95 

Xaa Gly Gin Gly Gly Cys Thr Trp Gin Thr Thr 
100 105 



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

Met Asp Gly Ser Gin Val Thr Gly Val He Val Thr Gly Thr Xaa Asn 

15 10 15 

Arg Glu Asn Tyr Gly He Xaa Phe Thr Gin Ala Leu Glu Thr Gly Gly 

20 25 30 

Val Met Leu Gly Lys Asp Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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Met Arg Leu Ser Cys Phe lie Gin Lys Lys Xaa Xaa Pro lie Asn Gin 

15 10 15 

Met Leu lie Asp Glu Ser Met Pro Phe lie Val lie Gly Lys Pro Thr 

20 25 30 

Ser Asp lie Asp His Gin Phe Thr His lie Asp Asn Asp Asn lie Leu 

35 ' 40 45 

Ala Ser Xaa Asn Leu Thr Arg His Val lie Glu Gin Gly Val Asp Glu 

50 55 60 

Leu lie Phe lie Thr Glu Lys Gly Asn Phe Glu Val Ser Lys Asp Arg 
65 70 75 80 

He Pro Gly Phe Glu Thr Val Ala Ser Pro Phe Asn Leu Asp Tyr Gin 

85 90 95 

He He Glu Thr Ser Asn Glu Arg Glu Val He Leu Asn Tyr Met Gin 

100 105 HO 

Asn Leu His Thr Arg Leu Lys Asp Pro Asn He Lys Gin Ala He He 

115 120 125 

Ser Leu Asp Ala Met Leu His Leu Ala lie Leu Ser Val Leu Tyr Glu 

130 135 140 

Leu Asn He Glu He Pro Lys Asp Val Met Thr Ala Thr Phe Asn Asp 
145 150 155 160 

Ser Tyr Leu Xaa Glu He Ala Ser Pro Pro Gin Xaa Cys He Asp He 

165 170 175 

Lys Pro Arg Met Leu Gly Gin Gin Ala Gly Ser Ala He Leu Xaa He 

180 185 190 

Leu Lys Asn Lys Ala Gin Asp Val He Glu Leu Val He He Asp Thr 

195 200 205 

Arg He Xaa Asn Lys Lys He Asn Thr Ala He Ala Lys Gly Phe Ser 

210 215 220 

Asn Glu Leu Ser Asn Gly Trp Glu Arg Ser Lys 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
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Met Xaa Xaa Asp Xaa Thr Xaa Ser 

1 5 
lie Xaa Xaa Pro Xaa Asp Gly Val 
20 

Xaa Tyr Ala Xaa Xaa Xaa Met Ala 

35 40 
Xaa Ser Arg Ser Arg Tyr Xaa 
50 55 



Xaa Pro His Leu Xaa Ser Gin Thr 

10 15 
Ser Leu His Leu Xaa Thr Xaa Xaa 
25 30 
Xaa Pro Asp Arg Leu Ser Lys Ser 
45 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 



Met lie Thr Leu Gin Met Val Glu 

1 5 
lie Leu Glu Lys Leu Ser Lys Gin 
20 

Gly Thr Arg Ala Glu Glu Leu Gin 

35 40 
Glu Val Pro lie lie Val Glu Arg 

50 55 
Thr Met Asp Tyr Lys Val Ser Gin 
65 70 
Leu Ala Pro Met Leu Asp Asn 
85 



He Leu His Gly Phe Met Met Gin 

10 15 
Gin He Glu Ala He He Val Thr 
25 30 
Leu Arg Leu Lys Leu Ala Glu Val 
45 

Asp He Tyr Lys Ala Thr Ala Lys 
60 

Leu Gin Tyr Gin Thr He His Gin 
75 80 



(2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Met Thr Asp Gly Pro lie Lys Val Asn Ser Glu lie Gly Ala Leu Lys 

15 10 15 

Thr Val Leu Leu Lys Arg Pro Gly Lys Glu Leu Glu Asn Leu Val Pro 

20 25 30 

Asp Tyr Leu Asp Gly Leu Leu Phe Asp Asp lie Pro Tyr Leu Glu Val 

35 40 45 

Ala Gin Lys Glu His Asp His Phe Ala Gin Val Leu Arg Glu Glu Gly 

50 55 60 

Val Glu Val Leu Tyr Leu Glu Lys Leu Ala Ala Glu Ser lie Glu Asn 
65 70 75 80 

Pro Gin Val Arg Ser Glu Phe lie Asp Asp Val Leu Ala Glu Ser Lys 

85 90 95 

Lys Thr lie Leu Gly His Glu Glu Glu lie Lys Thr Leu Phe Ala Thr 
100 105 110 

Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

Met Lys Ser Lys Leu Ser Asn Ser Gly lie Asp Val Thr Val Lys Asp 

1 5 10 15 

Val Glu Lys Tyr Met Asn Arg Tyr Asn Glu Val Met Met Gly Lys Xaa 

20 25 30 

Gly Glu Lys Ala Lys Glu Leu Cys Leu Ser Leu Leu Pro Tyr 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 226: 



371 



WO 97/30070 PCT/US97/02318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



Met Pro Ser Gly Leu Pro lie His Glu Trp Leu Met Leu Lys Gin Phe 

15 10 15 

Ser Lys Tyr Lys Ser Ala Pro Ser Leu Pro He Leu Arg Arg Gly Gin 

20 25 30 

His Tyr Lys Leu His Phe Asp Phe Asp Ala Thr Pro Ala Gly Ser Val 

35 40 45 

Tyr Phe He He He Phe Tyr Asn Lys Asn Gly Thr Asn 
50 55 60 



{2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



Met Met Asn Ala Ala Ser Thr Ser 

1 5 
Thr Glu Met Thr His Gin Asp Asp 
20 

Thr Lys He Asp Asp Asn Gin Tyr 

35 40 
He Ala Glu Pro Ser Asp Thr Tyr 

50 55 
Lys Leu Phe Gly His Val Trp Leu 
65 70 
lie Lys Glu Asn He Lys Gin Leu 
85 



Leu He Phe Arg Cys Leu Thr He 

10 15 
Leu Glu Tyr Lys Ser Met Arg Val 
25 30 
Gly Asn Asp Arg He Asn Val He 
45 

Pro Thr He Ser Asn Asp Phe Leu 
60 

Val Glu Arg Trp Met Asp Asp Asp 

75 80 
Lys Asp Asp Leu Gin Ser Gin Asp 
90 95 
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Thr Leu Thr Ala lie Asn Leu lie Ser Tyr Gly Ser Lys Ser Asn Val 

100 105 110 

Ser Ala Thr Tyr Val Ala Gin His Leu Asp Cys Lys Val Tyr Arg Thr 

115 120 125 

Ser His Glu Asp Asp Asp Leu Lys Glu Trp Leu Thr Glu His Val Pro 

130 135 140 

Arg Lys 
145 

(2) INFORMATION FOR SEQ ID NO: 228: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 228: 



Met Asn Met Phe Gly Leu Gly Asn Met Met Met Arg Tyr Val Met Lys 

15 10 15 

Lys Lys Asn Val Asp Ser Leu Pro Thr Leu lie Asn Gin Ala lie Glu 

20 25 30 

Pro His He Lys Leu He Ala Cys Xaa Met Ser Met Asp Val Met Gly 

35 40 45 

He Gin Arg Glu Glu Leu Arg Asp Glu Val Glu Tyr Gly Gly Val Gly 

50 55 60 

Thr Tyr He Gly Ala Thr Glu Asn Ala Asn His Asn Leu Phe He 
65 70 75 

<2) INFORMATION FOR SEQ ID NO: 22 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
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Met Ser Pro Lys Leu Gly Ala Val Thr Gly Asn Pro Arg lie Arg Asn 

15 10 15 

Lys Ser Ser He Leu Gly Lys He Gin Thr He Glu Tyr Ala Ser Leu 

20 25 30 

He Gly Cys He Lys Arg Ser Gin Thr Leu Ala Gly Ala Val Asn Thr 

35 40 45 

He Ser Gly Val Phe Thr Leu Phe Lys Lys Ser Ala Val Val Asp Val 

50 55 60 

Gly Tyr Trp Asp Thr Asp Met He Thr Glu Asp He Ala Val Ser Trp 
65 70 75 80 

Lys Leu His Leu Arg Gly Tyr Arg He Lys Tyr Glu Pro Leu Ala Met 

85 90 95 

Cys Trp Met Leu Xaa Pro Glu Thr Leu Gly Gly Leu Trp Lys Gin Arg 

100 105 110 

Val Arg Trp Ala Gin Arg Gly Asp Thr Lys Tyr Tyr Tyr Glu Thr Phe 

115 120 125 

Xaa Ser Thr Xaa Thr Lys Thr Lys Gly Phe Pro Leu Tyr He Val He 
130 135 140 

Val 
145 



(2) INFORMATION FOR SEQ ID NO:230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

Met Thr Asn His Asp Gly Asp Gly Glu Lys Gly Glu Glu Xaa Phe Leu 

15 10 15 

Lys Lys He Leu Gly Glu Glu Tyr Thr Gly Asn Val Xaa Glu Gly Asn 

20 25 30 

Gly Arg Asn Phe Val Thr Gin Pro Ala Ser Ser Xaa Thr Ala Val Lys 

35 40 45 

Ser Val Asn He Pro Asp Ser He Lys Thr Leu Lys Val Gly Asp Thr 

50 55 60 

He Arg Phe Lys Cys Cys Ser Arg Ala He 
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(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Met Tyr Val Pro Leu Ser Val Asn Ala Asn Lys Gin Tyr Arg Phe Asn 

15 10 15 

lie Met Leu Glu Asp lie Lys Ala Tyr Leu Pro Lys Glu Lys Xaa Trp 

20 25 30 

Asp Val Phe Leu Glu Val Gin He Gly Thr Glu Val Phe Xaa Val Arg 
35 40 45 

Val Gly 
50 

(2) INFORMATION FOR SEQ ID NO:232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Met Asn He Glu Xaa Asp He Asn Gly Arg Pro Lys His He Tyr Ser 

15 10 15 

He Tyr Arg Xaa Met Met Lys Gin Lys Lys Gin Phe Asp Gin He Phe 

20 25 30 

Asp Leu Leu Ala lie Arg Val lie Val Asn Ser He Asn Asp Cys Tyr 

35 40 45 

Ala He Leu Gly Leu Val His Thr Leu Trp Lys Pro Met Pro Gly Arg 
50 55 60 
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Phe Lys Asp Tyr lie Ala Met Pro Lys Gin Asn Leu Tyr Gin Ser Leu 
65 70 75 80 

His Thr Thr Val Val Gly Pro Asn Gly Asp Pro Leu Glu lie Gin lie 

85 90 95 

Arg Thr Phe Asp Met His Glu He Ala Glu His Gly Val Ala Ala His 

100 105 110 

Trp Ala Tyr Lys Glu Gly Lys Lys Val Ser Glu Lys Asp Gin Thr Tyr 

115 120 125 

Gin Asn Lys Leu Asn Trp Leu Lys Glu Leu Ala Glu Ala Asp His Thr 

130 135 140 

Ser Ser Asp Ala Gin Glu Phe Met Glu Thr Leu 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

Met Lys Leu Lys Gly Lys Asn Xaa He He Leu Thr Gly Leu Asp Arg 

15 10 15 

Gly Tyr Val Phe Glu Glu Gly Met Ala Ser Val Val Leu Lys Asp Asp 

20 25 30 

Met He Met Gly Met Leu Ser Pro Xaa Val Arg Lys Arg Ser Gly Asn 
35 40 45 

Arg Thr 
50 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 : 

Met Ala lie Gly Ser Ser Lys He Asn Phe Ser Asp lie Xaa His Tyr 

1 5 10 15 

Val Thr Gly Xaa Thr Asp Thr Lys Ala Thr Phe Leu Leu His Asn Val 

20 25 30 

Arg Met Pro Arg Met He Ala Gly Leu Phe He Gly Gly Ala Leu Ala 

35 40 45 

Val Ser Gly Leu Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235: 

Met Gly Pro Ser Ser Ser Gly Gly He Thr Val He Gin He Leu Lys 

1 5 10 15 

Leu Leu Glu His Val Asp Leu Pro Ser Met Gly Pro Arg Ser Val Asp 

20 25 30 

Tyr Leu His His Leu He Gin Ala Met His Leu Ala Tyr Ser Asp Arg 

35 40 45 

Ala Gin Tyr Leu Ala Asp Asp Asn Phe His Glu Glu Cys Arg Tyr Ser 
50 55 60 

His 
65 

(2) INFORMATION FOR SEQ ID NO:236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 6: 

Met Pro Ser Lys Leu Leu Lys Gin Lys Lys lie Lys Thr lie His Arg 

1 5 10 15 

Leu Thr Leu lie Thr Thr Gin Leu Val Ala Gly Glu Gly Gly Tyr Val 

20 25 30 

Asp Gly Tyr lie Glu Thr lie Glu Glu Thr Asp Ser Ser Ala lie Asp 

35 40 45 

He Asp Tyr His Thr Ala Val Xaa Cys Glu Xaa Xaa His Val Gly Gly 

50 55 60 

He His 
65 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 232 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Met His Phe Thr Leu Val Phe He Leu Phe Leu Gly Gly He Tyr Met 

15 10 15 

Thr Phe Glu Lys Glu Xaa Val Leu Glu Thr Leu Phe Pro Glu Asp Val 

20 25 30 

Leu Ser He Ala Lys Gly Leu Thr Asp Gly Glu Val Glu Phe Leu Gin 

35 40 45 

Gin Val Asp Ser Leu Leu Glu Ser Lys Tyr Arg Glu Asn He Asn Xaa 

50 55 60 

His Trp He Asp Ala Thr Val Pro Glu Asp Tyr Phe Lys Asp Leu Gly 
65 70 75 80 

Glu Leu Asn Tyr Phe Asn Asn Pro Leu Leu Tyr Lys Asp Arg Pro Asn 

85 90 95 

Ala Lys Met Pro Ser Gin Leu Phe Gin Phe Phe Met Ser Tyr Leu Leu 

100 105 110 

Ala Arg Phe Asp He Ser Leu Ala Thr Leu Leu Gly Val His Gin Gly 
115 120 125 
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Leu Gly His Asn Thr Phe Tyr Phe Gly Gly Ser Lys Glu Gin lie Ala _ 

130 135 140 

Lys Tyr Val Pro Lys Leu Gin Ser His Glu Leu Arg Thr Cys Phe Ala 
145 150 155 160 

Leu Thr Glu Pro Glu His Gly Ser Asp Val Ala Gly Gly Leu Glu Thr 

165 170 175 

Val Xaa Glu Arg Gin Gly Asp Thr Trp Val He Asn Gly Glu Lys Lys 

180 185 190 

Trp He Gly Gly Ala His Val Ser Asp Val He Pro Val Phe Ala Val 

195 200 205 

Asn Lys Lys Thr Gly Lys Pro His Cys Phe Val Val Arg Pro Glu Gin 

210 215 220 

Asp Gly Val Asp He Glu Val Asn 
225 230 

(2) INFORMATION FOR SEQ ID NO: 238: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 



Met Asn Leu Thr Asn Ser Phe Leu 

1 5 
Val Val Phe Gly Gin Thr Lys Ala 
20 

Gin Gly Lys Ser Val Xaa Glu Ala 

35 40 
Lys Val Gin Asp He He Xaa Pro 

50 55 
Ala Cys Ala Ser Trp Asp Gin Tyr 
65 70 
Lys Phe He Glu Arg Phe Arg Xaa 
85 



He Trp Lys Met Xaa Arg Thr Met 

10 15 
Xaa Phe Ala Lys Leu Gly Asn Ser 
25 30 
Asn Asn Val Glu Asp Ala Val Asp 
45 

Asn Asp Val Val Leu Leu Ser Pro 
60 

Ser Thr Phe Glu Glu Arg Gly Glu 

75 80 
His Leu Pro Ser Tyr 
90 



(2) INFORMATION FOR SEQ ID NO: 239: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

Met Lys Ser Phe Thr Asp Leu Glu Lys Xaa Asn Leu Glu His Thr Tyr 

15 10 is 

Val Asp Gly Gin Leu Val Val Ser Gly Arg Met Pro Ala Arg Lys Pro 

20 25 30 

Asn He Gin Glu Asp His Tyr Phe Gly Ala He Ser Lys His Asp Ser 

35 40 45 

Ser Asp Glu Phe Val Met Phe lie Leu Arg Ala Asn Gin Asp Gly He 

50 55 60 

Thr Leu Val Glu Lys Thr Asn Phe Leu Gly Val Asn Gly Ser Ala Thr 
65 70 75 80 

Tyr Xaa He Thr Leu Asn Gin Val Val Val Pro Gin Ser Gin He He 

85 90 95 

Thr His Asp Ala Lys Pro Val Cys Gly Asn Leu Phe Ala Arg Asn Leu 

100 105 no 

Leu Leu Thr Lys Phe Gin 
115 



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 



Met Lys Glu Asn Asp Val Val Leu Arg Xaa Val Thr Lys Leu Val Val 

15 10 15 

Phe He Leu Leu Thr Phe Gly Phe Tyr Val Phe Phe Ala Gly His Asn 

20 25 30 

Asn Pro Gly Gly Gly Phe He Gly Gly Leu He Phe Ser Ser Ala Phe 
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35 40 45 

lie Leu Met Phe Leu Ala Phe Asn Val Glu Glu Val Leu Glu Ser Leu 

50 55 60 

Pro lie Asp Phe Arg lie Leu Val lie lie Gly Ala Leu Val Ser Ser 
65 70 75 80 

lie Thr Ala lie lie Pro Met Phe Leu Gly Lys Pro Phe Gly Ala Gin 
85 90 95 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

Met Gin His His Lys Val Ala He He Gly Ala Gly Ala Ala Gly He 

15 10 15 

Gly Met Ala He Thr Leu Lys Asp Phe Gly He Thr Asp Val He He 

20 25 30 

Leu Glu Lys Gly Thr Val Gly His Ser Phe Lys His Trp Pro Lys Ser 

35 40 45 

Thr Val Arg Ser Arg His His Leu Arg Leu Met Asp Leu Ala Cys Leu 
50 55 60 

He 
65 

(2) INFORMATION FOR SEQ ID NO:242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
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Met Lys Tyx Lys Lys Leu Lys Val 

1 5 
Ser Val Phe lie Asp Val Thr lie 
20 

Val Glu Ala His Gin He Cys Asp 

35 40 
Lys Gly He Ser Ser Val Tyr Val 
50 55 



Leu Lys Gly Val Thr His Gly Ser 

10 15 
Val Val Asp Ala Asn Leu Ser Leu 
25 30 
Asn Val Glu His His Leu His Xaa 
45 

His Pro Glu Pro Asp His Leu 
60 



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 



Met Met Leu Leu He Phe Asn Asn He Gly He He Ser He Thr Asp 

15 10 15 

Phe Thr Asn Arg Ala Thr Met Gin Asn Glu Asn Lys Asp Pro Tyr Gly 

20 25 30 

Glu Lys Leu Ala Tyr Gly He Ala Phe Asn Gly Ser Val Asp Met Gin 

35 40 45 

Gly Asp Lys Gin Val Thr He Pro Lys Tyr Ser Val Val Thr He Thr 

50 55 60 

Gly Glu Asn Ser Lys Asn Tyr Arg Val Thr Ala Asp Asn Lys Thr Tyr 
65 70 75 80 

Tyr Val Ser Lys Asp Lys Leu Glu Tyr Phe Asn Pro Ala Gly Leu Tyr 

85 90 95 

Gin Thr His Ser Phe Lys Lys Leu Ala Pro Tyr Met Lys Ser Asn Tyr 

100 105 110 

Ser Asn Tyr Tyr Ala Tyr Phe Asn Ser Gin Leu His Lys Lys His Arg 

115 120 125 

Phe Ser Tyr Lys Thr Leu Val Pro Asp Asp Asp Asn Arg Xaa Arg Leu 

130 135 140 

Xaa Pro Xaa Thr Gin Xaa Pro Xaa Gin Leu Xaa Xaa Xaa Arg Trp Xaa 
145 150 155 160 
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lie lie Gin Tyr Xaa Arg Xaa Cys Leu 
165 

Asn Xaa Xaa lie Xaa Leu lie Leu Thr 
180 185 
Leu Gly Met Asp Xaa Val Leu Pro lie 
195 200 
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Ser Xaa Cys Arg Xaa Xaa Lys 
170 175 
lie Thr Phe Gly Leu Leu Lys 
190 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

Met Val Ala Asn Asn Ala Asn lie Ser Pro Lys Pro Gly Thr Arg Tyr 

15 10 15 

Lys Ala Xaa Ala Thr Asn Thr Asn Val lie Thr Ala lie He Val Ala 

20 25 30 

Leu Gly Cys He Cys Glu Asn Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

Met Met Ala Leu Lys Thr Lys Val Leu Pro Lys Thr Val Val He He 

15 10 15 

Thr Val He Ser Ala Leu Thr Pro Lys Asp Cys He Asp Leu Phe His 

20 25 30 

Lys Thr lie Pro Gly Met Val Ala Asn Asn Ala Asn He Ser Pro Lys 
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35 40 45 

Pro Gly Thr Arg Tyr Lys Ala Lys Ala Thr Asn Thr Asn Val lie Gly 

50 55 60 

Ala lie lie Val Gly Leu Gly Leu Gly Phe Trp Lys His lie Xaa His 
65 70 75 80 

Ser Pro Tyr Phe 



(2) INFORMATION FOR SEQ ID NO: 246: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 



Met Ser Gin Leu Leu Ala Val Cys 

1 5 
Ala Arg Glu lie Gin Glu Leu Gly 
20 

Gly Arg Xaa Phe Cys Gly Arg Arg 

35 40 
Lys Phe Met Gly Trp Ala Gin Glu 

50 55 
Pro Val Leu Thr Gin Gin Thr Val 
65 70 
Arg Ala Pro Leu Gly lie Phe Tyr 
85 



Pro Met Gly Leu Glu Ala Gly Val 

10 15 
Tyr Glu Thr Asn Cys Trp Lys Met 
25 30 
Thr Gin Xaa Ala lie Gly Xaa Gly 
45 

Asp Pro Ser Lys Leu Trp Gly Gly 
60 

Asp Gly Phe lie Arg Thr Asn Pro 
75 80 

Trp 



(2) INFORMATION FOR SEQ ID NO:247: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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Met Ser Tyr lie Trp Thr Asp Gin Asn Arg Tyr Glu Gly Trp Leu Glu 

15 10 15 

Val Glu He Leu Ala Cys Glu Gly Trp Ser Glu Leu Gly His He Pro 

20 25 30 

Lys Ala Asp Val Gin Lys He Arg Gin Asn Ala Lys Val Asn Val Glu 

35 40 45 

Arg Ala Gin Glu He Glu Gin Glu Thr Arg His Asp Val Val Ala Phe 

50 55 60 

Thr Arg Gin Val Ser Glu Thr Leu Gly Glu Glu Arg Lys Trp Val His 
65 70 75 80 

Tyr Gly Leu Thr Ser Thr Asp Val Val Asp Thr Ala Leu Ser Phe Val 

85 90 95 

He Lys Gin Ala Asn Asp He He Glu Lys Asp Leu Glu Arg Phe He 

100 105 HO 

Asp Val Leu Ala Glu Lys Ala Lys Asn Tyr Lys Tyr Thr Leu Met Met 

115 120 125 

Gly Arg Thr His Gly Val His Ala Glu Pro Asn Tyr He Trp Val Ser 

130 135 140 

Lys Met Ala Leu Trp Tyr Thr Glu Met Gin Xaa 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Met Xaa Ala Leu Leu Ser He Xaa Xaa He Xaa Thr Pro Glu Ser Ala 

15 10 15 

Leu Ser Ala Pro Lys Asp Lys Tyr He Thr His Pro Xaa He Xaa Gly 

20 25 30 

Ser Val Glu Gly Tyr Xaa Thr Met Asp Leu Xaa Xaa Ala Leu Ala Phe 

35 40 45 

Xaa Gly Xaa He Asp Asn Gly Tyr Lys Phe Lys Gly Leu Thr Gly Ser 
50 55 60 
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Xaa Glu Asn Xaa Lys lie Leu Ser Ala 
65 70 
His lie Thr Trp Gly Leu He Tyr Phe 
85 

Gly His Gin Gin Leu Pro Gly Asn Phe 
100 105 
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Ser Gin Val Phe He Gly Gly 

75 80 
Trp Xaa Leu Gin Tyr Gly Xaa 
90 95 



(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

Met Thr Gin Ser He Leu Asn He Ala He He Thr Leu Ser Gin Lys 

15 10 15 

Lys Cys Ser Glu Asn Met Leu Gly Arg Val Asn Ser Val Phe Lys Thr 

20 25 30 

Met Val Phe Ala Phe Arg Pro He Ala Leu Phe Leu Gly Gly He Leu 

35 40 45 

Leu Glu Asn Lys Gly Gly Phe Tyr Ala Leu Thr He Ser Ala Val Leu 

50 55 60 

Phe He Pro Leu Val Leu Tyr He Leu. Lys Asn Arg Phe Tyr Gin Val 
65 70 75 80 

Glu Glu Tyr 



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

386 



WO 97/30070 



PCT/US97/02318 



Met Lys Leu Ser Asn Gly Val Lys Arg Leu Leu Lys His Ser Pro Thr 

1 5 10 15 

Thr Phe Xaa Asp Pro Xaa Asn Ala Ala Met Xaa Ala Asp Thr Asp Gly 

20 25 30 

Leu Ala Gly Leu Gin His Met Xaa Gly Asp Pro Thr Leu Leu Tyr Tyr 

35 40 45 

Thr Thr Asp Glu Ala Lys Glu Ala Arg Asp Ala Phe Lys Glu Lys Arg 

50 55 60 

Asp Pro Asp Phe Asp Gin Phe Pro Lys Phe Pro 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

Met Glu Asp Phe Asp Tyr He He Glu Leu Arg Glu Asn Glu Asn Gin 

15 10 15 

Glu He His Tyr Glu Glu Met Lys Lys Gin Thr Val Gin Ser Lys Asn 

20 25 30 

Asn Leu Asp Phe Gly Asp Pro Phe Gly Gly Gly Phe Gly Phe Phe Glu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 2 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 
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Met Gly Val His Ser Met Lys Met Lys Arg Phe lie Ala He Val Met 
1 5 10 15 

. Ala Leu Phe Leu Val Leu Xaa Gly Cys Ser Asn Ser Asn Asp Asn Asn 
20 25 30 

Glu Ser Lys Lys Asp Asp Ala Asp Asn Gly Lys Lys Gin Glu He Gin 

35 40 45 

Val Ala Ala Ala Ala Ser Leu Thr Asp Val Thr Lys Lys Arg Ala Ser 

50 55 60 

Glu Phe Lys Lys Glu His Lys Asn Ala Asp He Lys Phe Asn Tyr Gly 
65 70 75 80 

Gly Ser Gly Ala Xaa Arg Lys Gin He Xaa Ser Gly Ala Pro Leu Val 

85 90 95 

Asp Val Xaa Met Xaa Xaa Xaa Xaa Leu Lys Met 
100 105 



(2) INFORMATION FOR SEQ ID NO: 2 53: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

Met Pro He He Thr Ser Tyr He Thr Gin Asp Xaa Gly Thr Xaa Thr 

1 5 10 15 

Val Val He Ser Gly Val Glu Leu Gly Asn Lys Glu Thr Leu Leu Leu 

20 25 30 

Asp Asn Gly Phe Asp Val Glu Val Asp Val Ser Val He Asp Pro Phe 

35 40 45 

Gin He Thr Gly Lys Gin Arg Arg Lys He Phe Ala Leu Val Lys Asp 

50 55 60 

He Glu Glu 
65 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:254: 



Met Met lie Tyr Leu Gly Leu Phe lie Phe Ser lie Ser Val Gin Tyr 

15 10 15 

lie lie Tyr Leu Phe lie Asn lie Met lie Ser Gly Val Asn lie Pro 

20 25 30 

Gly Tyr Phe Ser Thr lie Ala Ala lie Leu Leu Leu Gly Gly lie Gin 

35 40 45 

Leu lie Ser lie Gly Val Val Gly Glu Tyr lie Gly Arg lie Tyr Tyr 

50 55 60 

Glu Val Lys Ala Arg Pro Lys Tyr lie lie Gin Ala Thr Asn Leu Ser 
65 70 75 80 

Ser lie Glu Asn Asp Glu Lys Asp Thr His Lys Val Tyr Ser Lys 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 255: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:255: 

Met Ala Gin Met Gly Ala Tyr Val Pro Cys Lys Glu Ala Val Leu Pro 

15 10 15 

lie Phe Asp Gin lie Phe Thr Arg lie Gly Ala Ala Asp Asp Leu Val 

20 25 30 

Ser Gly Lys Ser Thr Phe Met Val Glu Met Leu Glu Ala Gin Lys Ala 

35 40 45 

Leu Thr Tyr Ala Thr Glu Asp Ser Leu lie lie Phe Asp Glu lie Gly 

50 55 60 

Arg Gly Thr Ser Thr Tyr Asp Gly Leu Ala Leu Ala Gin Ala Met He 
65 70 75 80 

Glu Tyr Cys Ser Trp Lys His Arg Met Leu Lys Arg Tyr Phe Gin His 
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85 90 95 

lie lie Met Asn 
100 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

Met Thr Gin Thr Asn Pro Ser Phe Asn Pro Ser Pro Arg Tyr Lys Ser 

15 10 15 

Lys Lys Gly Trp Tyr Lys Asp Lys Pro Pro Lys Glu Lys Gly Gly Met 

20 25 30 

Pro lie Glu Val Glu lie Ala Gly Pro lie Val lie Glu Asn Lys Phe 

35 40 45 

lie Asp Pro Lys Thr Asn Thr Glu Lys Val lie lie Thr Xaa Glu Gly 

50 55 60 

Leu Xaa Lys 
65 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

Met lie Leu Leu Lys Arg Gin His Gin Leu Val Lys Glu Ala Gin Glu 

15 10 15 

Gin Thr Pro Phe lie Phe Val Asp Phe His Ala Glu Thr Thr Ser Glu 
20 25 30 

390 



WO 97/30070 PCT/US97/02318 

Lys Tyr Ala Met Gly Trp His Leu Asp Gly Arg Ala Ser Ala Val Val 

35 .40 45 

Gly Thr His Thr His He Gin Thr Glu Val Asn Val Phe Tyr Gin Arg 

50 55 60 

Gly Gin Gly He Leu Arg Met Leu Val 
65 70 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Met He Thr 
1 

Ala Ala Asp 

Arg Asn Val 
35 

Met Val Asp 
50 

Asp Leu Leu 
65 

Ala Gly Ser 
Lys He Leu 
Thr 



Gly Ala Ala Gin Met Asp Gly Gly He Leu Val Val Ser 

5 10 15 

Gly Pro Met Pro Gin Thr Arg Glu His He Leu Leu Ser 
20 25 30 

Gly Val Pro Ala Leu Val Val Phe Leu Asn Lys Val Asp 

40 45 
Asp Glu Glu Leu Leu Glu Leu Val Glu Met Glu Val Arg 

55 60 
Ser Glu Tyr Asp Phe Pro Gly Asp Asp Val Pro Val He 

70 75 80 

Ala Leu Lys Ala Leu Glu Gly Asp Ala Gin Tyr Glu Glu 

85 90 95 

Glu Leu Met Glu Ala Val Asp Thr Tyr He Pro Leu Gin 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



Met He Asn Glu Gin Arg Leu Leu Asn Thr Phe Leu Glu Leu Val Gin 

15 10 15 

He Asp Ser Glu Thr Gly Asn Glu Ser Thr He Gin Pro He Leu Lys 

20 25 30 

Glu Lys Phe He Ala Leu Gly Leu Arg Cys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 



Met Asn Gin Leu Leu Val Arg His Phe Leu Asn Arg Gin Gin Arg Asp 

1 5 10 15 

Asp He Phe He Gly Thr Lys Val Gly Asn Arg lieu Thr Lys Asp Gly 

20 25 30 

Ser Thr Thr Trp Asp Pro Ser Lys Ser Tyr He Lys Glu Ala Val Lys 

35 40 45 

Gly Ser Leu Lys Arg Leu Gly He Asp His He Asp Leu Tyr Gin Leu 

50 55 60 

His Gly Gly Thr He Asp Asp Pro Leu Asp Glu Thr He Ser Ala Cys 
65 70 75 80 

Asp Glu Leu Lys Gin Glu Gly He He Arg Ala Tyr Gly He Ser Ser 

85 90 95 

He Arg Pro Asn Val He Asp Tyr Tyr Leu Lys His Ser Gin He Glu 

100 105 110 

Thr lie Met Ser Gin Phe Asn Leu He Asp Asn Arg Pro Glu Ser Leu 

115 120 125 

Leu Asp Ala He His Asn Asn Asp Val Lys Val Leu Ala Arg Gly Pro 

130 135 140 

Val Ser Lys Gly Leu Leu Thr Ser Asn Ser Gly 
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(2) INFORMATION FOR SEQ ID NO:261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

Met Ala lie Gin Leu His Glu Ser Gly Val Leu Gin Asp Asn Asn Val 

1 5 10 15 

Gin Leu Leu Gly Thr Glu Leu Thr Ser lie Gin Gin Ala Glu Asp Arg 

20 25 30 

Glu Met Phe Arg Thr Leu Met Asn Asp Leu Asn Val Pro Val Pro Glu 

35 40 45 

Ser Asp He Val Asn Thr Val Glu Gin Ala Phe Lys Phe Lys Glu Gin 

50 55 60 

Val Gly Tyr Pro Leu He Val Arg Pro Ala Phe Thr Met Gly Gly Tyr 
65 70 75 80 

Arg Arg Arg Tyr Leu Val He Met Met Lys Asn Tyr Met Lys Ser Val 

85 90 95 

Ser Asn Gly Leu Pro Leu 
100 

(2) INFORMATION FOR SEQ ID NO:262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Met Tyr He Cys Xaa Gin Lys Lys Asp Asp Asp Gly He Tyr Xaa Tyr 
15 10 15 
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Tyr Leu Gly Thr Ala Gly Tyr lie Glu Gly Ser Glu Lys Gin Asp Lys 

20 25 30 

Met Pro Asn Gly Ser Asn Val Val Thr Met Asp Leu Ala Leu Asp Lys 

35 40 45 

Ala Val Arg Asp Asp lie Tyr Arg Tyr lie Thr Asn 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 263 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



Met Glu Lys Arg Thr His Arg He Phe Met He Gly Gin Arg Gly He 

15 10 15 

Leu Glu He Pro His Ser Arg Gly Val Pro Phe He Met Ser Asp Thr 

20 25 30 

Ala Gly Arg Asp Lys Pro Leu Pro Met Arg Pro Asp Lys Leu Lys Asn 

35 40 45 

Leu Gly Met Leu Thr Glu Pro Gly Leu Tyr Tyr Leu Tyr Thr Asp His 

50 55 60 

Thr Ala Gin He Asp Asp Phe Pro Xaa Pro Arg Glu Trp Arg Asp Ala 
65 70 75 80 

Ala Trp Phe Leu Glu Val Lys Pro Pro Gin Thr Gly Gly Asp Val He 

85 90 95 

Gin He Leu Thr Arg Asn Ser 
100 



(2) INFORMATION FOR SEQ ID NO: 264: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

394 



WO 97/30070 



PCT/US97/02318 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

Met Leu Lys Arg Lys Ser Leu Gin Lys Xaa Ser Glu Arg Lys Thr Ala 

15 10 15 

lie Glu Ala Met Asn Asp Ser Thr Thr Glu Glu Gin Gin Ala Ala Lys 

20 25 30 

Asp Lys Val Asp Gin Ala Val Val Thr Ala Asn Ala Asp lie Asp Asn 

35 40 45 

Ala Thr Ala Asn Thr Asp Val Asp Asn Ala Lys Thr Thr Asn Glu Ala 

50 55 60 

Thr Arg Ala Ala lie Thr Pro Asp Ala Asn Val Lys Pro Ala Ala Lys 
65 70 75 80 

Gin Ala Xaa Ala Asp Lys Val Gin Ala Gin Glu Thr Ala lie Asp Ala 

85 90 95 

Asn Asn Gly Ser Thr Thr Glu Glu Lys Glu Ala Ala Lys Gin Gin Val 

100 105 110 

Gin Ser Glu Lys Thr Ala Ala Asp Ala Ala lie Asp Ala Ala His Ser 

115 120 125 

Asn Val Glu Val Glu Ala Ala Lys Asn Ala Glu lie Ala Lys He Glu 

130 135 140 

Ala He Gin Pro Ala Thr Thr Ser Lys Asp Asn Ala Thr Gin Ala He 
145 150 155 160 

Ala Thr Lys Ala Asn Glu Arg Lys Thr Ala Leu Ala Gin Thr Gin Asp 

165 170 175 

He Thr Ala Glu Glu He Ala Ala Ala Asn Ala Asp Val Asp Asn Ala 

180 185 190 

Val Thr Gin Ser Lys 
195 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 65: 



Met Thr Lys He He Phe Met Gly Thr Arg Ala Phe Ser Thr Thr Val 
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1 



5 



10 



15 



Leu Glu Met Leu lie Ala Glu His Asp Val lie Ala Val Val Thr Gin 

20 25 30 

Pro Asp Arg Pro Val Gly Arg Lys Arg Val Met Thr Pro Pro Pro Val 

35 40 45 

Lys Lys Val Xaa Met Lys Tyr Asp Leu Pro Val Tyr Gin Pro Glu Lys 

50 55 60 

Leu Ser Gly Ser Glu Glu Leu Glu Gin Leu Xaa Gin Leu Asp Val Asp 
65 70 75 80 

Leu lie Val Thr Ala Ala Phe Gly Thr He He Thr 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 66: 

Met Cys Tyr Gly He Lys Glu Gin Val Asn Thr Phe He Gin Asn Gly 

15 10 15 

Arg Val Glu His Leu Xaa He Asp Phe Lys Pro His Ser Tyr Gin Lys 

'20 25 30 

Tyr Ala He Asp Lys Val He Asp Asn Glu Lys Xaa Gly Leu Phe Leu 

35 40 45 

Asp Met Gly Leu Gly Lys Thr Val Ser Thr Leu Thr Ala Phe Ser Glu 

50 55 60 

Leu Gin Leu Leu Asp Thr Lys Lys Met Leu Val He Ala Pro Lys Gin 
65 70 75 80 

Val Ala Lys Asp Thr Trp Val Asp Glu Val Asp Lys Trp Asn His Leu 

85 90 95 

Asn His Leu Lys Val Ser Leu Val Leu Gly Thr Pro Lys Glu Arg Asn 

100 105 110 

Asp Ala Leu Asn Thr Glu Ala Asp He Tyr Gly Thr Asn Lys Glu Xaa 

115 120 125 

Thr Lys Trp Leu Cys Asp Gin Tyr Lys Lys Glu Trp Pro Xaa 
130 135 140 



85 



90 



(2) INFORMATION FOR SEQ ID NO: 266: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 142 amino acids 
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(2) INFORMATION FOR SEQ ID NO: 267: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:267: 



Met Xaa Leu Lys Glu Lys Thr lie 

1 5 
Gin Val Arg lie Lys Ser Gly Pro 
20 

Gin Glu lie Val Thr Asp Lys Phe 

35 40 
Phe Xaa Arg Glu Thr Pro Val Glu 
50 55 

Leu 
65 



Asp Val Val Leu Glu Val Gly Glu 

10 15 
Phe Ala Asn Gin Val Gly Xaa Val 
25 30 
Lys Leu Thr Val Leu Val Asp Met 
45 

Xaa Val Phe Asp His lie Glu Lys 
60 



(2) INFORMATION FOR SEQ ID NO:268: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 



Met Leu Gly Lys lie Val Gly Xaa Glu Asp Lys Val Lys Ala Trp Lys 

1 5 10 15 

Xaa Asp Trp Glu Glu Thr Thr Pro Lys Asp Gly Lys Glu lie Lys Lys 

20 25 30 

Ala He Gly Gin Asp Thr Pro Val Ser Leu Phe Asp Glu Phe Asp Lys 

35 40 45 

Lys Leu Tyr Thr Tyr Gly Asp Asn Trp Gly Arg Gly Gly Glu Val Leu 
50 55 60 
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Tyr Gin Ala Phe Gly Leu 



65 



70 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

Met Thr Glu His Lys Ser Gly Phe Val Ser lie lie Gly Arg Pro Asn 

15 10 15 

Val Gly Lys Ser Thr Phe Val Asn Arg Val lie Gly His Lys lie Ala 

20 25 30 

lie Met Ser Asp Lys Ala Gin Thr Thr Arg Asn Lys He Gin Gly Val 

35 40 45 

Met Thr Arg Asp Asp Ala Gin He He Phe Asn Asp Thr Pro Gly He 

50 55 60 

His Lys Pro Lys His Lys lieu Gly Asp Tyr Thr Asp Glu Ser Arg Leu 
65 70 75 80 

Lys Tyr He Ser Glu He Asp Ala He Met Val Tyr Gly 



(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 



Met Ala Lys Gin Lys Lys Phe Met Lys He Tyr Glu Ala Leu Lys Glu 

15 10 15 

Asp He Leu Asn Gly Gin He Gin Tyr Gly Glu Gin He Pro Ser Glu 
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85 



90 
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20 25 30 

His Asp Leu Val Gin Leu Tyr Gin Ser Ser Arg Glu Thr Xaa Arg Lys 

35 40 45 

Ala Leu Asp Xaa Xaa Ala Leu Asp Gly Met lie Gin Lys lie His Gly 

50 55 60 

Lys Gly Ser Leu Val lie Tyr Gin Glu Val Thr Glu Phe Pro Phe Ser 
65 70 75 80 

Glu Leu Val Ser Phe Xaa Glu Met Gin Glu Glu Met Gly Val Ala Tyr 

85 90 95 

Leu Thr Glu Val Val Val Asn Glu Val Val Glu Ala His Glu Val Pro 

100 105 HO 

Glu Val Gin His Ala Leu Asn lie Asn Ser Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO; 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 271: 

Met Arg Leu Val Val Lys Val Ser Ser Leu Lys Ala Val lie Gin Met 

1 5 10 15 

He Leu Asp He Ala Lys Asp Tyr Leu Phe Thr Xaa Glu Val Val Gin 

20 25 30 

Thr Phe Glu Asn Glu Arg Phe Lys Thr Lys His Thr His Gly Thr Gly 

35 40 45 

Cys Thr Phe Ser Ala Val He Thr Ala Glu Leu Ala Lys Gly Arg Pro 

50 55 60 

Leu Phe Glu Ala Val His Lys Ala Lys Lys Phe He Ser Met Ser He 
65 70 75 80 

Gin Tyr Thr Pro Glu He Xaa Arg Gly Arg Gly Pro Val Asn His Phe 

85 90 95 

Ala Tyr Leu Lys Lys Glu Gly Leu Asp Asp Glu Leu Ser Lys 
100 105 110 

<2) INFORMATION FOR SEQ ID NO: 272: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

t 

Met Lys Arg Phe lie Tyr lie Asp Pro Glu He Tyr Asn Leu Gly He 

15 10 15 

Pro Val Leu Gly He Cys Tyr Gly Met Gin Leu Thr Thr Lys Leu Leu 

20 25 30 

Gly Gly Lys Val Xaa Arg Ala Asn Glu Arg Glu Tyr Gly Lys Ala He 

35 40 45 

He Asn Ala Lys Ser Asp Glu Leu Phe Ala Gly Leu Pro Ala Glu Gin 

50 55 60 

Thr Val Trp Met Ser His Ser Asp Lys Val He Glu lie Pro Glu Gly 
65 70 75 80 

Phe Glu Val He Ala Asp Ser Pro Ser Thr Asp Tyr Ala Ala He Glu 

85 90 95 

Asp Lys Lys Arg Arg He Tyr Gly Val Gin Phe His Pro Glu Val Pro 

100 105 110 

Ser Tyr Arg He Trp 
115 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

Met Glu He He Val Xaa Glu Asn Leu Val Leu Lys Glu Lys Glu Arg 

15 10 15 

Leu Gin Val Leu Tyr Lys Asp He Pro Ser Asn Lys Leu Lys Val Val 
20 25 30 
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Asp Gly Leu lie lie Gin Ala Ala Arg Leu Arg Val Met Leu Asp Tyr _ 

35 40 45 

Met Trp Glu Asp lie Lys Glu Lys Gly Asp Tyr Asp Leu Phe Thr Gin 

50 55 60 

Ser Glu Lys Ala Pro Pro Tyr Glu Arg Glu Arg Pro Val Gly Lys Leu 
65 70 75 80 

Phe Xaa Ala Arg Asp Ala Ala Tyr Gin Lys lie lie Lys Gin Leu Ser 

85 90 95 

Asp Leu Leu Ala Xaa Glu Lys Glu Asp Thr Glu Thr Pro Ser Asp Asp 
100 105 110 

Tyr Leu 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

Met Thr Ser Tyr Tyr Ala Asn Gin Pro Phe Val Arg lie Arg Asp lie 

15 10 15 

Gly Thr Phe Pro Thr Thr Lys Glu Val Leu Gly Ser Asn Tyr Cys Asp 

20 25 30 

lie Gly lie Tyr Val Asp Glu Thr Thr Gin Thr Ala lie Leu Val Ser 

35 40 45 

Val lie Asp Asn Leu Val Lys Gly Ala Ser Gly Gin Ala lie Gin Asn 

50 55 60 

Leu Asn He Leu Tyr Asp Phe Glu Val Thr Thr Gly Leu Asn Gin Ser 
65 70 75 80 

Pro Val Tyr Pro 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 amino acids 
<B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 275: 



Met Val Val Thr Gly Asp Gin Thr Gin lie Asp Leu Pro Lys Gly Val 

15 10 15 

Lys Ser Gly Leu Lys Glu Ala Xaa Ser Arg Leu His Asn Val Lys Gly 

20 25 30 

He Ser He Leu Lys Leu Ala Gin Ser Asp Val Xaa Arg His Pro Leu 

35 40 45 

Val Ser Lys Xaa He Glu His Tyr Glu Gly Glu Asn 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Met Lys Phe He He Leu Xaa Glu Glu Asn He Asp Ser Val He Phe 

15 10 15 

Lys Gly Tyr Ser Asp Gin Asp Glu Val Arg Phe Gin Glu Leu Phe Glu 

20 25 30 

Asn Met Lys Gin Asn Leu Asp Ser Glu lie Gin Arg Gly Glu Val Thr 
35 40 45 

Pro Gin 
50 



(2) INFORMATION FOR SEQ ID NO: 277: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met Met Ser Asn Lys Leu Asp Gly lie Asn Lys Met lie Thr Ala Lys 

15 10 15 

His Lys Xaa Met Asp Asp Leu Tyr Asp Val Lys Arg Glu Val Lys Ala 

20 25 30 

Leu lie Asp Glu Ser Asp Glu Leu Asn His Ser lie Glu Gin Leu Tyr 

35 40 45 

Gin His Leu Gly Xaa Arg Tyr His Ser Ser Asn Met Ala Ser Arg Met 

50 55 60 

Glu Gin Phe Arg Asp Xaa Phe His Phe Ala Lys Arg Arg Ser Xaa Glu 
65 70 75 80 

Ala Leu Tyr Glu Gin Gin Gin His lie Gin His Xaa lie Arg Lys Ala 

85 90 95 

Glu Glu Glu Met lie Asp Leu Glu Met Arg Arg Asn Val Glu lie Glu 

100 105 110 

Thr Val Thr Lys Glu Glu Asn Lys Trp Lys Gin 
115 120 

(2) INFORMATION FOR SEQ ID NO:278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 



Met lie Phe Gly Lys Gly Thr Ala Lys Ala Thr Ser Tyr Gly Ala Gly 

15 10 15 

lie lie His Phe Leu Gly Gly lie His Glu lie Tyr Phe Pro Tyr Val 

20 25 30 

Leu Met Arg Pro Leu Leu Phe lie Ala Val lie Leu Gly Gly Met Thr 

35 40 45 

Gly Val Ala Thr Tyr Gin Ala Thr Gly Phe Gly Phe Lys Ser Pro Ala 
50 55 60 
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Ser Pro Gly Ser Phe lie Val Tyr Cys Leu Asn Ala Pro Arg Gly Glu 
65 70 75 80 

Phe Leu His Met Leu Leu Gly Val Phe Leu Ala Ala Leu Val Ser Phe 

85 90 95 

Val Val Ala Ala Leu lie Met Lys Phe Thr Arg Glu Pro Lys Gin Asp 

100 105 110 

Leu Glu Ala Ala Thr Ala Gin Met Glu Asn Thr Lys Gly Lys Lys Ser 

115 120 125 

Ser Val Ala Ser Lys Leu Val Ser Ser Asp Lys Asn Val Asn Thr Glu 

130 . 135 140 

Glu Asn Ala Ser Gly Asn Val Ser Glu Thr Ser Ser Ser Asp Asp Asp 
145 150 155 160 

Pro Glu Ala Leu Leu Asp Asn Tyr Asn Thr Glu Asp Val Asp Ala His 

165 170 175 

Asn Tyr Asn Asn lie Asn His Val lie Phe Gly Cys Asp Ala Gly Met 

180 185 190 

Gly Ser Ser Ala Met Gly Ala Ser Met Leu Arg Asn Lys Phe Lys Lys 

195 200 205 

Ala Gly He Asn Asp He Thr Gly Tyr Lys Tyr Cys Asp 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

Met Glu Asp Thr Ala Met Leu Leu Ser Thr Arg Glu Lys Glu Met He 

15 10 15 

Ala Leu Leu He Lys Tyr His Ser Gin Tyr He Thr He His Asp He 

20 25 30 

Ala Gin Gin Leu Ala Val Ser Ser Arg Thr lie His Arg Glu Leu Lys 

35 40 45 

Gly Val Glu Ala Tyr Leu Thr Ser Phe Ser Leu Thr Leu Glu Arg Ala 

50 55 60 

Asn Lys Lys Gly Leu Arg lie Ala Gly Ala Asp Ser Asp Leu Asn Asp 
65 70 75 80 
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Leu Lys Gin Ser lie Ala Gin His Gin Thr lie Asp Leu Ser Val Glu 

85 90 95 

Glu Gin Lys Val lie lie lie Tyr Ala Leu lie Gin Ala Lys Glu Pro 

100 105 110 . 

Val Lys Gin Tyr Ser Leu Ala Gin Glu lie Gly Val Ser Val Gin Thr 

115 120 125 

Leu Ala Lys Met Leu Asp Asp Leu Glu Leu Asp Leu Asn Lys Tyr Gin 

130 135 140 

Leu Ser Leu Ser Arg Lys Arg Gly Glu Gly lie Tyr Leu Val Gly Thr 
145 150 155 160 

Glu Ser Lys Lys Arg Glu Phe Leu Ser Gin Leu Met Val Asn Asn Leu 

165 170 175 

Asn Ser Thr Ser Val Tyr Ser Ser Asn 



(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

Met Pro Ala Ser Val lie Gly Leu Val Leu Leu Phe Val Leu Leu Cys 

15 10 15 

Thr Gly Ala Val Lys Leu Gly Glu Val Glu Lys Val Gly Ala Thr Leu 

20 25 30 

Thr Asn Asn lie Gly Leu Leu Phe Val Pro Ala Gly lie Ser Val Val 

35 40 45 

Asn Ser Leu Gly Val lie Ser Gin Ala Pro Phe Leu lie lie Gly Leu 

50 55 60 

lie lie Val Ser Thr lie Leu Leu Leu lie Cys Thr Gly Tyr Val Thr 
65 70 75 80 

Gin He He Met Lys Val Thr Ser Arg Ser Lys Gly Asp Lys Val Thr 

85 90 95 

Thr Lys He Asn Asp Arg Gly Gly Thr Ser Ser Cys Leu Thr Thr 



180 



185 



100 



105 



110 



(2) INFORMATION FOR SEQ ID 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
iJ» TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



Met Leu Gly Lys 

1 

Gin Lys Ser Val 
20 

Thr Asn Asp Thr 
35 

Thr Gin Asn Asp 
50 

Ser His Ser Met 
65 

Asn Lys Ala Asn 

Lys Gin Gin Ala 
100 



Xaa Met Xaa Arg Ala Thr Ala Asp Thr Asn Asp Thr 

5 10 15 

Gly Pro Val Xaa Asn Asn Lys Ala Lys Asp Met Gin 

25 30 
Gin Lys Ser Val Gly Ser Ala Ala Asn Asn Lys Ala 

40 45 
Gly Ala Asn Ala Ser Pro Ala Thr Val Ser Asn Gly 

55 60 
His Gin Asp Met Leu Asn Val Thr Lys Pro Glu Glu 

70 75 80 

Ala Lys Ser Asp Gin Gin Gly Lys Val Asn Lys Pro 
85 90 95 

Lys Thr Leu Pro Asp Thr Gly Met Ser Gin 
105 110 



(2) INFORMATION FOR SEQ ID NO: 282: 



<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 



Met Xaa Xaa Ala Lys Phe Xaa Xaa 

1 5 
Gly Xaa Pro Asn Val Ser Glu Thr 
20 

Ser Pro Pro Thr Xaa Asn Asn Ser 



Leu lie Ala Thr Pro Glu Asn Thr 

10 15 
Thr Asp Asn Gly Lys Ala Asp Ala 
25 30 
Asp Ala Xaa Thr Gly Xaa Thr Thr 
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35 40 45 

Val Thr Ser Ala Thr Asp Asp Ala Xaa Asp Lys Pro Gin Ala Asn Asn 

50 55 60 

Asn Ser Ser Ala Asp Ala Ser Thr His Ser Pro Thr Met Asp Asn Ala 
65 70 75 80 

Val Thr Ser Lys Pro Glu Val Glu Ser Thr Asn Asn Gly Thr Thr Asp 

85 90 95 

Lys Pro Val Thr Glu Thr Asp Asn Ala Xaa Pro Ala Glu Ser Thr Thr 

100 105 110 

lie Asn Asn Ser Thr Pro Thr Ala Thr Xaa Glu Asn Ala Arg Pro Gly 

115 120 125 

Ser Xaa Ala Pro Ala Pro Thr Thr Ala Ser Thr Glu Ala Ala Ser Ser 

130 135 140 

Ala Asp Ser Lys Asp Asn Ala Ser Val Asn Asp Ser Lys Gin Asn Ala 
145 150 155 160 

Glu Val Xaa Asn Ser Ala Xaa Ser Gin Ser Thr Asn Gly Lys Val Ala 

165 170 175 

Gin Pro Xaa Ser Glu Xaa Lys Ala Lys Ala Glu Lys Asp Gly Arg Asp 

180 185 190 

Ser Thr lie Gin Xaa Met Val Xaa Ser Thr Thr Glu Thr Leu Pro Ser 

195 200 205 

Ala Asp He Thr Glu Pro Xaa Val Pro Ser Asn Thr Ser Lys Asp Lys 

210 215 220 

Glu Glu Ser Thr Thr Asn Gin Xaa Asp Ala Gly Gin Leu Lys Ser Glu 
225 230 235 240 

Thr Asn Val Ala Ser Asn Glu Ala Xaa Xaa Ser Pro Ser Lys Ala Asp 

245 250 255 

Thr Glu Val Ser Asp Lys Pro Ser Thr Ser Ala Ser Ser 
260 265 

(2) INFORMATION FOR SEQ ID NO: 283: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 



Met Gin Asn Lys He Thr Glu He Leu Asn Asn Asn Glu Thr Thr Asn 

407 



WO 97/30070 PCTVUS97/02318 

15 10 15 

Glu Glu Lys Gly Asn Asp lie Gly Pro Val Arg Ala Ala Tyr Glu Glu 

20 25 30 

Gly Xaa Xaa Xaa lie Asn Ala Ala Xaa Thr Thr Gly Asp Val Thr Thr 

35 40 45 

Ala Lys Asp Thr Ala Gly Thr Lys Ser Ser Thr Thr Ser Cys Lys Ser 
50 55 60 

Cys 
65 



(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 284: 

Met Lys Lys Lys Leu Leu Val Leu Thr Met Ser Thr Leu Phe Ala Thr 

1 5 10 15 

Gin Leu He Asn Ser Asn His Ala Lys Ala Ser Val Thr Glu Ser Val 

20 25 30 

Asp Thr Lys Phe Val Val Pro Glu Ser Gly He Asn Lys He He Pro 

35 40 45 

Ala Tyr Asp Glu Phe Lys Asn Ser Pro Lys Val Asn Val Ser Asn Leu 

50 55 60 

Thr Asp Asn Lys Asn Phe Val Val Ser Glu Asp Lys Leu Asn Lys He 
65 70 75 80 

Val Asp Ser Ser Ala Ala Ser Lys He Val Asp Lys Asn Phe Ala Val 

85 90 95 

Pro Glu Ser Lys Leu Gly Asn He Val Pro Glu Tyr Lys Glu He Asn 

100 105 110 

Asn Arg Val Asn Val Ala Thr Asn Asn Pro Ala Ser Gin Gin Val Asp 

115 120 125 

Lys His Phe Val Ala Lys Gly Pro Glu Val Asn Arg Phe He Thr Gin 

130 135 140 

Asn Lys Val Asn His His Phe He Thr Thr Gin Thr His Tyr Lys Lys 
145 150 155 160 

Val He Thr Ser Xaa Lys He Asn Thr Cys Xaa Leu Thr Cys Xaa Ser 
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165 170 175 

Cys Lys Arg He Leu Leu He Xaa Thr Leu Leu Leu His lie Arg Leu 
180 185 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 85: 

Met Gly Xaa He Pro Val Ser Xaa Pro Ala Glu Xaa Val Xaa Ser Asp 

15 10 15 

Lys Gin Asn Tyr Asp Xaa Tyr Val Ala Lys Thr Gin Thr Ser Gin Asn 

20 25 30 

Lys Gin Leu Glu Gin Glu Lys Gin Phe Asp Ser Val Gly Lys Gin Gly 

35 40 45 

Thr Ala Ser Lys Ser Ser Asp Glu Xaa Val Ser Ser Thr Thr Xaa Ser 

50 55 60 

Met Pro Asn Tyr Ser Xaa Gly Gly 
65 70 



(2) INFORMATION FOR SEQ ID NO: 286 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 : 



Met 
1 



Tyr Ser Asn Xaa Thr Xaa His Arg Ala He Val Ser Xaa Pro Asp 
5 10 15 
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Xaa His Lys Glu Ser Phe Lys Thr Ser Glu Val Pro Pro Ala lie Xaa 

20 25 30 

Gly Xaa Met Xaa Pro Xaa Xaa Leu Glu Asn Gly Arg lie Pro Val Ser 

35 40 45 

Lys Pro Ser Glu Lys Val Glu Ser Asp Lys Gin Lys Tyr Asp Xaa Tyr 

50 55 60 

Val Ala Lys Thr Gin Thr Ser Gin Xaa 
65 70 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 87: 



Met Phe Leu Gly Asp Val Ala Leu Ser Ala lie Arg Pro Pro Thr His 

1 5 10 15 

He Lys Ala Asp Gly Asp Ala Asp Val Thr Tyr Gly Val Ala He Val 

20 25 30 

Gly Asp Lys Val Thr Arg Lys Glu Leu Thr He Glu Ala Thr Val Thr 

35 40 45 

Pro Leu Cys Pro Cys Ser Gin Xaa He Ser Glu Tyr Ser Ala His Thr 

50 55 60 

Gin Arg Gly Val Val Thr Val Lys Thr Tyr He Asn Lys Asp Xaa Asn 
65 70 75 80 

He Val Asp Asp Tyr Lys Asn Lys He Leu Asp Ala Met Glu Ala Asn 

85 90 95 

Ala Ser Ser He Leu Tyr Pro He Leu Lys Arg Pro Asp Glu Lys Arg 

100 105 110 

Val Thr Glu Arg Ala Tyr Glu Asn Pro Arg Phe Val Glu Asp Leu He 

115 120 125 

Arg Leu He Ala Ala Asp Leu Val Glu Phe Asp Trp Leu Asp Gly Phe 

130 135 140 

Asp He Glu Cys Arg Asn Glu Glu Ser He His Gin His Asp Ala Phe 
145 150 155 160 

Ala Lys Leu Lys Tyr Arg Lys 
165 
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(2) INFORMATION FOR SEQ ID NO:288: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
. {B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

Met Thr Xaa Xaa Thr Pro Ala Gin Lys Xaa Asp Xaa Ser Xaa Xaa Val 

15 10 15 

Asn Xaa Gly Thr Thr Xaa Ala Gly Val His Thr Ala Gin Ser Tyr Ala 

20 25 30 

Thr Thr Leu His Gin Ala Xaa Ser Thr Xaa Lys Thr Ser Xaa Xaa Xaa 
35 40 45 

Gin 



(2) INFORMATION FOR SEQ ID NO:289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 89: 

Met Tyr Xaa His Trp He Leu Gly Asp He Asn Ser Ala Tyr He He 

15 10 15 

Xaa Val Val Ser Lys Tyr Thr Pro Thr Ser Asp Gly Glu Leu Asp He 

20 25 30 

Ala Gin Gly Thr Ser Met Arg Thr Thr Asp Lys Tyr Gly Tyr Tyr Asn 

35 40 45 

Tyr Ala Gly Tyr Ser Asn Phe He Val Thr Ser Asn Asp Thr Gly Gly 

50 55 60 

Gly Asp Gly Thr Val Lys Pro Glu Xaa Lys Leu Tyr Lys He Gly Asp 

411 



WO 97/30070 PCT/US97/02318 

65 70 75 80 

Tyr Val Trp Glu Asp Val Asp Lys Asp Gly Val Gin Gly Thr Asp Ser 

85 90 95 

Lys Glu Lys Pro Met Ala Asn Val Leu Val Thr Leu Thr Tyr Pro Asp 

100 105 110 

Gly Thr Thr Lys Ser Val Arg Thr Asp Ala Asn Gly His Tyr Glu Phe 

115 120 125 

Gly Gly Leu Lys Asp Gly Glu Thr Tyr Thr Val Lys Phe Glu Thr Pro 

130 135 140 

Ala Gly Tyr Leu Pro Thr Lys Val Asn Gly Thr Thr Asp Gly Glu Lys 
145 150 155 160 

Asp Ser Asn Gly Ser Ser Val Thr Val Lys lie Asn Gly Lys Asp Asp 

165 170 175 

Met Ser Leu Asp Thr Gly Phe Tyr Lys Glu Pro Lys Tyr Asn Leu Gly 

180 185 190 

Asp Tyr Val Trp Glu Asp Thr Asn Lys Asp Gly He Gin Asp Ala Asn 

195 200 205 

Glu Pro Gly He Lys Asp Val Lys Val Thr Leu Lys Asp Ser Thr Gly 

210 215 220 

Lys Val He Gly Thr Thr Thr Thr Asp Ala Ser Gly Lys Tyr 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 290: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



Met Thr Leu Asp Ser Gly Phe Asp Lys Thr Pro Lys Tyr Ser Leu Gly 

15 10 15 

Asp Tyr Val Trp Tyr Asp Ser Asn Lys Asp Gly Lys Gin Asp Ser Thr 

20 25 30 

Glu Lys Gly He Lys Asp Val Lys Val Thr Leu Leu Asn Glu Lys Gly 

35 40 45 

Glu Val He Gly Thr Thr Lys Thr Asp Glu Asn Gly Lys Tyr Arg Phe 

50 55 60 

Asp Asn Leu Asp Ser Gly Lys Tyr Lys Val He Phe Glu Lys Pro Ala 
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65. 70 75 80 

Gly Leu Thr Gin Thr Val Thr Asn Thr Thr Glu Asp Asp Lys Asp Ala 

85 90 95 

Asp Gly Gly Glu Ala 
100 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Met Leu Met Xaa lie Pro Asp Xaa Asp Xaa His Val Xaa Leu Asp Val 

15 10 15 

Leu Cys Glu Ser His Asn Asp Val Asp Val Glu Ser Asp Ser Leu Val 

20 25 30 

Leu Ser Asp Val Glu Pro Asp Ser Asp Val His Thr Asp Val Asp Lys 

35 40 45 

Leu Thr Met Val Asp Met Arg Phe Glu Val Asp Val Xaa Ser Glu Ser 

50 55 60 

Leu Asn Asp Ala Asp Val Leu Cys Glu Ser Asp Ser Leu Leu Cys lie 
65 70 75 80 

Asp Ala Asp Val Glu Ser ^Leu Lys Asp Val Asp Asp Glu Ser Asp Ser 

85 90 95 

Leu Ala Leu Ala Glu Leu Leu Ser Asp Val Asp Val Leu Ala Asp Pro 

100 105 110 

Glu Leu Leu Val Leu Val Glu Ser Leu Ser Asp Val Asp Ala Leu Ser 
115 120 125 

Asp Ala 
130 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

Met Ser Thr Ser Glu Ser Leu Ser Asp Ser Thr Ser Thr Ser Gly Ser 

1 5 10 15 

Val Ser Gly Ser Leu Ser lie Ala Ala Ser Gin Ser Val Ser Thr Ser 

20 25 30 

Thr Ser Asp Ser Met Ser Thr Ser Glu lie Val Ser Asp Ser lie Ser 

35 40 45 

Thr Ser Gly Ser Leu Ser Ala Ser Asp Ser Lys Ser Met Ser Val Ser 

50 55 60 

Ser Ser Met Ser Thr Ser Gin Ser Gly Ser Thr Ser Glu Ser Leu Xaa 
65 70 75 80 

Asp His Lys Val His Leu lie Leu lie Val Ser His Tyr His 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

Met Thr Phe Gly Ser Val Phe Thr Ser Thr Pro Ser Xaa Ser Leu Thr 

15 10 15 

Asp Arg Phe Cys Arg Leu Asp Gin Glu Arg Phe Gly Asp lie Lys Asp 

20 25 30 

lie Thr Asp Lys Gly Tyr Tyr Gin Asn Ser Phe His Tyr Asp Val Arg 

35 40 45 

Lys Asp Val Thr Pro Phe Glu Lys Leu Asp Phe Glu Lys Asp Tyr Pro 

50 55 60 

Tyr Tyr Ala Ser Gly Gly Phe lie His Tyr Cys Glu Tyr Pro Lys Leu 
65 70 75 80 

Gin His Asn Leu Lys Ala Leu Glu Xaa Gly lie Gly Pro Xaa Ser Tyr 



85 



90 



85 



90 



95 
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Cys Xaa Ser Trp Leu Thr Leu Gly Xaa Lys Tyr Xaa Asp Leu Xaa His 
100 105 110 

Cys Tyr 

(2) INFORMATION FOR SEQ ID NO:294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

Met Gly Gin Leu Glu Thr Ser He Arg Asp Lys Asp Thr Thr Leu Gin 

15 10 15 

Ser Gin Asn Tyr Gin Asp Ala Asp Asp Ala Lys Arg Thr Ala Tyr Ser 

20 25 30 

Gin Ala Val Asn Ala Ala Ala Thr He Leu Xaa Lys Thr Ala Gly Gly 

35 40 45 

Asn Thr Pro Xaa Ala Asp Val Asp Arg Ala Met Gin Ala Val Thr Gin 

50 55 60 

Ala Asn Thr Ala Leu Asn Gly He Gin Tyr Leu Arg Thr Cys Glu Thr 
65 70 75 80 

Arg Leu Arg Thr Gin Gin Xaa Thr Asn Ala Ser Val Xaa Xaa Ser Lys 

85 90 95 

Gin Thr Arg Xaa He Asp Xaa Thr Xaa 
100 105 

(2) INFORMATION FOR SEQ ID NO:295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
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Met Leu Val Phe Thr Leu Xaa Asn Val Arg Arg Ser Met Val Glu Gin 

1 5 10 15 

Leu Asn Glu Asp Tyr Val Leu Tyr Leu Arg Ala Ser Gly Val Lys Ser 

20 25 30 

lie Thr Leu Met Leu His Val Leu Arg Asn Ala He Gin Val Ala Val 

35 40 45 

Ser Xaa Leu Cys Met Ser He Pro Met He Met Gly Gly Leu Val Val 

50 55 60 

He Glu Tyr He Xaa Ala Trp Xaa Gly Leu Gly Gin Leu Ser Leu Xaa 
65 70 75 80 

Ala Xaa Thr Trp Asn Thr He Xaa Gin Xaa Xaa Lys His Met Tyr 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 



Met Asn Thr Leu Lys Leu Thr He 

1 5 
Thr Ser He He Leu Gly Val Val 
20 

Thr Asp Arg Ala He Arg Ser Val 

35 40 
Ser Cys Arg He Arg His Glu Arg 
50 55 



He Ser Ser Val Met Val Met He 

10 15 
Ser Ala Leu Lys Arg Gly Lys Phe 
25 30 
Ala Phe Phe Leu Thr Ala Leu Ala 
45 

His 



(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE 



TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

Met Asn Phe Asn Asn Leu Asp Gin Leu Tyr Arg Ser Val lie Met Asp 

15 10 15 

His Tyr Lys Asn Pro Thr Asn Lys Gly Val Leu Xaa Asn Gly Ser Met 

20 25 30 

Thr Val Asp Met Asn Asn Pro Thr Cys Gly Asp Arg lie Arg Leu Thr 

35 40 45 

Phe Asp lie Glu Asp Gly lie lie Lys Asp Ala Lys Phe Glu Gly Glu 

50 55 60 

Gly Cys Ser lie Ser Met Ala Ser Ala Ser Met Met Thr Gin Ala Val 
65 70 75 80 

Lys Gly Asn Pro Xaa Gly Glu Ala lie Gin Xaa Asp Pro Arg Asn Leu 

85 90 95 

Pro Xaa Met Met Leu Gly Trp Lys Pro Xaa Val Xaa Gin Gly Arg Met 
100 105 110 

Gly Arg Tyr 
115 



(2) INFORMATION FOR SEQ ID NO: 298: 



{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 



Met Ser Xaa Xaa Xaa Gly Xaa Thr Val Thr Ala Thr Xaa Xaa Lys Val 

15 10 15 

Xaa Val Lys Pro Asp Asp Gin Val Leu Ala Gly Tyr Thr Asp Xaa Asn 

20 25 30 

Lys Leu Val Asn Phe Lys Ala Pro Lys Glu Met lie Gly 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 299: 

U) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(iil MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 



Met lie Asp Val lie Ser Glu Gly Gly Asn lie Val Pro His lie His 

15 10 15 

Leu Pro Val Gin Ser Gly Asn Asn Ala Val Leu Lys lie Met Gly Arg 

20 25 30 

Lys Tyr Thr Arg Glu Ser Tyr Leu Asp Leu Val Lys Arg He Lys Asp 

35 40 45 

Arg Leu Pro Asn Val Ala Leu Thr Thr Asp He He Val Gly Tyr Pro 

50 55 60. 

Asn Glu Ser Glu Glu Gin Phe Glu Glu Thr Leu Thr Leu Tyr Asp Glu 
65 70 75 80 

Val Gly Phe Glu His Ala Tyr Thr Tyr Leu Tyr Ser Gin Arg Asp Gly 

85 90 95 

Thr Pro Ala Ala Lys Met Lys Asp Asn Val Pro Leu Asn Val Lys Lys 

100 105 HO 

Glu Arg Leu Gin Arg Leu Asn Lys Lys Val Gly His Tyr Ser Gin He 

115 120 125 

Ala Met Ser Lys Tyr Glu Gly Gin Thr Val Thr Val Leu Cys Glu Gly 

130 135 140 

Ser Ser Lys Lys Asp Asp Gin Val Leu Ala Gly Tyr Thr Asp Lys Asn 
145 150 155 160 

Lys Leu Val Asn Phe Lys Ala Pro Lys Glu Met He Gly 
165 170 



(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
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Met He Ala Met Ser Phe Leu Tyr Leu Gin Gly Gly Arg Leu He Asp 

15 10 15 

Val Leu Thr Ala He Leu Ala Gly Ser Leu Gly Tyr Leu Val Thr Glu 

20 25 30 

He Leu Asp Arg Lys Leu His Ala Gin Phe lie Pro Glu Phe He Gly 

35 40 45 

Ser Leu Val He Gly He He Ala Val He Gly His Thr Leu He Pro 

50 55 60 

Thr Gly Asp Leu Ala Thr He He lie Ala Ala Val Met Pro lie Val 
65 70 75 80 

Pro Gly Val Leu He Thr Asn Ala lie Gin Asp Leu Phe Gly Gly His 

85 90 95 

Met Leu Met Phe Thr Thr Lys Ser Leu Glu Ala Leu Val Thr Ala Phe 

100 105 HO 

Gly He Gly Ala Gly Val Gly Ser Val Leu He Leu Val 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 



Met Phe Lys Ser Tyr Met Asp He Thr Pro Asn Glu Tyr Arg Asn Leu 

1 .5 10 15 

Ser Lys Tyr Asn Lys Cys Leu Met Leu Lys Pro Glu Pro Leu Val Gly 

20 25 30 

Lys Met Val Gin Glu Val Lys Glu He He Leu Asn Tyr He Glu His 

35 40 45 

Tyr Lys Asn His Pro Thr Asp Val lie His He Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO:302: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 124 amino acids 
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(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Xaa Xaa Lys lie Gly Pro Ser lie Xaa Phe Lys Ala Xaa Tyr 

15 10 15 

Ala Pro Phe Leu Phe Glu Lys Leu Met Met Pro Leu Ser lie Leu lie 

20 25 30 

Pro Val Gly Ala He Ala Leu Ser Leu Leu Val Gly Tyr Gly Leu Leu 

35 40 45 

Glu Phe Val Gly Val Tyr Met Glu Pro He Met Arg Pro He Phe Lys 

50 55 60 

Thr Pro Gly Lys Ser Ala Val Asp Ala Val Ala Ser Phe Val Gly Ser 

€5 70 75 80 

Tyr Ser Leu Gly Leu Leu He Thr Asn Arg Val Tyr Lys Gin Gly Met 

85 90 95 

Tyr Asn Lys Arg Glu Ala Thr He He Ala Thr Gly Phe Ser Thr Val 

100 105 110 

Ser Ala Thr Phe Met He He Val Ala Lys Xaa Pro 



(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

Met Lys Lys Phe Asn Val Gin He Thr Tyr Thr Gly Met He Glu Glu 

15 10 15 

Thr He Glu Ala Glu Ser Leu Glu Glu Ala Glu Asn Glu Ala Arg Asp 

20 25 30 

He Ala Met Met Glu Val Pro Phe Asp Cys Asp Glu Tyr Glu He Asn 



115 



120 



35 



40 



45 
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Val Glu Glu Lys Glu 
50 
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(2) INFORMATION FOR SEQ ID NO: 304: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

Met Thr Asn Thr Leu Gin Val Lys Leu Leu Ser Lys Asn Ala Arg Met 

15 10 15 

Pro Glu Arg Asn His Lys Thr Asp Ala Gly Tyr Asp He Phe Ser Ala 

20 25 30 

Glu Thr Val Val Leu Glu Pro Gin Glu Lys Ala Val He Lys Thr Asp 

35 40 45 

Val Ala Val Ser He Pro Glu Gly Tyr Val Gly Leu Leu Thr Ser Arg 

50 55 60 

Ser Gly Val Ser Ser Lys Thr His Leu Val He Glu Thr Gly Lys He 
65 70 75 80 

Asp Ala Gly Tyr His Gly Asn Leu Gly He Asn He Lys Asn Asp Ala 

85 90 95 

He Ala Ser Asn Gly Tyr He Thr Pro Gly Val Phe Asp He Lys Gly 

100 105 110 

Glu He Asp Leu Ser Asp Ala He Arg Gin Tyr Gly Thr Tyr Gin He 

115 120 125 

Asn Glu Gly Asp Lys Leu Ala Gin Leu Val He Val Pro lie Trp Thr 

130 135 140 

Pro Glu Leu Lys Gin Val Glu Glu Phe Glu Cys Val Ser Glu Arg Gly 
145 150 155 160 

Ala Lys Gly Phe Gly Ser Ser Gly Val 
165 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 



Met Thr Phe Thr Leu Ser Asp Glu Gin Tyr Lys Asn Leu Cys Thr Asn 

1 5 10 15 

Ser Asn Lys Leu Leu Asp Lys Leu His Lys Ala Leu Lys Asp Arg Glu 

20 25 30 

Glu Tyr Lys Lys Gin Arg Asp Glu Leu lie Gly Asp lie Ala Lys Leu 

35 40 45 

Arg Asp Cys Asn Lys Glu Leu Glu Lys Lys Ala Ser Ala Trp Asp Arg 

50 55 60 

Tyr Cys Lys Ser Val Glu Lys Asp Leu lie Asn Glu Phe Gly Asn Asp 
65 70 75 80 

Asp Glu Arg Val Lys Phe Gly Met Glu Leu Asn Asn Lys lie Phe Met 

65 90 95 

Glu Asp Asp Thr Asn Glu 
100 



(2) INFORMATION FOR SEQ ID NO:306: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

Met Asn Pro Pro Tyr Gly Arg Ser lie Lys Arg Trp Val Lys Lys Ala 

15 10 15 

Tyr Glu Glu Ser Leu Lys Gly Ala Thr Val Val Cys Leu lie Pro Ala 

20 25 30 

Arg Thr Asp Thr Thr Tyr Trp His Asp Tyr lie Phe Asn Lys Ala Asp 

35 40 45 

Asp lie Arg Phe Leu Arg Gly Arg Leu Lys Phe Gly Asp Ser Lys Asn 

50 55 60 

Arg Ala Pro Phe Pro Ser Ala lie lie Val Tyr Arg Gly Ala Gin Met 
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65 70 75 80 

Ser Lys Tyr Asn Ala Lys Lys Val Xaa Ser Thr Lys Gly He Xaa Phe 

85 90 95 

Trp He Ala Lys Xaa Glu Cys Xaa He Phe Thr Lys Tyr Leu Glu Ser 

100 105 110 

Asn He Glu Trp His 
115 

(2) INFORMATION FOR SEQ ID NO: 3 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

Met Tyr Pro He Lys Asn Leu Glu Asn Leu Tyr Asp Lys Glu Gly Tyr 

1 5 10 15 

Arg Asp Gly Glu Phe Lys Lys Gly Asp Lys Gly Thr Trp Thr He Ser 

20 25 30 

Thr Asp Phe Ala Lys Ser Asn Lys Gin Gly Glu Met Asn Ser Glu Gly 

35 40 45 

Met Val Leu His Phe Asn Arg Asn Thr Gly Ala Ala Thr Gly Tyr Tyr 

50 55 60 

Thr Val Arg Ala Thr Tyr Asp Val Val Asp Lys Leu Xaa Arg Ser Xaa 
65 70 75 80 

Gin He Ser Cys Cys Xaa 
85 

(2) INFORMATION FOR SEQ ID NO: 308 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
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Met Cys Val Val Tyr Arg Thr Ser Val Leu lie Leu Leu Ala Ser Gly 

15 10 15 

Cys Ser Gly Val Phe Asp Thr Pro Glu Asp Ser Lys Glu Thr Gin lie 

20 25 30 

Lys Lys Ser Phe Ala Lys Thr Leu Asp Met Tyr Pro lie Lys Asn Leu 

35 40 45 

Glu Asp Leu Tyr Asp Lys Glu Gly Tyr Arg Asp Gly Glu Phe Lys Lys 

50 55 60 

Gly Asp Lys Gly Thr Trp Val Val Arg Ser Glu Met lie lie Gin Pro 
65 70 75 80 

Lys Arg 



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

Met Lys Glu Asn Arg Thr Thr Met Lys Lys Asp lie Arg Leu Lys Trp 

1 5 10 15 

Ser Asp Asn Lys lie lie Pro Thr Lys Glu lie Xaa Asp Xaa Asn lie 

20 25 30 

Lys Lys Glu Xaa Glu Asn Phe Lys Phe Phe Val Gin Tyr Gly Asn Phe 

35 40 45 

Lys Asp Leu Ser Lys Tyr Lys Asp Gly Asp lie Ser Tyr Asn Pro Glu 

50 55 60 

Val Pro Ser Tyr Ser Pro Lys Tyr Gin Val Thr Asn Asp Asp Tyr Asn 
65 70 75 80 

Val Xaa Gin Xaa Arg Lys Arg Tyr Arg Tyr Thr Asp Arg lie Lys His 
85 90 95 

Xaa Ser Tyr Cys 
100 



(2) INFORMATION FOR SEQ ID NO: 310: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

Met Phe Leu Xaa Leu Asn Arg Xaa Thr Arg Ser Ala Xaa Gly Phe Tyr 

15 10 15 

Tyr Val Asn Ala He Lys Lys Xaa Val Asn Gly Arg Pro Gin Asp Asn 

20 25 30 

Gin He Glu Tyr Pro Val Glu Met He Asp Asn Lys Xaa He Pro Thr 

35 40 45 

Lys Asp He Xaa Asp Xaa Xaa He Lys Lys Glu He Glu Asn Phe Lys 

50 55 60 

Phe Phe Ala Gin Tyr Gly Asn Phe Lys Asp Leu Thr Lys Tyr Lys Gly 
65 70 75 80 

Gly Asp He Ser Tyr Asn Pro Glu Ala Pro He Tyr Ser Ala Lys Tyr 

85 90 95 

Xaa Leu Xaa Asn Asp Asp Tyr Asn Val Lys Gin Leu Arg Lys Arg Tyr 

100 105 HO 

Asp He Pro Thr Asn Lys Ala Pro Lys Leu Leu Leu Xaa Gly Thr Gly 

115 120 125 

Asn Leu Asp Gly Thr Ser He Gly Xaa Xaa Lys He Glu Xaa Thr Xaa 

130 135 140 

Gly Xaa Lys Xaa Gly Glu Asn Xaa His Phe Thr Xaa Asn Leu His Phe 
145 150 155 160 

Lys Pro Ser Asn Asp Glu Xaa Ser 
165 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



Met Lys Asn Asn Lys lie lie Leu Leu Asp Glu Val Lys Asp Asp Lys 

15 10 15 

Leu Lys Gin Lys lie Glu Asn Phe Lys Phe Phe Gly Gin Tyr Ala Asn 

20 25 30 

Leu Lys Glu Leu Arg Lys Tyr Asn Asn Gly Asp Val Ser lie Asn Glu 

35 40 45 

Asn Val Pro Ser Tyr Asp Val Glu Tyr Lys Met Ser Asn Lys Asp Glu 

50 55 60 

He Val Lys Glu Leu Arg Ser Arg Tyr Asn He Ser Thr 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 312: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

<C ) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

Met Gly He Ser Lys Ala Met Met Glu Lys Val Phe Val Ala Lys Ser 
k 1 5 10 15 

Arg Asn lie Arg Ser Glu Gin Thr Leu He Cys Gly Thr Arg Tyr Gly 

20 25 30 

Asn Val Met Ala Ser Arg Gly Ser Val He Pro Leu Phe He Asp Lys 

35 40 45 

He Lys Ala Gly Glu Pro Leu Thr He Thr Asp Pro Asp Met Thr Arg 

50 55 60 

Phe Leu Met Ser Leu Glu Asp Ala Val Glu Leu Val Val His Ala Phe 
65 70 75 80 

Lys His Ala Glu Thr Gly Asp He Met Val Gin Lys Ala Pro Ser Ser 

85 90 95 

Thr Val Gly Asp Leu Ala Thr Ala Leu Leu Glu Leu Phe Glu Ala Asp 

100 105 110 

Asn Ala He Glu He He Gly Xaa Leu Val Pro Trp Arg Glu Lys Ala 

115 120 125 

Xaa Thr Xaa Leu Pro Xaa Glu Glu Tyr Xaa Pro Xaa 
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130 



135 



140 



(2) INFORMATION FOR SEQ ID NO: 3 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

Met Met Gin Gin Leu Ser Leu Lys His Arg Leu Asn Asn Gly Asp Ser 

15 10 15 

Val Tyr Gly lie Ser Asn Ser lie Pro Asp Pro Leu Met lie Glu Val 

20 25 30 

lie Ala Ala Ser Gly Tyr Asp Xaa Val Val Xaa Asp Thr Glu His Val 

35 * 40 45 

Ala Xaa Xaa Asp Glu Thr Leu Ala His Ser Xaa Ser Arg Ser 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Met Ser Phe Thr Ala Pro Thr Lys Gin Pro He Val Lys Pro Lys Glu 

15 10 15 

Lys Thr Lys Tyr Glu Phe Lys Ser Gly Gly Lys Lys Lys Met Val He 

20 25 30 

Ala Glu Ala Asn Lys Val Thr Pro He Gly Asn Phe He Pro Gly Thr 

35 40 45 

Tyr Arg lie Pro Ala Met Lys Ser Thr Glu Asn Gly Asp Phe Xaa Gly 



50 



55 



60 
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Tyr Leu Lys Phe Asp Phe Arg Gin Ser Asn Ser Glu Thr Val Asp Val 
65 70 75 80 

Thr Glu Asp Phe Glu Glu Ala Asn He Thr Val Thr Leu Lys Gly Asp 

85 90 95 

Thr Lys Leu Asn Asp Ser Ser Lys Lys Val Thr He Asn Asp Arg Glu 

100 105 HO 

Met Ala Phe Ser Ser Ser Lys Thr Tyr Gly Pro Tyr Pro Gin Asn Lys 

115 120 125 

Asp He Thr He Ser Ala Ser Gly Lys Ala Lys Gly Lys Thr Phe Thr 

130 135 140 

Thr Gin Thr Lys Thr He Lys Ala Ser Asp Leu Xaa Tyr Asn Thr Glu 
145 150 155 160 

Asp Asn Phe Glu Phe Trp Asp Xaa Glu Asp He Xaa Gly Leu Cys 
165 170 175 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



Met Met He Ser Ser Pro Gin He 

1 5 
Lys He Thr Ala Thr Val Arg Leu 
20 

Asp Lys Glu Tyr Glu Leu Glu Gin 

35 40 
He Lys Thr Ser Glu Lys 
50 



He Asp Ala Glu Lys His Gly Asp 

10 15 
He Asn Glu Asn Gly Lys Gin Val 
25 30 
Gly Ser Gin Asp Arg Leu Gin Leu 
45 



(2) INFORMATION FOR SEQ ID NO:316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



Met Xaa Lys Pro Thr Gin His Tyr lie Phe Asn Leu Pro Asn Asn Glu 

1.5 10 15 

Gly Leu Ser Ser Leu Leu Leu Asn Trp Ser Thr Tyr Gin Asp Ser lie 

20 25 30 

lie Ser Thr Glu lie Gin Asp Leu Asp Val Leu Thr Ser Gly Pro He 

35 40 45 

Pro Pro Asn Pro Ser Glu Leu He Thr Ser Arg Ala Phe Ala Asn Leu 

50 55 60 

Tyr Asp Thr Leu Leu Met Asn Tyr Tyr Leu Val lie He Asp Thr Pro 
65 70 75 80 

Gin 



(2) INFORMATION FOR SEQ ID NO: 317: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 204 amino acids 
(BJ TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 



Met Ser Lys Lys Glu Asn Thr Thr Thr Thr Leu 

15 10 
Pro Asn Ser Thr He Ser Glu Lys Xaa Arg Gly 

20 25 
Met Phe Ser Lys Ala Asn Gly Glu Val Lys Arg 

35 40 
Glu Lys Pro Gly Ala Gly Lys Ser Thr Val Val 

50 55 
Thr Tyr Ala Gin Ala Gly Tyr Lys Thr Leu Val 
65 70 75 

Arg Lys Pro Thr Gin Asn Tyr He Phe Asn Glu 

85 90 
Leu Ser Ser Leu He He Gly Arg Xaa Thr Met 

429 



Phe Val Tyr Glu Asn 
15 

He Arg Xaa Asn He 
30 

Leu Leu Xaa Thr Ser 
45 

Ser Asn Val Ala He 
60 

He Asp Gly Asp Met 
80 

His Asn Asn Asn Gly 
95 

Ser Glu Xaa He Thr 
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100 



105 



110 



Ser Thr Glu lie Glu Asn Leu Asp Leu Leu Thr Ala Xaa Pro Val Pro 

115 120 125 

Pro Asn Pro Ser Glu Leu lie Gly Ser Glu Arg Phe Lys Glu Leu Val 

130 135 140 

Asp Leu Phe Asn Lys Arg Tyr Asp lie lie lie Val Xaa Thr Pro Pro 
145 150 155 160 

Val lie Thr Val Thr Xaa Ala Gin Leu Xaa Ala Leu Val Leu Leu Lys 

165 170 175 

lie Val Leu Leu Val lie Asp Ser Glu Lys Asn Asp Lys Asn Glu Val 

180 185 190 

Lys Lys Ala Lys Ala Leu Met Glu Lys Ala Gly Arg 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318: 

Met Thr Xaa Asn Thr Val Thr Leu His Thr Ala His He Val Ser Leu 

15 10 15 

Gly Asp He Glu Glu Ala Lys Ala Ser He Lys Pro Phe He Arg Arg 

20 25 30 

Thr Pro Leu He Lys Ser Met Tyr Leu Ser Xaa Asn He Thr Lys Gly 

35 40 45 

Asn Val Tyr Leu Lys Leu Glu Asn Met Gin Phe Thr Gly Ser Phe Lys 

50 55 60 

Phe Arg Gly Ala Ser Asn Lys He Asn His Leu Ser Asp Xaa Xaa Lys 
€5 70 75 80 

Ala Lys Gly He He Gly Ala Ser Ala Gly Asn His Ala Gin Gly Val 

85 90 95 

Ala Leu Thr Ala Lys Leu Leu Gly He Asp Ala Thr He Val Met Pro 

100 105 110 

Glu Thr Ala Pro lie Ala Lys Gin Asn Ala Thr Lys Gly Tyr Gly Ala 

115 120 125 

Lys Val He Leu Lys Gly Xaa Asn Phe Asn Glu Thr Arg Leu 



195 



200 



(2) INFORMATION FOR SEQ ID 



NO:318: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 142 amino acids 
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130 



135 



140 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met Xaa Xaa Thr Arg Xaa Arg Ser Xaa Xaa Xaa Thr Xaa Leu Asn Xaa 

15 10 15 

Arg Lys Ser Ala Val Met Ser Xaa Glu lie Thr Ser Xaa Asp Trp Xaa 

20 25 30 

Val Thr Xaa Pro His Glu Xaa lie Ser Xaa Gly Xaa Xaa Xaa Arg Val 

35 40 45 

Lys His Arg Asp Phe Asp Pro Pro Leu Tyr Val Glu Ala Glu Val lie 

50 55 60 

Ala Glu Glu Tyr Asn lie lie Ser Glu Asn Ser Thr Tyr Thr Phe Gly 
65 70 75 80 

Gin Pro Lys Glu Phe Lys Glu Ser Glu Leu Arg Glu Glu Phe Asn Lys 

85 90 95 

Arg Leu Asn lie lie His Gin Lys Leu Asn Asp Asn lie Ser Asn lie 

100 105 110 

Asn Thr lie Val Lys Asp Val Val Asp Ser Glu Leu Glu Tyr Phe Glu 

115 120 125 

Arg Lys lie His Lys Ser Asp Thr Pro Pro Glu Asn Pro Val Asn Asp 

130 135 140 

Met Leu Trp Tyr Asp Thr Ser Asn Pro Asp Val Ala Val Leu Arg Arg 
145 150 155 160 

Tyr Trp Asn Gly Arg Trp He Glu Glu Thr Pro Asn Asp Val Xaa Lys 

165 170 175 

Leu Gly Gly He Thr Arg Glu Glu Ser Ala He Gin 



180 



185 



(2) INFORMATION FOR SEQ ID 



NO:320: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Ala Lys Glu Leu Glu Ser Gly Lys Lys Leu Ser Asp Asp Thr Phe 

1 5 10 15 

Lys Leu Ser Arg His Thr Gly Leu lie Ala Glu Glu Val Glu Glu Leu 

20 25 30 

Gly Phe Asn Glu Phe Val He Tyr Asp Asp Asn Gly Glu He Glu Gly 

35 40 45 

He Ala Tyr Asp Arg Leu Trp Val His Leu Xaa Pro He He Thr Xaa 

50 55 60 

Lys Pro Xaa Xaa Lys Xaa Arg Lys Thr Xaa Trp Xaa Xaa 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 321: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 



Met Val Gly Glu Gly Leu Ala He Lys Ala His Glu Glu Ser Lys Val 

1 5 10 15 

He Ala Pro Phe Asn Gly Leu He Ser Met He Val Pro Thr Lys His 

20 25 30 

Ala Val Gly He Gin Ser Glu Asp Gly Val Asp He Val He His He 

35 40 45 

Gly Val Lys Xaa Val Asp Leu Glu Gly Lys Gly Phe Lys Tyr Phe Val 

50 55 60 

Asn Gin Asn Asp His Val Glu Ala Gly Gin Thr Xaa Leu Gin Phe Asp 
65 70 75 80 

Gin Gin Tyr He Gin Gin Xaa Gly Tyr Asn Ala Asp Xaa He Val Val 
85 90 95 
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He Ser Asn Ser Ala Asp Leu Gly Lys Val Glu Leu Thr Met Asn Glu 

100 105 110 

He He Thr Thr Glu Asp Val He Phe Xaa He Phe Lys Asn 



{2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

Met Gly Leu Lys Cys Leu Arg Thr Ser He Ala Trp Thr Arg He Phe 

15 10 15 

Pro Asn Gly Asp Glu Asp Val Pro Asn Glu Glu Gly Leu Ala Phe Tyr 

20 25 30 

Asp Arg He Phe Asp Glu Leu He Ala Gin Gly He Glu Pro Val Val 

35 40 45 

Thr Leu Ser His Phe Glu Met Pro Leu His Leu Ala Lys His Tyr Gly 

50 55 60 

Gly Phe Arg Asn Arg Glu Val Val Asp Tyr Phe Val His Phe Ala Arg 
65 70 75 80 

Val Val Phe Glu Arg Tyr Lys Asp Lys Val Thr Tyr Trp Met Thr Phe 

85 90 95 

Asn Glu He Asn Asn Gin Met Asp Thr Ser Asn Pro He Phe Leu Trp 

100 105 110 

Thr Asn Ser Gly Val Ala Leu Thr Glu Asn Asp Asn Pro Glu Glu Val 

115 120 125 

Leu Tyr Gin Val Ala His His Glu Leu Leu Ala Ser Ala Leu Ala Val 

130 135 140 

Arg Leu Gly Lys Glu He Asn Pro Lys Phe Lys He Gly Thr Met He 
145 150 155 160 

Ser His Val Pro He Tyr Pro Tyr Ser Cys His Pro Lys Asp Met Met 

165 170 175 

Glu Ala Gin He Ala Asn Arg Leu Arg Phe Phe Phe Pro Asp Val Gin 

180 185 190 

Val Arg Gly Tyr Tyr Pro Ser Tyr Ala Lys Lys Met Leu Ala Arg Lys 



115 



120 



125 



195 



200 



205 
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Gly Tyr Asp Val Gly Trp Gin Glu Gly Asp Asp Ser He Leu Gin Gin 

210 215 220 

Gly Thr Xaa Asp Tyr He Gly Phe Ser Tyr Tyr Met Ser Thr Ala Val 
225 230 235 240 

Lys His Asp Val Asp Thr Thr Xaa Glu Asn Asn He Val Asn Gly Gly 

245 250 255 

Phe Gly Phe He Cys Gly Glu Ser Ala Tyr Arg Asn Glu 
260 265 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Met Tyr Xaa Arg Val Met Asp Leu Gly Val His Cys Ser Val Gly Gly 

15 10 15 

Gly Ala He Thr Arg Pro Xaa Glu He Thr Lys Arg Phe Gly Xaa Val 
20 25 30 

Met Glu Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:324: 

Met Xaa Asn Leu He Pro Lys Asn Pro Gin Xaa Pro Thr Gin Asp Glu 

15 10 15 

Leu Lys His Lys Ser Lys Pro Ala Gin Ser Phe Asn Xaa Asp Val Asn 
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20 



25 



30 



Gin Lys 



Asp 
35 



Thr 



Arg 



Ala 



Thr 



Ser 



Leu Phe 



Glu 



Thr Asp Pro 



Ser He 



40 



45 



Ser Asn Asn Asp Asp Ser Gly Gin Phe Asn Val Val Asp Ser Lys Asp 

50 55 .60 

Thr Arg Gin Phe Val Lys Ser He Ala Lys Asp Ala His Arg He Gly 
65 70 75 80 

Gin Asp Asn Asp He Tyr Ala Ser Val Met He Ala Gin Ala He Leu 

85 90 95 

Glu Ser Asp Ser Gly Arg Ser Ala Leu Ala Lys Ser Pro Asn His Asn 

100 105 HO 

Leu Phe Gly He Lys Gly Ala Phe Glu Gly Asn Ser Val Pro Phe Asn 

115 120 125 

Thr Leu Gly Ser 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Cys Val Gly Leu Ala He Met Thr Trp Val He Ser Leu Xaa Asn 

15 10 15 

Val Ser Val Lys His Pro Gly Thr His Xaa Thr He Tyr lie Lys Asn 

20 25 30 

He He Ser His Asp Gly Xaa Thr Met He Met Asn Asp Thr He Lys 

35 40 45 

Xaa Phe Ser Glu Xaa Pro Ala Leu Gly Leu Val Leu Ala Xaa Met He 

50 55 60 

Gly He Gly Val Ala Glu Xaa Asn Arg He Leu Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 124 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

Met Ser Lys Ser Glu Gin Ala Asp lie Lys Asp Val Gly Gly Phe Val 

15 10 15 

Gly Gly Tyr Leu Lys Glu Gly Lys Arg Arg Ala Gly Gin Val Met Asn 

20 25 30 

Arg Ser Met Leu Thr Leu Asp lie Asp Tyr Ala Ala Gin Asp Met Thr 

35 40 45 

Asp lie Leu Ser Met Phe Tyr Asp Phe Ala Tyr Cys Leu Tyr Ser Thr 

50 55 60 

His Lys His Arg Glu lie Ser Pro Arg Leu Arg Leu Val lie Pro Leu 
65 70 75 80 

Lys Arg Asn Val Asn Ala Asp Glu Tyr Glu Ser Tyr Trp Ala Leu Ser 

85 90 95 

Pro Gin lie Ser Leu Gly Met Asp Tyr Phe Asp Asp Thr Thr Tyr Gin 

100 105 110 

Pro His Arg Leu Asn Val lie Gly Leu Pro Pro Ser 



(2) INFORMATION FOR SEQ ID NO: 327: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

Met Met Lys Leu He Lys Asn Gly Lys Val Leu Gin Asn Gly Glu Leu 

15 10 15 

Gin Gin Ala Asp He Leu He Asp Gly Lys Val He Lys Gin He Ala 

20 25 30 

Pro Ala He Asp Pro Ser Asn Gly Val Gly His His Arg Cys Glu Arg 



115 



120 



35 



40 



45 
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Ser Leu Gly Gly His Leu Asp Leu Val Asp Val Gin Val Pro Phe Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

Met Thr lie Tyr Glu Glu Tyr Gly Tyr Phe Glu Gly Leu Asp Val Leu 

1 5 10 15 

lie Cys Gly Asp lie Lys Asn Ser Arg Val Ala Arg Ser Asn Tyr His 

20 25 30 

Ser Leu Lys Ala Leu Gly Ala Asn Val Met Phe Asn Ser Pro Asn Ala 

35 40 45 

Trp lie Asp Asp Ser Leu Glu Ala Pro Tyr Val Asn lie Asp Asp Val 

50 55 60 

lie Glu Thr Val Asp lie Val Met Leu Leu Arg lie Gin His Glu Arg 
65 70 75 80 

His Gly Leu Ala Glu Glu Thr Arg Phe Ala Ala Asp Asp Tyr His Gin 

85 90 95 

Lys His Gly Leu Asn Glu Val Arg Tyr Asn Lys Leu Gin Glu His Ala 

100 105 110 

lie Val Met His Pro Ala Pro Val Asn Arg Gly Val Glu lie Gin Ser 

115 120 125 

Asp Leu Val Glu Ala Ser Lys Ser Arg He Phe Lys Gin Met Glu Asn 

130 135 140 

Gly Val Tyr Leu Arg Met Ala Val lie Asp Glu Leu Leu Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO:329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Asn His Leu Leu Ser Met Glu His Leu Ser Thr Asp Gin He Tyr 

1 5 10 15 

Lys Leu He Gin Lys Ala Ser Gin Phe Lys Ser Gly Glu Arg Gin Leu 

20 25 30 

Pro Asn Phe Glu Gly Lys Tyr Val Ala Asn Leu Phe Phe Glu Asn Ser 

35 40 45 

Thr Arg Thr Lys Cys Ser Phe Glu Met Ala Arg Ala Ala Leu Glu Leu 

50 55 60 

Gly Leu Lys Thr He Ser Phe Glu Thr Ser Thr Ser Ser Val Ser Lys 
65 70 75 80 

Gly 



(2) INFORMATION FOR SEQ ID NO:330: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Met Met Ser Thr He Xaa Val Xaa Gly Arg Arg Asp Xaa He He 

15 10 15 

He Arg Ser Asp Tyr Thr Xaa Gly Pro Val Glu Asp Glu Asp Xaa Leu 

20 25 30 

Xaa Xaa His Pro Xaa Val Lys Glu Cys Ala Val Val Xaa Xaa Pro His 

35 40 45 

Asp He Arg Gly Asn He Val Lys Ala Phe Val He Leu Gin Asp His 

50 55 60 

Thr Ala Gly Asp Asp Thr Leu Val Lys Glu Leu Gin Gin Phe Val Lys 
65 70 75 80 

Asn Glu Val Ala Pro Tyr Lys Tyr Pro Arg Glu He Glu Phe Pro Phe 

85 90 95 

Val He Ser Gly He Phe 
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(2) INFORMATION FOR SEQ ID NO:331: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Met Thr Met lie Gin He He Gly Val He Leu Val Thr He Val Leu 

1 5 10 15 

Leu Asn His Trp Asn Phe Trp He Leu Ala He Gly Phe He He Leu 

20 25 30 

He Ala Pro Xaa Thr Gly Val Ala Thr Leu Gly Phe Xaa He Ala Met 

35 40 45 

Asp Glu Ser Ser Ser Gly Arg Gly Ser Ser Ser Ser Leu Leu Gly Leu 

50 55 60 

Val Gin Phe Leu Phe Gly Gly Val Ala Ser Pro Leu Val Gly Val Lys 
65 70 75 80 

Gly Glu Asp Asn Pro He Pro Tyr He He He He He Ala Thr Ala 

85 90 95 

Xaa He Leu He He Xaa Gin He Tyr Asn Met Lys Val Xaa 
100 105 110 

{2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332: 



Met Asn Asp Tyr Ser Thr Thr Val Ala Asn Lys Gin Leu 
15 10 
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Tyr lie lie Gly Gly Gin Xaa Gly Val Phe lie Asn Trp Leu Ser Xaa 

20 25 30 

Gly Cys Xaa Glu Ser Pro Glu Glu Val Ala Asp lie Leu Leu Ala Asn 

35 40 45 

Thr Xaa Lys Leu Gin Xaa Gin 
50 55 

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ill amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

Met Lys Lys Met lie Leu lie Asn Val lie Thr Val Val Val Leu Leu 

15 10 15 

Ala He Gly He Ala Gly Phe Tyr Phe Trp Asn Lys Thr Thr Ser Tyr 

20 25 30 

Val Thr Thr Asp Asn Ala Lys Val Asn Gly Asp Gin He Lys He Ala 

35 40 45 

Ser Pro Ala Ser Gly Gin He Lys Ser Leu Asn Val Lys Gin Gly Asp 

50 55 60 

Lys Leu Asp Lys Gly Asp Lys Val Ala Thr Val Thr Val Gin Gly Gin 
65 70 75 80 

Asp Gly Glu Thr Lys Asp Met Asp Leu Lys Met Pro Gin Lys Gly Thr 

85 90 95 

He Gly Lys Leu Asp Gly Met Glu Gly Ser Met Arg Ala Ser Trp 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 334 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



protein 
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(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 334: 

Met Xaa Gin Ala Asn lie Asp Glu Lys Asp lie Lys Asp Val Glu Val 

15 10 15 

Gly Lys Asp Val Asp Val Thr lie Asp Gly Gin Lys Ala Ser lie Lys 

20 25 30 

Gly Lys Val Asp Ser lie Gly Lys Ala Thr Ala Ala Ser Phe Ser Leu 

35 40 45 

Met Pro Ser Ser Asn Ser Asp Gly Asn Tyr Thr Lys Val Ser Gin Val 

50 55 60 

lie Pro Val Lys lie Thr Leu Glu Ser Glu Pro Ser Lys Gin Val Val 
65 70 75 80 

Pro Gly Met Asn Ala Glu Val Lys lie His Lys Asn Xaa Gly Gly Ser 

85 90 95 

Leu Asn Asp Tyr Asp Leu His Tyr 
100 



(2) INFORMATION FOR SEQ ID NO: 335: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Met lie Pro Gly He Asp Leu Ala Gly Thr lie Val Glu Ser Glu Ala 

1 5 10 15 

Pro Gly Phe Glu Lys Gly Glu Gin Val He Val Thr Ser Tyr Asp Leu 

20 25 30 

Gly Val Ser His Tyr Gly Gly Phe Ser Glu Tyr Ala Arg Val Lys Ser 

35 40 45 

Glu Trp He He Lys Leu Pro Asp Thr Leu Thr Leu Glu Glu Ser Met 

50 55 60 

He Tyr Gly Thr Ala Gly Tyr Thr Ala Gly Leu Ala He Glu Arg Leu 
65. 70 75 80 

Glu Lys Val Gly Met Asn He Glu Asp Gly Pro Val Leu Val Arg Gly 

85 90 95 

Ala Ser Gly Gly Val Gly Thr Leu Ala Val Leu Met Leu Asn Glu Leu 
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100 105 110 

Gly Tyr Lys Val He Ala Ser Thr Gly Lys Pro Arg Cys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Met He He Lys Asn Pro Leu Ala Ser Ser Thr Cys Gin Ala Xaa Glu 

1 5 10 15 

Asp Pro Val Gly Gly Glu Gly He Asn Tyr Val Thr Lys Arg Leu Asn 

20 25 30 

His Ser Gly Ser He Ala Val He Gly Met Thr Ala Gly Asn Thr Tyr 

35 40 45 

Thr Asn Ser Val Phe Pro His He Leu Arg Gly Val Asn He Leu Gly 

50 55 60 

He Asp Ser Val Phe Thr Ala Met Lys Leu Arg His Glu Arg Leu Ala 
65 70 75 80 

Ser Ser Arg Lys Arg Phe Lys Ala 
85 

(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



Met Val Lys Val Thr Tyr Asp He Pro Thr Cys Glu Asp Tyr Cys Ala 
15 10 15 
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Leu Arg lie Asn Ala Gly Met Ser Pro 

20 25 
Lys Gly Leu Pro Asn Ala Leu Phe Thr 

35 40 
Arg Leu He Gly Met Gly Arg Val He 

50 55 
Gin He Val Asp He Ala Val Ser Lys 
65 70 
Gly Arg Leu He Met Glu His He Met 
85 



PCT/US97/02318 

Lys Thr Arg Glu Ala Ala Glu 
30 

Val Thr Leu Tyr Asp Lys Asp 
45 

Gly Asp Gly Gly Thr Val Phe 
60 

Ser Tyr Gin Gly Gin Asp Tyr 

75 80 
Lys Xaa Tyr 
90 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

Met Thr Leu Asn Lys Arg Asn Thr Pro Gly Ser Leu Xaa Pro Lys Phe 

15 10 15 

Leu Asn Val Ser Ser Leu He Gin Tyr He Gin Ala Ala Tyr His Glu 

20 25 30 

Pro Leu Arg Glu Glu Phe Lys Asn Leu Thr Pro Tyr Val Thr Lys Leu 

35 40 45 

Ser Lys Val His Gly Pro Asn His Pro Tyr Leu Val Glu Leu Lys Glu 

50 55 60 

Thr Tyr Asp Thr Phe Lys Ser Gly Met Leu Glu His He Gin Lys Glu 
65 70 75 80 

Asp Asp Val Asp Phe Gin Asn 
85 



(2) INFORMATION FOR SEQ ID NO: 339: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Met His Lys Leu lie He Lys Tyr Asn Lys Gin Leu Lys Met Leu Asn 

15 10 15 

Leu Arg Asp Gly Lys Thr Tyr Thr He Ser Glu Asp Glu Arg Ala Asp 

20 25 30 

He Thr Leu Lys Ser Leu Gly Glu Val He His Leu Glu Gin Asn Asn 

35 40 45 

Gin Gly Thr Trp Gin Ala Asn His Thr Ser He Asn Lys Val Leu Val 

50 55 60 

Arg Lys Gly Asp Leu Asp Asp He Thr Leu Gin Leu Xaa Thr Glu Ala 
65 70 75 80 

Asp Tyr Ala Ser Phe Gly Tyr Pro Ser He Pro Arg He Arg 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340: 

Met Arg Glu He Pro Lys Ser Ser He Xaa Pro Glu His Phe His Leu 

1 5 10 15 

Met Tyr Leu Leu Glu Gin His Ser Pro Tyr Phe He Asp Ala Glu Leu 

20 25 30 

Thr Glu Leu Arg Asp Ser Phe Gin He His Tyr Asp Leu Asn Asp Asn 

35 40 45 

His Thr Pro Cys Asp Asn Leu Lys Ser Phe Thr Lys Asn Glu Lys Leu 

50 55 60 

Arg Tyr Leu Leu Asn He Lys Asn Leu Glu Glu Val Asn Arg Thr Arg 
65 70 75 80 

Tyr Thr Phe Val Trp Ala Pro Asp Glu Leu Phe Phe Thr Arg Asp Gly 

85 90 95 

Lieu Pro He Ala Lys Thr Arg Gly lieu Gin Asn Val Val Asp Pro Xaa 



85 



90 



(2) INFORMATION FOR SEQ ID 



NO:340: 
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100 105 110 

Pro Val Ser Glu Ala Glu Phe Leu Thr Arg Tyr Lys Ala Leu Val lie 

115 120 125 

Cys Ala Phe Asn Glu Lys Gin Ser Phe Asp Ala Leu Val Glu Gly Asn 

130 135 140 

Leu Glu Leu His Lys Gly Thr Pro Phe Glu Thr Lys Val lie Glu Ala 
145 150 155 160 

Ala Thr Leu Asp Leu Leu Thr Ala Phe Leu Asp Glu Gin Tyr Gin Lys 

165 170 175 

Gin Glu Gin Asp Tyr Ser Gin Asn Tyr Ala Tyr Val Arg Lys Val Gly 

180 185 190 

His Thr Val Phe Lys Trp Val Ala lie Gly Met Thr Thr Leu Ser Val 

195 200 205 

Leu Leu lie Ala Phe Leu Ala Phe Xaa Tyr Xaa Ser Val Met Lys His 

210 215 220 

Asn Glu Arg lie Glu Lys Gly Xaa His Ala Xaa Val Arg Asp Asp Tyr 
225 230 235 240 

Thr Gin Val Leu Asn Thr Cys Asp Asp Leu Asp Gly Lys Lys Leu Ala 

245 250 255 

Arg Gly Xaa Leu Xaa He Tyr Ala Arg Gly Tyr Phe His Pro Thr Xaa 

260 265 270 

Leu Gly Xaa Xaa Xaa Arg 
275 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Met Ala Thr Asp Thr Leu Xaa Lys Ala Arg Val Leu Asp He Thr Gly 

1 5 10 15 

Pro Leu Gly Asn Xaa He His Phe Ala Asn Ser Ala Val Asp Arg He 

20 25 30 

Val Pro Leu Gin Xaa Xaa Glu Asn He Leu Asp Val Met Val Glu Pro 

35 40 45 

Phe Tyr Glu Trp Val Val Glu Lys Asp Ala Trp Tyr Gly Pro Glu Leu 
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50 55 60 

Asn His lie Lys Tyr Val Asp Asp Leu Thr Pro Tyr lie Glu Arg Lys 
65 70 75 80 

Leu Leu Thr Val Asn Thr Gly His Ala Tyr Leu Ala Tyr Ala Gly Lys 

85 90 95 

Phe Ala Gly Lys Ala Thr Val Leu Asp Ala Val Lys Asp Ser Ser lie 

100 105 no 

Glu Ala Gly Leu Arg Arg Val Leu Ala Glu Thr Ser Gin Tyr lie Thr 

115 120 125 

Asn Glu Phe Asp Phe Thr 
130 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 



Met Gly Pro Gin Asp lie Ala Val Ala Lys Gly Ala Asp Ser Pro Leu 

15 10 15 

lie Lys Pro Xaa Ala Phe Ala Ser Glu lie His Gly Glu Ser Gly Leu 

20 25 30 

Asp Gly Pro Lys Leu Pro Ser Thr Pro Ser Arg Gin Ala Val Ala Met 

35 40 45 

Pro Ala Ser Asp Val lie lie Asn Lys Val Met Thr Ser Asp Thr Pro 

50 55 60 

Val Thr lie Val Ala Thr Gly Pro Leu Thr Asn Val Ala Thr Ala Leu 
65 70 75 80 

lie Arg Glu Pro Arg lie Ala Glu His lie Glu Ser lie Thr Leu Met 

85 90 95 

Gly Gly Gly Thr Phe Gly Asn Trp Thr Pro Thr Ala Gly lie Ser He 
100 105 110 

Phe Gly 



(2) INFORMATION FOR SEQ ID NO: 343 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino, acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/02318 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

Met Leu Lys Ala Ala Lys Arg Val Leu Glu Ser Gly lie Thr lie Asn 

15 10 15 

Val Phe Gly Leu Asp Val Thr His Gin Val Leu Ala Asp Asn His Val 

20 25 30 

lie Glu Arg Phe Glu Ser lie Asn Asn Pro Val Ala Gin Phe Val Val 

35 40 45 

Glu Leu Leu Gin Phe Phe Lys Lys Thr Tyr Lys Thr His Phe Asn Met 

50 55 60 

Asp Gly Gly Pro lie His Asp Ala Cys Thr Asn Leu Tyr Leu Leu Gin 
65 70 75 80 

Pro Glu Leu Phe Thr Met Val Pro Val Asn Xaa Asp lie 
85 90 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Met Ala Lys Thr Tyr lie Phe Gly His Lys Asn Pro Asp Thr Asp Ala 

1 5 10 15 

He Ser Ser Ala He He Met Ala Glu Phe Glu Gin Leu Arg Gly Asn 

20 25 30 

Ser Gly Ala Lys Ala Tyr Arg Leu Gly Asp Val Ser Ala Glu Thr Gin 

35 40 45 

Phe Ala Leu Asp Thr Phe Asn Val Pro Ala Pro Glu Leu Leu Thr Asp 
50 55 60 
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Asp Leu Asp Gly Gin Asp Val lie 
65 70 
Pro Ser Ser Asp Thr lie Val Ser 
85 

Pro His Arg lie Ala Asn Phe Glu 
100 

Ala Glu Pro Val Gly Cys Thr Ala 
115 120 
Glu Arg Gly Xaa Glu lie Asn Pro 

130 135 
Ala lie lie Ser Asp Ser Leu Leu 
145 150 
Gin Arg Arg Xaa Xaa Arg Trp Lys 
165 
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Leu Val Asp His Asn Glu Phe Xaa 

75 80 
Ala Thr He Lys His Val He Asp 

90 95 
Thr Ala Gly Pro Leu Cys Tyr Arg 
105 110 
Thr He Leu Tyr Lys Met Phe Arg 
125 

Glu He Gly Gly Leu Met Leu Ser 
140 

Xaa Asn Ser Pro Thr Cys Thr Gin 
155 160 

Asn 



(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



Met Thr His Arg Ala Leu Leu Val Val Asp Tyr Ser Tyr Asp Phe He 

15 10 15 

Ala Asp Asp Gly Leu Leu Thr Cys Gly Lys Pro Gly Gin Asn He Glu 

20 25 30 

Asp Phe He Val Ser Arg He Asn Asp Phe Asn Tyr Tyr Gin Asp His 

35 40 45 

He Phe Phe Leu Met Asp Leu His Tyr Leu His Asp He His His Pro 

50 55 60 

Glu Ser Lys Leu Phe Pro Pro His Asn He Val Asp Thr Ser Gly Arg 
65 70 75 80 

Glu Leu Tyr Gly Lys Val Gly Lys Leu Xaa Asp Thr He Lys Ala Gin 

85 90 95 

Pro Asn Val His Phe He Asp Lys Thr Arg Tyr Asp Ser Phe Phe Gly 

100 105 110 

Thr Pro Leu Asp Ser Leu Phe Glu Arg Lys Lys Tyr 
115 120 
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(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 



Met Leu Leu Xaa Arg Leu Leu Ala Tyr Pro Pro Thr Asn Leu Pro Lys 

15 10 15 

lie Gly Gin Glu lie Gly Asn Val Phe Ala Gly He Gly Xaa Leu Met 

20 25 30 

He Ala Phe Ala Gin Asn Ser Ser Asn He Phe Asp Trp Leu Val Lys 

35 40 45 

Leu Thr Ser Gin Phe Arg Ala Trp Ser Glu Gin Val Gly Gin Ser Gin 

50 55 60 

Gly Phe Lys Asp Phe He Ser Tyr Val Gin Glu Asn Gly Pro Thr He 
65 70 75 80 

Met Gin Leu He Gly Asn lie He Lys Ala Leu Val Ala Phe Gly Thr 

85 90 95 

Ala Met Ala Pro lie Ala Ser Lys Leu Leu Asp Phe He Thr Asn Leu 

100 105 110 

Ala Gly Phe He Ala Lys Leu Phe Glu Thr His Pro Ala He Ala Gin 

115 120 125 

Val Ala Gly Val Met Gly lie Leu Gly Gly Val Phe Trp Ala Leu Met 

130 135 140 

Ala Pro He Val Ala He Ser Ser Val Leu Thr Asn Val Phe Gly Leu 
145 150 155 160 

Ser Leu Phe Ser Val Val Glu Lys lie Leu Glu Phe Val Arg Thr Ser 

165 170 175 

Ser Leu Val Thr Gly Ala Leu Glu Ala Leu Thr Gly Val Phe Gly Thr 

180 185 190 

He Ser Ala Pro lie Leu Ala Val He Ala Val He Gly Ala Phe He 

195 200 205 

Gly Val Leu Val Tyr Leu Trp Lys Thr Asn Glu Asn Phe Arg Asn Thr 

210 215 220 

He Thr Glu Ala Trp Asn Gly Val Lys Thr Ala Val Ser Gly Ala He 
225 230 235 240 
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Gin Gly Val Val Gly Trp Leu Thr Glu 
245 

Thr Leu Gin Pro lie Met Pro lie Leu 
260 265 
Met Gin Val Leu Gly Val Val Val lie 

275 280 
Asn lie He Gin Gly Leu Trp Thr Leu 

290 295 
He Gly Gin Val He Ser Arg Gly Lys 
305 310 
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Leu Trp Gly Lys He Gin Ser 
250 255 
Gin Val Leu Gly Gin He Phe 
270 

Gly He He Thr Asn Val Met 
285 

He Thr He Ala Phe Gin Ala 
300 

Ser Lys Ser 
315 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



Met Glu Glu Ser Glu Arg Phe Asn 

1 5 
He Tyr He Cys Gly Asp Glu Lys 
20 

Ala He Lys Asp Val Leu Val Lys 

35 40 
Ala Glu Leu Leu Leu Arg Gin Met 

50 55 
Asp Val Tyr 
65 



Glu Trp He Glu Gin Gly Ala Ala 

10 15 
Cys Met Ala Lys Asp Val His Gin 
25 30 
Glu Arg His He Ser Gin Glu Glu 
45 

Lys Gin Gin Gin Arg Tyr Gin Arg 
60 



(2) INFORMATION FOR SEQ ID NO: 348: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

450 



WO 97/30070 



PCIYUS97/02318 



(xi) SEQUENCE DESCRIPTION : SEQ ID 



NO: 348: 



Met Glu Ser Lys Asp Gin Leu lie Gin Met 
1 5 10 



Leu Glu Ser Leu Lys Asp 



15 



Gin Gin Gin Phe Arg Asp Ala Leu Leu Ala Gin Lys Ala His Ala His 

20 25 30 

Gly Thr Ser Val Asn Asn Ser Ser Lys Phe Leu Asn Arg Arg Ser Lys 

35 40 45 

Ala Lys Lys Lys Lys Gin Ser Lys lie Lys His His Leu lie 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

Met Met Ala Ser Ser lie lie Leu Thr Asn lie Leu Pro Tyr Asp Ala 

15 10 15 

Gin Ala Ala Ser Glu Lys Asp Thr Glu lie Ser Lys Glu lie Leu Ser 

20 25 30 

Lys Gin Asp Leu Leu Asp Lys Val Asp Lys Ala lie Arg Gin lie Glu 

35 40 45 

Gin Leu Lys Gin Leu Ser Ala Ser Ser Lys Ala His Tyr Lys Ala Gin 

50 55 60 

Leu Asn Glu Ala Lys Thr Ala Ser Gin lie Asp Glu He He Lys Arg 
65 70 75 80 

Ala Asn Glu Leu Asp Ser Lys Glu Asn Lys Ser Ser His Thr Glu Met 

B5 90 95 

Asn Gly Gin Ser Asp He Asp Ser Lys Leu Asp Gin Leu Leu Lys Asp 

100 105 110 

Leu Asn Glu Val Ser Ser Asn Val Asp Arg Gly Gin Gin Ser Gly Glu 

115 120 125 

Asp Asp Leu Asn Ala Met Lys Asn Asp Met Ser Gin Thr Ala Thr Thr 

130 135 140 

Lys Tyr Gly Glu Xaa Asp Asp Lys Asn Asp Glu Ala Met Val Asn Lys 



50 



55 



60 
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145 150 155 160 

Ala Leu Glu Glu Leu Glu His Leu Asn Pro Ala Asn Thr Pro lie Arg 
165 170 175 

Lys Met His 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 



Ser Thr Ala Gin Leu Thr Lys Glu Asn Asn Val Lys Ser Leu 

5 10 15 

Ser Asn Thr Asp Arg Glu lie Phe Glu Asn Tyr Met Thr Tyr 

20 25 30 

Ser Asp Phe Arg Val Asn Pro His Asp Thr Glu Leu He He 
35 40 45 

He Leu Lys Gin Leu Leu Ser Ala Glu Gin His Gly Leu Leu 
55 60 
Asn His Asp Pro Lys Ala His Ala He Lys Glu 
70 75 80 

Ala Met Pro Asn Glu Thr Phe Lys Asn He Phe Lys Tyr He 

85 90 95 

His He Val Leu Leu He Gly He Val Ser Phe Leu Lys Gly 

100 105 HO 

Gly Phe Phe His Gly Lys Lys Trp Lys 
115 120 



Met Lys 
1 

Arg Leu 

Met Arg 

Asn Arg 
50 

Ala Leu Asp Phe Phe 

65 

Leu Lys 
Tyr Gin 
Phe Leu 



(2) INFORMATION FOR SEQ ID NO:351: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: 



protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

Met Phe Xaa Lys Phe Asp Arg lie Glu Asp Ser Leu Arg Thr Gin Glu 

1 5 10 15 

Lys lie Tyr Asp Lys Leu Asp Arg Asn Phe Glu Glu Leu Arg His Glu 

20 25 30 

Arg Gin Gly Arg Arg 
35 

(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

Met Arg lie Asp Lys Phe Leu Ala Asn Met Gly Val Gly Thr Arg Asn 

1 5 10 15 

Glu Val Lys Gin Leu Leu Lys Lys Gly Leu Val Asn Val Asn Glu Gin 

20 25 30 

Val lie Lys Ser Pro Lys Thr His He Glu Pro Glu Asn Asp Lys He 

35 40 45 

Thr Val Arg Gly Glu Leu He Glu Tyr He Glu Asn Val Tyr He Met 

50 55 60 

Leu Asn Lys Pro Lys Gly Tyr He Ser Ala Thr Glu Asp His His Ser 
65 70 75 80 

Lys Thr Val He Asp Leu He Pro Glu Tyr Gin His Leu Asn He Phe 

85 90 95 

Pro Val Gly Arg Leu Asp Lys Asp Thr Glu Gly Leu Leu Leu He Thr 

100 105 110 

Asn Asp Gly Asp Phe Asn His Glu Leu Met Ser Pro Asn Lys His Val 

115 120 125 

Ser Lys Lys Tyr Glu Val He Ser Gly Asn Pro He Thr Glu Asp Asp 

130 . 135 140 

Asn Ser Ser Phe 
145 
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(2) INFORMATION FOR SEQ ID NO:353: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 



Met Glu Ala Val Xaa Ser Gly Leu Gly Met He Gly Leu Asn Val Asn 

15 10 15 

Tyr Gly Asn Pro Thr Phe He Arg Asp Gly Glu Asn Gly Tyr Leu Val 

20 25 30 

Pro Phe Asp Thr Xaa Glu Asp Arg Val Asp Asp Val He Ala Lys Leu 

35 40 45 

Ala His Ala He Val Met Tyr Phe Asn Asn Gly Pro Gin Ala Pro His 

50 55 60 

Asp He Ser Tyr Glu Val Ala Gin Gin Phe Met Thr Gin Asp He He 
65 70 75 80 

Leu Lys Trp Glu Thr Leu Xaa Gin Glu Val Leu His Asp 
85 90 



(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 



Met Asp Tyr Ala He. Gin Leu Pro Asn Met Val Gin Gly Xaa Thr Gly 

1 5 10 .15 

Val Asn Pro Pro Val Gly Ala Val Val Val Asn Glu Gly Arg He Val 

20 25 30 

Gly He Gly Ala His Leu Arg Lys Gly Asp Lys His Ala Glu Val Gin 
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35 40 45 

Ala Leu Asp Met Ala Gin Xaa Asn Ala Glu Gly Ala Thr lie Tyr lie 

50 55 60 

Thr Leu Glu Pro Cys Ser His Phe Gly Ser Thr Pro Pro Cys Val Asn 
65 70 75 80 

Lys lie lie Asp Cys Lys lie Ala Xaa Val Val Leu Xaa Asn Xaa Arg 
85 90 95 

Gin Phe Arg 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

Met Gly Arg Pro Thr Phe Leu His Asn lie He Phe Asp Val Ala Arg 

15 10 15 

His His Arg Pro Asp Gin Ala His Met Tyr Leu Phe Asp Phe Gly Thr 

20 25 30 

Asn Gly Leu Met Pro Val Thr Asp He Pro His Val Ala Asp Tyr Phe 

35 40 45 

Thr Glu Asp Pro Arg Arg Pro Arg Leu Leu Arg Arg Tyr Val Tyr Leu 

50 55 60 

Met Met Lys Ser Gly Ser Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Met He Lys Met Thr Arg Glu Gly Leu Ala Leu Asp Met Gin Val 

15 10 15 

Thr Leu Thr Ala Ser Arg Ala Asn Ala Met Lys Thr Pro Met Tyr He 

20 25 30 

Asn Met Lys Thr Arg He Ala Met Phe Leu Tyr Asp Lys Ser Glu Val 

35 40 45 

Ser Asn Val Val Gly Gin Xaa Xaa Xaa Ala Val Lys Asp Val Val Gly 

50 55 60 

Arg Ala Leu Leu Ser Ser Asp Asp Asn Val Ser Phe His He Gly Gin 
65 70 75 80 

Pro Phe Lys His Asp Glu Thr Lys Ser Tyr Asn Asp His Asn 



(2) INFORMATION FOR SEQ. ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met He Ser Ser Glu Asn Pro Arg Glu He Ala His He Ala Glu He 

15 10 15 

Met Met Lys Glu He Asp He Leu Asn Glu Lys Tyr Ala He Cys lie 

20 25 30 

Ala Asp Ser Ser Gly Glu Phe Lys Ala Tyr Arg His Gin Val Ala Asn 

35 40 45 

Phe Ala Glu Glu Arg Glu Asp He Lys Ala He His Gin Leu Met He 

50 55 60 

Glu Asp Leu Lys Gin Arg Glu Met Asp Gly Pro Phe Glu Lys Asp Ser 
65 70 75 80 

Leu Tyr He He Asn Asp Phe Lys Thr Tyr He Asp Cys Thr Tyr He 

85 90 95 

Pro Glu Asp Asp Val Lys Lys Leu He Thr Lys Gly Pro Glu Leu Gly 

100 105 110 

Leu Asn He Leu Phe Val Gly He His Lys Glu Leu He Asp Ala Tyr 



85 



90 



115 



120 



125 
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Asp Lys Gin lie Asp Val Ala Arg Lys Met lie Asn Gin Phe Ser He 

130 135 . 140 

Gly He Pro Tyr Phe Arg Pro Thr He Leu 
145 150 



(2) INFORMATION FOR SEQ ID NO: 3 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE : amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



Met Lys Cys Val Gly Ala Thr Thr Val Ala Thr Thr Met He Cys Ala 

1 5 10 15 

Pro Met Ala Gly He Gin Phe Phe Val Thr Gly Gly He Gly Gly Val 

20 25 30 

His Lys Gly Ala Glu His Thr Met Asp He Ser Ala Asp Leu Glu Glu 

35 40 45 

Leu Ser Lys Thr Asn Val Thr Val He Cys Ala Gly Ala Lys Ser He 

50 55 60 

Leu Asp Leu Pro Lys Thr Met Glu Tyr Leu Glu Thr Lys Gly Val Pro 
65 70 75 80 

Val He Gly Tyr Gin Thr Asn Glu Leu Pro Ala Phe Phe Thr Arg Glu 

85 90 95 

Ser Gly Val Lys Leu Thr Ser Ser Val Glu Thr Pro Glu Arg Leu Ala 

100 105 HO 

Asp He His Leu Thr Lys Gin Gin Leu Asn Leu Glu Gly Gly He Val 

115 120 125 

Val Ala Asn Pro He Pro Tyr Glu His Ala Leu Ser Lys Ala Tyr He 

130 135 140 

Glu Ala He He Asn Glu Ala Val Val Glu Ala Glu Asn Gin Gly He 
145 150 155 160 

Lys Gly Lys Asp Ala Thr Pro Phe Leu Leu Gly Lys lie Val Glu Lys 

165 170 175 

Thr Asn Gly Lys Ser Leu Ala Ala Asn He Lys Leu Val Glu Asn Asn 

180 185 190 

Ala Ala Leu Gly Ala Lys He Ala Val Ala Val Asn Lys Leu Leu 
195 200 205 
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(2) INFORMATION FOR SEQ ID NO:359: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 59: 

Met Lys Phe Gin Leu Glu Glu Leu Ser Glu Ala His Leu Lys Glu Gly 

1 5 10 15 

Glu lie Glu Gin Leu Glu lie Asp lie Lys Arg lie His Asn Ser Glu 

20 25 30 

Lys Leu Ser Leu Ala Leu Asn Asn Ala His Met Thr Leu Thr Asp Glu 

35 40 45 

Asn Ala lie Thr Asp Arg Leu Tyr Glu Leu Ser Asn His Leu Leu Thr 

50 55 60 

lie Asn Asp lie Val Pro Asn Lys Tyr Asp Lys Leu Lys Glu Asp lie 
65 70 75 80 

Asp Gin Phe Tyr Tyr He Leu Glu Asp Ala Lys His Glu Xaa Tyr Asp 

85 90 95 

Glu Met Ala Xaa Thr Glu Phe Asp Glu Gin Val Leu Asn Glu Tyr Glu 

100 105 110 

Ser Arg Met Asn Leu Leu Asn Asn Leu Lys Arg Lys Tyr Gly Lys Asp 

115 120 125 

lie Ser Glu Leu lie Ala Tyr Gin Glu Lys Leu Asn Asn Glu lie Asn 

130 135 140 

Lys lie Glu Asn Tyr Glu Gin Ser Thr Ser Gin Leu Arg Glu Glu lie 
145 150 155 160 

Asn Ala Leu Tyr Asn Gin Val lie Glu Val Gly Gin Ala Leu Ser Lys 

165 170 175 

Gin Arg Arg lie Val Ala Arg Glu Leu Arg Asp His lie Val Ser Glu 

180 185 190 

lie Gin Asn Leu Gin Met Lys Asp Ala Asn Leu Glu lie Ser Phe Lys 

195 200 205 

Lys Leu Glu Glu Pro Asn lie Asp Gly lie Glu Phe Val Glu Phe Leu 

210 215 220 

Xaa Thr Pro Asn Lys Gly Gly Thr Xaa Lys Lys Phe Xaa 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

Met Pro Lys Glu Met Leu Pro He Leu Asp Lys Pro Thr He Gin Tyr 

15 10 15 

He Val Glu Glu Ala Ala Arg Ala Gly He Glu Asp He He He Val 

20 25 30 

Thr Gly Arg His Lys Arg Ala He Ala Asp His Phe Asp Ser Gin Lys 

35 40 45 

Glu Leu Glu Met Val Leu Lys Glu Lys Gly Lys Ser Glu Leu Leu Glu 

50 55 60 

Lys Val Gin Tyr Ser Thr Glu Leu Ala Asn He Phe Tyr Val Arg Gin 
65 70 75 80 

Lys Glu Xaa Lys Gly Leu Gly His Ala He Ser Ser Ala Leu Val Pro 
85 90 95 

He Tyr Arg Gin 
100 

(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Leu Thr He Pro Xaa Lys Glu Asn Arg Gly Ser Lys Glu Gin Glu 

15 10 15 

Val Ala He Met He Asp Ala Leu Ala Asp Thr Gly Xaa Lys Ala Leu 
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20 25 30 

Glu Ala Leu Ser Lys Lys Ser Gin Glu Glu lie Asp His lie Val His 

35 40 45 

Gin Met Ser Leu Ala Ala Val Asp Gin His Met Val Leu Ala Gin Leu 

50 55 60 

Ala His Glu Glu Thr Gly Arg Gly lie Tyr Glu Asp Lys Ala He Lys 
65 70 75 80 

Asn Leu Tyr Ala Ser Glu Tyr He Trp Asn Xaa He Lys Asp Asn Lys 

85 90 95 

Thr Val Gly lie He Gly Glu Asp Lys Glu Lys Gly Leu Thr Tyr Val 

100 105 HO 

Ala Glu Pro He Gly Val He Cys Gly Val Thr Pro Thr Thr Asn Pro 

115 120 125 

Thr Ser Thr Thr He Phe Lys Ala Met He Ala He Lys Thr Gly Asn 

130 135 140 

Pro He He Cys Ala Phe His Pro Ser Ala Gin Glu Ser Ser Lys Arg 
145 150 155 160 

Ala Ala Glu Val Val Leu Glu Ala Ala Met Lys Ala Gly Ala Pro Lys 

165 170 175 

Asp He He Gin Trp He Glu Val Pro Ser He Glu Ala Thr Lys Gin 

180 185 190 

Leu Met Asn His Lys Gly He Ala Leu Val Leu Ala Thr Gly Gly Ser 

195 200 205 

Gly Met Val Lys Ser Ala Tyr Ser Thr Gly Asn Arg His 
210 215 220 

j 

(2) INFORMATION FOR SEQ ID NO: 362: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 62: 



Met Tyr Asn Leu Asp Arg He Xaa Val Tyr Met Arg Leu Arg Arg Val 

15 10 15 

Gly Asn Lys Phe Ser He Lys Thr Trp Lys Phe Asp His He Lys Asp 

20 25 30 

Pro Asp Arg Arg Lys Pro He Asp Met Asp Glu Lys Glu Trp He Asp 
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35 



40 



45 



Gly Gly 



Lys 



Phe Tyr Gin Arg 



Pro Ala Ser lie 



lie Ala lie Tyr Ser 
60 



50 



55 



Ala Lys Tyr Xaa Gly Tyr Lys Trp Met Glu Met Asn Gly Leu Gly Ser 
65 70 75 80 

Phe Asn Thr Glu lie Leu Pro Xaa Pro Lys Gly Ala Arg Asp Val lie 

85 90 95 

lie Gin Lys Gly Asp Leu Val Lys lie Asp Xaa Gin Ala Xaa Ser Val 
100 105 110 

Val He Lys 
115 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 : 

Met Asp Val Asp Thr Thr Ser Lys Lys Gly Ala His Glu Lys Leu Leu 

15 10 15 

Thr Glu Phe Glu Lys Gly Asn Gly Asp He Leu Leu Gly Thr Gin Met 

20 25 30 

He Ala Lys Gly Leu Asp Tyr Pro Asn He Thr Leu Val Gly Val Leu 

35 40 45 

Asn Ala Asp Thr Met Leu Asn Leu Pro Asp Phe Arg Ala Ser Glu Arg 

50 55 60 

Thr Tyr Gin Leu Leu Thr Gin Val Ala Gly Arg Ala Gly Arg His Glu 
65 70 75 80 

Lys Ala Gly Gin Val He He Gin Thr Tyr Asn Pro Asp His Tyr Ser 

85 90 95 

He Leu Asp Cys Ser Lys Lys 



100 



(2) INFORMATION FOR SEQ ID NO:364: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 65 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 64: 

Met Asn Leu Asp Ser Gin lie Leu Val Lys Tyr Lys Ser Glu Pro Gly 

15 10 15 

Leu Leu Gin Ala Xaa Gin Phe Leu Asp Asp Tyr Tyr His Glu Lys Phe 

20 25 30 

lie Lys Glu Lys Leu Ala Leu Lys lie Asp lie Gly Xaa Thr Asp Asp 

35 40 45 

Asp Val Thr Leu Leu lie lie Arg Asn Lys Xaa Lys Tyr Cys Thr Ser 
50 55 60 

lie 
65 



(2) INFORMATION FOR SEQ ID NO:365: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 



Met Lys Val Leu Asn Leu Gly Ser Lys Lys Gin Ala Ser Phe Tyr Val 

1 5 10 15 

Ala Cys Glu Leu Tyr Lys Glu Met Ala Phe Asn Gin His Cys Lys Leu 

20 25 30 

Gly Leu Ala Thr Gly Gly Thr Met Thr Asp Leu Tyr Glu Gin Leu Val 

35 40 45 

Lys Leu Leu Asn Lys Asn Gin Leu Asn Val Asp Asn Val Ser Thr Phe 

50 55 60 

Asn Leu Asp Glu Tyr Val Gly Leu Thr Ala Ser His Pro Gin Ser Tyr 
65 70 75 80 

His Tyr Tyr Met Asp Asp Met Leu Xaa Lys Gin Tyr Pro Tyr Phe Xaa 
85 90 95 
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Arg Lys Asn lie His lie Pro Asn Gly Asp Ala Asp Asp Met Asn Ala _ 

100 105 110 

Glu Ala Ser Lys Tyr Asn Asp Val Leu Glu Gin Gin Gly Gin Arg Asp 

115 120 125 

He Gin He Leu Gly He Gly Glu Asn Gly His He Gly Phe Asn Glu 

130 135 140 

Pro Gly Thr Pro Phe Asp Ser Val Thr His He Val Asp Leu Thr Glu 
145 150 155 160 

Ser Thr He Lys Ala Asn Ser Arg Tyr Phe Lys Asn Glu Asp Asp Val 

165 170 175 

Pro Lys Gin Ala He Ser Met Gly Leu Ala Asn He Leu Gin Ala Lys 

180 185 190 

Arg He He Leu Leu Ala Phe Gly 
195 200 

(2) INFORMATION FOR SEQ ID NO: 366 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:366: 

Met Met Leu Phe Tyr Gin Cys Tyr Leu Lys Gin Gin Leu Gin Xaa He 

15 10 15 

Ala Leu Pro Val Ser Ala Gly He Gly Gly He Lys Glu Leu Thr Ser 

20 25 30 

Leu Ala Xaa He Leu Asn Gly Val He He Tyr Ala Leu Gly Asn Lys 

35 40 45 

Phe Xaa Lys Leu Phe Arg He Thr Asn Pro He Ala Arg Gly Leu Ala 

50 55 60 

Leu Gly Thr Ser Gly His Thr Leu Gly Val Ala Pro Ala Lys Glu Leu 
65 70 75 80 

Gly Pro Val Glu Glu Ser Met Ala Ser He Ala Leu Val Leu Val Gly 

85 90 95 

Val Val Val Val Ala Val Val Pro Val Phe Val Ala He Phe Phe 
100 105 110 



(2) INFORMATION FOR SEQ ID NO:367: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 287 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

Met Asn Ala Gly Thr Gin Leu Tyr His lie Val Ala Ser Asn Ala Tyr 

15 10 15 

His Lys Asp Thr Glu Asp Phe Tyr lie Ser Leu Lys lie Val Asp Val 

20 25 30 

Lys Gin Pro Glu Gly Asp Gin Arg Val Tyr Arg Thr Ser Thr Tyr Asp 

35 40 45 

Leu Thr Thr Asp Glu He Ser Lys Val Lys Gin Ala Phe He Asn Ala 

50 55 60 

Asn Arg Asp Val He Thr Leu Ala Glu Gly Asp He Ser Val Thr Asn 
65 70 75 80 

Thr Pro Asn Gly Ala Asn Val Ser Thr He Thr Val Asn He Asn Lys 

85 90 95 

Gly Arg Leu Thr Lys Ser Phe Ala Ser Asn Leu Ala Asn Met Asn Phe 

100 105 110 

Leu Arg Trp Val Asn Phe Pro Gin Asp Tyr Thr Val Thr Trp Thr Asn 

115 120 125 

Ala Lys He Ala Asn Arg Pro Thr Asp Gly Gly Leu Ser Trp Ser Asp 

130 135 140 

Asp His Lys Ser Leu He Tyr Arg Tyr Asp Ala Thr Leu Gly Thr Gin 
145 150 155 160 

He Thr Thr Asn Asp He Leu Thr Met Leu Lys Ala Thr Thr Thr Val 

165 170 175 

Pro Gly Leu Arg Asn Asn He Thr Gly Asn Glu Lys Ala Gin Ala Glu 

180 185 190 

Ala Gly Gly Arg Pro Asn Tyr Arg Thr Thr Gly Tyr Ser Gin Ser Asn 

195 200 205 

Ala Thr Thr Asp Gly Gin Arg Gin Phe Thr Leu Asn Gly Gin Val He 

210 215 220 

Gin He Leu Asp He He Asn Pro Ser Asn Gly Tyr Gly Gly Gin Pro 
225 230 235 240 

Val Thr Asn Ser Asn Thr Arg Ala Asn His Ser Asn Ser Thr Val Val 
245 250 255 
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Asn Val Asn Glu Pro Ala Ala Asn Gly Ala Gly Ala Phe Thr He Asp 

260 265 270 

His Val Val Lys Ser Asn Ser Thr His Asn Ala Ser Asp Ala Val 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 368: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

Met Asn Phe Pro Trp Asp Gin Leu Leu Val Lys Gly Asn Trp Met He 

15 10 15 

Thr Met Ala Gin He Gly Ala Pro Phe Leu Val He Gly Leu He Ala 

20 25 30 

Val He Thr Tyr Phe Lys Leu Trp Lys Tyr Leu Tyr Lys Glu Trp Gly 

35 40 45 

Ser Pro Ser Val Arg Pro He Lys Lys Asn Arg 
50 55 

(2) INFORMATION FOR SEQ ID NO: 369: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

Met Thr Ala He Leu His Asn Lys Tyr His Lys Val Val Leu Pro Pro 

1 5 10 15 

Tyr Leu Gly Phe Phe Gly Gly Xaa Arg Phe Val Pro He Val Thr Ala 

20 25 30 

Phe Ala Ala He Phe Leu Gly Val Leu Met Phe Phe He Trp Pro Ser 
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35 40 45 

lie Gin Ala Gly lie Tyr His Val Gly Gly Phe Val Thr Lys Thr Gly 

50 55 60 

Ala lie Gly Thr Phe Val Tyr Gly Phe lie Leu Arg Leu Leu Gly Pro 
65 70 75 80 

Leu Gly Leu His His lie Phe Tyr Leu Pro Phe Trp Gin Thr Ala Leu 
85 90 95 

Gly Gly Thr Arg 
100 



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

Met Leu Ala Xaa Met Cys Gly Phe Leu Phe Leu Ala Ser Asp Pro Ala 

1 5 10 15 

Lys Asp Gly Gly Phe Leu Ser Ala Phe Met Gly Thr Lys Gly Leu Leu 

20 25 30 

Thr Ala Phe Leu Ser Ala Phe Val Thr Val lie Val Tyr Asn Phe Cys 

35 40 45 

Val Lys Arg Asn lie Thr lie Lys Met Pro Lys Glu Val Pro Pro Asn 

50 55 60 

lie Ser Gin Val Phe Lys Asp Leu lie Pro Phe Ser Ala Val lie lie 
65 70 75 80 

lie Leu Tyr Ala Leu Asp Lieu Val lie Arg Asn Ser Phe Lys Ser Asn 

85 90 95 

Val Ala Glu Gly He Leu Lys Leu Phe Glu Pro Leu Phe Thr Ala Ala 

100 105 110 

Asp Gly Trp He Gly Val Thr He He Phe Gly Ala Phe Ala Leu Phe 

115 120 125 

Trp Phe Val Gly He His Gly Pro Ser He Val Glu Pro Ala lie Ala 

130 135 140 

Ala He Thr Tyr Ala Asn He Glu Ala Asn Phe Lys Leu Leu Gin Ala 
145 150 155 160 

Gly Glu His Ala Asp Lys He He Thr Ser Gly Thr Gin Met Phe He 
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165 170 175 

Val Thr Phe Gly Gly Thr Gly Ala Thr Leu Val Xaa Pro Phe Met Phe 

180 185 190 

Met Leu Asp Asp Glu He 
195 

(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

Met Asn Arg Asn Gly He Xaa Ser Arg Xaa Pro Asp Gin Pro Gin Ser 

15 10 15 

Ala Pro Lys Glu Gin Asn Ser Asp Ser Asn Asp Glu Glu Thr Val Thr 

20 25 30 

Xaa Lys Glu Arg Lys Ser Lys Val Thr Gin Leu Lys Pro Leu Thr Leu 

35 40 45 

Glu Xaa Lys Arg Lys Leu Arg Arg Lys Arg Gin Lys Arg He Gin Tyr 

50 55 60 

Ser Val He Thr He Leu Val Leu Leu He Ala Val He Leu He Tyr 
65 70 75 80 

Met Phe Ser Pro Leu Ser Lys He Ala Ala Cys Lys Tyr Lys Trp Lys 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 
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Met Tyr Thr Phe Ser Lys Xaa Asn Ala lie Xaa Asp Leu Glu Glu Asp 

1 5 10 15 

Pro Leu Leu Xaa Ser Val Glu lie His Lys Gin Leu Pro Xaa Thr Leu 

20 25 30 

Asn Val Asp lie Thr Glu Asn Glu lie lie Ala Leu Ser Glu Lys lie 

35 40 45 

Xaa Val Asn lie Tyr Leu Tyr 
50 55 

(2) INFORMATION FOR SEQ ID NO:373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

Met Asp Gly Phe Lys Gly Thr Gin Glu Asp Asp Met lie Lys Ala Leu 

1 5 10 15 

Ser Glu Met Thr Pro Glu Val Xaa Arg Tyr lie Ala Glu Val Thr Tyr 

20 25 30 

Ala Pro Ser Lys Thr Lys His Ser Arg lie Glu Leu Phe Xaa Thr Asp 

35 40 45 

Gly Leu Gin Val He Gly Asp He Ser Thr He Ser Lys Lys Met Lys 

50 55 60 

Tyr Tyr Pro Gin Met Ser Gin Ser Leu Ser Arg Asp Arg Ser Gly Lys 
65 70 75 80 

Leu Lys Thr Arg Gly Tyr He Asp Leu Ser Val Gly Ala Ser Phe He 

85 90 95 

Pro Xaa Arg Gly Asn Thr Ser Ser Gin Ser Glu Ser Asp Lys Asn Val 

100 105 110 

Thr Lys Ser Ser Gin Glu Glu Asn Gin Xaa Xaa Glu Glu Xaa Gin Ser 

115 120 125 

Val Leu Xaa Lys Xaa Xaa Lys Gin Ser Ser Lys Asn Asn 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 374 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

Met Ser Thr Leu Val Gly Thr Val Leu Ser Gly Phe Glu Tyr Arg Ala 

15 10 15 

His Lys Glu Glu Leu Asp Asn Leu Thr Glu Val Leu Lys Glu Tyr Lys 

20 25 30 

Ser Lys Tyr Lys Tyr Thr Gly Tyr Thr Glu Asn Ala lie Met Lys Thr 

35 40 45 

Gin Asn Ser Gly Phe Arg Asn Glu Tyr Tyr Tyr Leu Thr Ala lie Pro 

50 55 60 

Tyr Thr Leu Asp Glu Tyr Lys Arg Tyr Phe Gin Pro Leu lie Lys Glu 
65 70 75 80 

Asp Asp Lys Ser Phe Arg Asp Gly Met Arg Asn Ser Lys Lys Gin Leu 

85 90 95 

Lys Asp Lys Ser Arg Pro Tyr Val Val Thr Thr Leu Phe Ser Thr Lys 

100 105 110 

Asp Asn Phe Thr Lys Asp Asn Thr lie Asp Glu Met lie Asp Phe Ser 

115 120 125 

Glu Val Ser Cys Leu Lys Lys Lys Lys Asn lie Pro His Asp Val 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Met Ser Thr Leu Val Gly Thr Val Leu Ser Gly Phe Glu Tyr Arg Ala 

1 5 io 15 

Gin Lys Glu Lys Tyr Asp Asn Leu Tyr Lys Phe Leu Lys Glu Asn Glu 
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20 25 30 

Lys Lys Xaa Xaa Tyr Thr Gly Phe Thr Lys Glu Ala lie Asn Lys Thr 

35 40 45 

Gin Asn Ser Gly Tyr Glu Asn Glu Tyr Phe Tyr lie Xaa Ala Asn Xaa 

50 55 60 

Pro Thr Leu Gin Glu Tyr Arg Lys Tyr Tyr Glu Pro Leu lie Lys Lys 

65 70 75 80 

Asn Xaa Leu Asn Phe Lys Lys Gly Met Lys Gin Ala Arg Glu Arg Ser 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 376: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

Met Asp Arg Val Thr Gin Phe Asn Val Leu Ala Xaa Ser Gly Arg Ala 

15 10 15 

Val Glu Val Cys Gly Asp Val Asp Val Met lie Leu Asp Lys Thr Gly 

20 25 30 

Thr lie Thr Tyr Gly Asn Arg lie Ala Ser Glu Phe Leu Pro Val Asn 

35 40 45 

Gin Gin Met Leu Glu Lys Leu lie Val Ala Ala Tyr Met Ser Ser lie 

50 55 60 

Tyr Asp Asp Thr Pro Glu Gly Lys Ser lie Val Arg Leu Ala Lys Gin 
65 70 75 80 

Met Tyr lie Asn Glu Arg Pro Lys Asp lie Asp Gly Thr Tyr Lys Pro 
85 90 95 

Phe Tyr Ser 



(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met Lys Tyr Gin Pro Leu Ser Tyr Lys Glu He Glu Ala Val Val His 

15 10 15 

Lys Gly Glu Thr Val Pro Ala Gly Val Thr Arg Phe Asn He Ser Gly 

20 25 30 

Arg Cys Leu Asn Leu Gin Val Pro Leu Ala Leu Leu Lys Gin Asp Asp 

35 40 45 

Asp Val Glu Gin Leu Arg Asn Trp Lys Gin Phe Leu Ala Asp Lys Phe 

50 55 60 

Ala Asn Met Arg Cys Tyr Thr Glu Lys Val Tyr Leu Val Glu Gin 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) .LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



Met Lys Glu His Gly Arg His Tyr Leu Asp Asn His Asn Tyr Lys Asp 

1 5 10 15 

He His Met Pro Asn Asn Thr Pro Val Gly Phe Trp He Gly He Phe 

20 25 30 

Met Thr lie Gly Gly Phe Phe Leu He Phe Glu Thr Val He Pro Ala 

35 40 45 

Leu He Cys Leu Phe Gly He Phe Gly Thr Met He Tyr Arg Lys Xaa 

50 55 60 

Pro Lys Ser Asp Xaa Gly He Ser Tyr Pro Ser Cys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 379: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 



Met Ser His Asp Thr 

1 5 
Leu Asn Lys Leu Gly 
20 

Phe Gly Thr Leu Phe 
35 

Tyr Ala Gly Lys Met 
50 

lie Met Thr Phe Ala 
65 

lie Tyr Tyr Met Arg 
85 

lie lie Thr L€u Leu 
100 

Glu Phe Ala His Leu 
115 



Asn Thr lie Asp Ser Arg Thr His Glu Gly Glu 

10 15 
Phe Trp lie Phe He Thr Ala Glu Phe Ala Leu 

25 30 
Ala Thr Leu Leu Thr Leu Gin His Gly Gly Asp 

40 45 
Thr Thr Glu Leu Phe Glu Leu Pro Leu Val Leu 

55 60 
Leu Leu Phe Ser Ser Tyr Thr Cys Gly He Ala 
70 75 60 

Gin Glu Lys Gin Lys Leu Met Met Phe Trp Met 

90 95 
Leu Gly Leu Val Phe Val Gly Phe Glu He Tyr 

105 110 
Cys He Arg Arg Arg 
120 



(2) INFORMATION FOR SEQ ID NO: 380: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 : 



Met Ala Glu Asn Lys Asn Asn Leu Ser He Asn Asp Asp His Ser Asn 

1 5 10 15 

Ala Ala Met Thr His Thr Ser Asp Ala He Ala Ser Ser Asp Phe He 

20 25 30 

He Arg Glu Leu Asp Leu Asn Gin Glu Pro Glu Met Gin Arg Glu Ser 
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35 40 45 

Lys Asn Phe Gly Gin Asp Ala . Tip Ala Gin Leu Lys Arg Asn Lys Leu 

50 55 60 

Ala Val Val Gly Met lie Gly Leu lie lie He Val He Phe Ala Phe 
65 70 75 80 

He Gly Pro Val He Asn Lys His Asp Tyr Ala Glu Gin Asn Val Glu 

85 90 95 

His Arg Asn Leu Pro Ala Lys He Pro Val Phe Arg Gin Ser Ser He 

100 105 110 

Phe Thr Phe Trp Met Val Lys Met Gin Asp Gly Lys Asp Cys Leu Leu 

115 120 125 

Lys Gin Gin Met Leu Lys Lys lie He Gly Leu Gly Tyr 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

Met Leu Asp He Phe He Gly Val Val Tyr Gly Ala He Ser Gly Phe 

15 10 15 

Phe Gly Gly Arg Val Asp Thr He Met Gin Arg He Leu Glu Val He 

20 25 30 

Ala Ser He Pro Asn Leu He Val Val He Leu Phe Val Leu He Phe 

35 40 45 

Glu Pro Ser He Trp Thr He He Leu Ala Met Ser He Thr Gly Trp 

50 55 60 

Leu Gly Met Ser Arg Val Val Arg Gly Glu Phe Leu Lys Leu Lys Asn 
65 70 75 80 

Gin Glu Phe Val Met Ala Ser Lys Thr Leu Gly Ala Ser Lys Phe Lys 

85 90 95 

Leu He Phe Lys His lie Leu Pro Asn Thr Leu Gly Ala He Val Val 

100 105 HO 

Thr Ser Met Phe Thr Val Ser Cys His Phe Leu Arg Ser He Phe Lys 

115 120 125 

Phe His Trp Tyr Arg Xaa Thr Cys Thr Ser Asn He Val Arg Val He 
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130 135 140 

Ser Lys 
145 



(2) INFORMATION FOR SEQ ID NO: 382: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 



Met Asn Asn Ser Gly Phe Phe Lys 

1 5 
Gin Met Asn Gin Ala Pro Ser Leu 
20 

Ala Lys Ala Gly lie Glu Met Met 

35 40 
Thr Val Ala Pro Tyr 
50 



Gin Lys Ser Arg Leu Phe Trp Thr 

10 15 
Ala Lys Glu Asp Glu Lys Glu Gly 
25 30 
Lys Asn Asn Tyr Lys Glu lie Met 
45 



(2) INFORMATION FOR SEQ ID NO:383: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : S ing 1 e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



Met Gin Ala Phe Trp Pro Gly Pro 

1 5 
Pro Gly Tyr Leu Cys Arg Lys Val 
20 

Val Arg Met Pro Ser His Ser Val 
35 40 



lie Ser Phe lie Leu Pro Leu Lys 

10 15 
Ser Gly Gly Leu Ser Ser Val Ala 
25 30 
Gly Arg Gin Leu Leu Gin lie lie 
45 
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Asn Glu Pro Leu Ala Ala Pro Ser Ala Asn Leu Ser Gly Arg Pro Ser 

50 55 . 60 

Pro Thr Thr Phe Asn His Val Tyr Gin Asp Leu Asn Ala Arg lie Asp 
65 70 75 80 

Gly lie Val Gin Ala Glu Gin Ser Glu Glu Gly Leu Glu Ser Thr Val 

85 90 95 

Leu Asp Cys Thr Ser Phe Pro Tyr Lys He Ala Arg Pro Gly Ser He 

100 105 110 

Thr Ala Ala Met He Thr Glu He Leu Pro Asn Ser He Ala His Ala 

115 120 125 

Asp Tyr Asn Asp Thr Glu Gin Ala He Ala Pro Gly Met Lys Tyr 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Met Pro Thr Pro Asp Glu Val His He Thr Val Gly Thr Val Arg Tyr 

1 5 10 15 

Gin Ala His Gly Arg Glu Arg Lys Gly Val Cys Ser Val His Phe Ala 

20 25 30 

Glu Arg He Lys Pro Gly Asp He Val Pro He Tyr Leu Xaa Lys Asn 

35 40 45 

Pro Asn Phe Lys Phe Pro Met Lys Gin Asp He Pro Val He Met He 

50 55 60 

Gly Pro Gly Thr Gly He Ala Pro Phe Arg Ala Tyr Leu Gin Glu Arg 
65 70 75 80 

Glu Glu Leu Gly Met Thr Gly Lys Thr Trp Leu Phe Phe Gly Asp Gin 

85 90 95 

His Arg Ser Ser Asp Phe Leu Tyr Glu Glu Glu He Glu Glu Trp Leu 

100 105 HO 

Glu Asn Gly Asn Leu Thr Arg Ser Cys Leu 



115 



120 



(2) INFORMATION FOR SEQ ID 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 



Met Asn Xaa Leu Arg Arg Gin Leu Ser Met Asn Ser Leu Leu lie Asp 

1 5 10 15 

Thr Ser Asn Gin Pro Leu Ser Val Ala Leu Met Gin Asn Asp Glu Val 

20 25 30 

Leu Ala Glu lie Thr Thr Asp Ser Lys Gin Asn His Ser Val Gin Leu 

35 40 45 

Met Pro Ala lie Ser Gin Leu Phe Glu Gin Ser Lys He Ala Lys Gin 

50 55 60 

Gin Leu Asp Ala He He Val Ala Glu Gly Pro Gly Ser Tyr Thr Gly 
65 70 75 80 

Leu Arg He Gly Val Thr Val Ala Lys Thr Leu Ala Tyr Ala Leu Asp 

85 90 95 

Val Lys Leu Tyr Gly Val Ser Ser Leu Lys Ala Leu Ala Ala Thr He 

100 105 110 

Asp His Thr Asp Lys Leu Leu Gly Ser Cys Val Trp Met Gin Arg Arg 

115 120 125 

Gin Ala Gly Leu Phe Arg His He Pro Gly Gly Lys Met Gly Asn 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 386: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

Met Ala He Lys Ala Glu Glu He Ser Ala Leu Leu Arg Ser Gin He 
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1 5 10 15 

Glu Asn Tyr Glu Ser Glu Met .Ser Val Thr Asp Val Gly Thr Val Leu 

20 25 30 

Gin He Gly Asp Gly He Ala Leu He His Gly Leu Asn Asp Val Met 

35 40 45 

Ala Gly Glu Leu Val Thr Phe His Asn Gly Val Leu Gly Leu Ala Gin 

50 55 60 

Asn Leu Glu Glu Ser Asn Val Gly Val Val lie Leu Gly Pro Tyr Thr 
65 70 75 80 

Gly He Thr Glu Gly Asp Glu Leu Xaa Thr Tyr Trp Val Val Ser Trp 

85 90 95 

Glu Val Pro Val Gly Glu Glu Leu He Arg Lys Ser Cys Xaa Ser He 

100 105 110 

Arg Thr Thr Tyr 
115 

(2) INFORMATION FOR SEQ ID NO: 387: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 87: 

Met Lys Ser Ala Glu Gin Trp He Asp Glu Leu Gin Leu Glu Ser His 

15 10 15 

Pro Glu Gly Gly Phe Tyr Arg Glu Thr He Arg Glu Val Leu Lys Asp 

20 25 30 

Gly Arg Arg Ala Pro Phe Xaa Ser He Tyr Phe Leu Leu Thr Asp Asp 

35 40 45 

Asn He Ser His Phe His Arg He Asp Ala Asp Glu Val Trp Tyr Tyr 

50 55 60 

His Ala Gly Asp Ser Leu Thr He His Met He Asn Pro Asp Gly Glu 
65 70 75 80 

Tyr Thr Thr Ala Thr Leu Gly Thr Asp He Gin Asn Gly Asp Val Leu 

85 90 95 

Gin Tyr Val Val Pro Lys Gly Thr He Phe Ala Ser Ser He Glu He 

100 105 110 

Ser Asn Thr Tyr Ser Leu Val Gly Cys Met Cys Gin Pro Ala Xaa Glu 
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115 120 125 

Phe Lys Gin Phe Glu Leu Phe Lys Gin Ser Glu Leu He Thr Gin Tyr 

130 135 140 

Pro His Leu Lys Ser Val He Xaa Lys Tyr Ala Leu Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO:388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Tyr Glu Arg Lys Leu Val Met Gin Lys Ser Ala Leu Phe Ala Pro 

15 10 15 

Glu Lys Tyr Asn He He Ser Glu He Glu Lys Tyr Ser His He Pro 

20 25 30 

Asp Lys Lys Ala He Leu Tyr His Asn Thr Glu Gly Glu Asp He Ser 

35 40 45 

Val Thr Tyr Gin Gin Leu He Glu Gin Ser Asn Lys Val Gly Asn Val 

50 55 60 

Leu Ala Ser His Gly Leu Ser Lys Gly Asp Lys Val Leu He Met Met 
65 70 75 80 

Pro Arg Ser He Ala Thr Tyr Glu Leu Tyr He Ala Ala Leu Lys Leu 

85 90 95 

Gly Val Ala He He Pro Cys Ser Glu Met Leu Arg Thr Lys Asp Leu 

100 105 HO 

Gin Tyr Arg He Thr His Gly Glu He Asn Ala Val He Ala Leu Glu 

115 120 125 

Pro Phe Thr Val Glu Phe Glu Lys He Lys Glu Tyr Asp Ala Leu Xaa 

130 135 140 

Lys Phe Val He Ala Gly Gin Lys Asp Gly Trp He Ser Leu Glu Ala 
145 150 155 • 160 

Glu Lys Glu Asn Ala Ser Asn Lys Leu Glu Met Ala Asp Thr Thr Arg 

165 170 175 

Asp Asp Met Ala Xaa Leu Ser Tyr Thr Ser Gly Thr Thr Gly Asn Pro 

180 185 190 

Lys Ala Arg Tyr Thr Leu Ser Trp Met Gly He Cys Thr Phe Thr Lys 
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195 200 205 

Trp His Gin Asn He Gly Tyr Val Leu Met Lys Xaa Asp Leu Val Trp 

210 215 220 

Ala Thr Ser Ser Xaa Arg Leu Ala Lys Asn Gly 
225 230 235 

(2) INFORMATION FOR SEQ ID NO:389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Gin Met He Glu Pro Lys Phe Val Val Thr Ala Val Met Leu Asn 

15 10 15 

He Phe Ser Ala Leu He He Ala Ser Val He Asn Pro Tyr Lys Ser 

20 25 . 30 

Asp Asp Ser Asp Val Glu He Asp Xaa Leu Thr Lys Ser Thr Glu Thr 

35 40 45 

Lys Ser He Glu Trp Lys Asn Arg Lys Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO:390: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met He Gly Asp Ser Ala Met Asp Gly Phe Lys He Ala Val Val Val 

15 10 15 

Ala Val Met Leu Leu Ala Phe He Ser Leu Met Glu Ala He Asn lie 
20 25 30 
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Met Phe Gly Ser Val Gly Leu Xaa Xaa Lys Gin Leu He Gly Tyr Val 

35 40 45 

Phe Ala Pro He Ala Phe Leu Met Gly He Pro Trp Ser Glu Ala Val 

50 55 60 

Pro Ala Gly Ser Leu Met Ala Thr Lys Leu He Thr Asn Glu Phe Val 
65 70 75 80 

Ala Met Leu Asp Phe Lys Asn Val Leu Gly Asp Val Ser Ala Arg Thr 

85 90 95 

Gin Gly He He Ser Val Tyr Leu Val Ser Phe Ala Asn Val Gly Thr 

100 105 Ho 

Val Gly He He Val Gly Ser Xaa Gin Gly Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

Met Ser Leu Ala He Val He Gly Ser Glu Gly Gin Gly Met Ser Arg 

15 10 15 

Leu Val Ser Asp Lys Cys Asp Phe Tyr He Lys He Pro Met Val Gly 

20 25 30 

His Val Asn Ser Leu Asn Ala Ser Val Ala Ala Ser Leu Met Met Tyr 

35 40 45 

Glu Val Phe Arg Lys Arg His 
50 55 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 81 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 92: 

Met Phe Leu Leu Ser Met Ser Leu Ala Pro Asn Phe Lys Thr Tyr Gly 

15 10 15 

Phe Trp Asn Arg Val Gly Leu Gly Thr Leu Val Thr Asp Glu Thr Phe 

20 25 30 

Gly Val Ala lie Thr Pro Tyr Leu Lys Gly Glu Ala lie Asn Asp Arg 

35 40 45 

Trp Met His Gly Leu Asn lie Thr Ala Tyr Leu Phe Gly Asp Asn Phe 

50 55 60 

Met Cys Ser Trp Gly Phe lie Leu Ala Asn lie Ser Gin He Arg Lys 
65 70 75 80 

Arg 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 

Met Arg Arg Ser Val Lys Gin He Thr Glu He Thr Ala Ala Gin Arg 

15 10 15 

Val He Asp Asn Gly Asp Ala Thr Ala Gin Gin He Ser Asp Glu Lys 

20 25 30 

His Arg Val Asp Asn Ala Leu Thr Ala Leu Asn Gin Ala Lys His Asp 

35 40 45 

Leu Thr Ala Asp Thr His Ala Leu Glu Gin Ala Val Gin Gin Leu Asn 

50 55 60 

Arg Thr Gly Thr Thr Thr Gly Lys Lys Pro Ala Ser He Thr Ala Tyr 
65 70 75 80 

Asn Asn Ser He Arg Ala Leu Gin Ser Asp Leu Thr Ser Xaa Lys Asn 

85 90 95 

Ser Ala Asn Ala He He Gin Lys Pro He Arg Thr Val Gin Glu Val 
100 105 HO 

His Leu Arg 
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(2) INFORMATION FOR SEQ ID NO: 394: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE : amino acid 

CO STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 

Met Thr Gin Ser Ser lie Gin Ala Tyx Glu Asn Ala Lys Arg Ala Gly 

15 10 15 

Gin Xaa Glu Xaa Thr Asn Ala Gin Asn Val lie Asn Asn Gly Ala Arg 

20 25 30 

Ala Thr Asp Gin Gin lie Ala Ala Glu Lys Thr Lys Val Glu Glu Lys 

35 40 45 

Tyr Asn Ser Leu Lys Gin Ala lie Ala Gly Leu Thr Pro Asp Leu Ala 

50 55 60 

Pro Leu Gin Thr Ala Lys Thr Gin Leu Gin Asn Asp lie Asp Gin Pro 
65 70 75 80 

Thr Ser Thr Thr Gly Met Thr Ser Ala Ser Val Ala Ala Phe Asn Asp 

85 90 95 

Lys Leu Ser Ala Ala Arg Thr Lys lie Gin Glu lie Asp Arg Val Leu 

100 105 110 

Ala Ser His Pro Asp Val Ala Thr He Arg Gin Asn Val Thr Ala Ala 

115 120 125 

Asn Ala Ala Lys Thr Ala Leu Gly Ser Ser Ala Gin Met Pro 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 395: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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Met Thr Gin Leu Asp Asn His Trp Met Ala Met Phe Xaa Asp Met Asn 

1 5 10 15 

Phe Asp Leu Gly lie Met Ala Leu Tyr Ala lie Glu Ser Asp Pro Leu 

20 25 30 

Ala Asn Xaa Pro Arg Asn Ser Asp Ala Glu lie Ala Val Xaa Asp Glu 

35 40 45 

Ser His lie Asp Ala Tyr 
50 

(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:396: 

Met Val Arg Glu His Tyr Gin Lys Asp Val lie Lys Arg Leu Val Ala 

15 10 15 

Tyr Leu Asn Asn Glu Pro lie Gly Val Val Asp Val lie Glu Ser Glu 

20 25 30 

Asn Tyr He Glu Leu Asp Gly Phe Gly Val Leu Glu Gin Phe Arg His 

35 40 45 

Gin Gly He Gly Ser Thr He Gin Ser Val Asp Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 
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Met Tyr Ala Lys Gin Gly Tyr .Val Tyr Gin Ser Xaa Leu Leu Ser Xaa 

15 10 is 

He Lys Arg Arg Tyr Trp Lys Leu Xaa He Asp Asp Arg Phe Val Leu 
20 25 30 

Glu 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

Met Ala Tyr He Ser Leu Xaa Tyr His Ser Pro Thr He Gly Met His 

1 5 10 15 

Gin Asn Leu Xaa Val He Leu Pro Glu Asp Gin Ser Phe Phe Asn Asn 

20 25 30 

Asp He Xaa Ala Lys Pro Leu Lys Thr Leu Met Leu Leu His Gly Leu 

35 40 45 

Ser Ser Asp Glu Thr Thr Tyr Met Arg Tyr Thr Ser He Glu Arg Tyr 

50 55 60 

Ala Asn Glu His Lys Leu Ala Val He Met Pro Asn Val Asp His Ser 
65 70 75 80 

Ala Tyr Ala Asn Met Ala Tyr Gly His Ser Tyr Tyr Asp Tyr He Leu 

85 90 95 

Glu Val Tyr Asp Tyr Val His Gin He Phe Pro Leu Ser Lys Lys Arg 

100 105 110 

Asp Asp Asn Phe He Ala Gly His Ser Met Gly Gly Tyr Gly Thr He 

115 120 125 

Lys Phe Ala Leu Thr Gin Gly Asp Lys Phe Ala Lys Ala Val Pro Leu 

130 135 140 

Ser Ala Val Phe Glu Ala Gin Asn Leu Met Asp Leu Glu Trp Asn Asp 
145 150 155 160 

Phe Ser Lys Glu Ala He He Gly Asn Leu Ser Ser Cys 
165 170 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



( i i ) ' MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 



Met Leu Val Leu Ser Gly Cys Gly 

1 5 
Ser Asp Asp Asn Gly Ser Ser Lys 
20 

Leu Met Pro Ser Asn Thr Glu lie 

35 40 
Tyr He Val Gly Val Phe Asn Gly 
50 55 



Met Lys Asp Asn Asp Lys Gin Gly 

10 15 
Ser Pro Tyr His Arg Xaa Val Ser 
25 30 
Leu Tyr Glu Leu Gly Leu Gly Lys 
45 



(2) INFORMATION FOR SEQ ID NO:400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



Met Lys Trp Phe Lys Xaa Trp Asp 

1 5 
Asp Leu Val Pro Asp Leu Thr Asp 
20 

Arg Met Val Trp Thr Arg He Leu 

35 40 
Arg Gin Gly Arg Leu Gly Phe Tyr 

50 55 
Ser Gin Leu Ala Ser Gin Tyr Ala 
65 70 



Glu Asn Gly Asn Val Val Asn Glu 

10 15 
Glu Gin Leu Val Glu Leu Met Glu 
25 30 
Asp Gin Arg Ser He Ser Leu Asn 
45 

Ala Pro Thr Ala Gly Gin Glu Ala 
60 

Leu Glu Lys Glu Asp Tyr He Leu 
75 80 
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Pro Gly Tyr Arg Asp Val Pro Gin lie lie Trp Ala Trp Phe Thr lie _ 
85 90 95 

Asn 



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

Met Thr Tyr Arg Tyr Gly Pro His Thr Met Ala Gly Asp Asp Pro Thr 

1 5 10 15 

Arg Tyr Arg Thr Ser Asp Glu Asp Ala Glu Trp Glu Lys Lys Asp Pro 

20 25 30 

Leu Val Arg Phe Arg Lys Phe Leu Glu Asn Lys Gly Leu Trp Asn Glu 

35 40 45 

Asp Lys Glu Asn Glu Val lie Glu Arg Ala Lys Ala Asp lie Lys Ala 

50 55 60 

Ala lie Lys Glu Ala Asp Asn Thr Glu Lys Gin Thr Val Thr Ser Leu 
65 70 75 80 

Met Glu lie Met Tyr Glu Asp Met Pro Gin Asn Leu Ala Glu Gin Tyr 

85 90 95 

Glu lie Tyr Lys Glu Lys Glu Ser Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
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Met Leu Leu His Ser Gin Ser Phe Val Val He Val He Gly He Phe 

15 10 15 

He Leu Gly Phe Phe Leu Ser Thr Tyr Glu Ala Thr Met Pro Gly Ser 

20 25 30 

Leu Pro Thr Met Phe Tyr Ser His He Arg Tyr Arg Thr Leu Ser Val 

35 40 45 

Thr Phe Asn He Ser Val Ser He Phe Gly Gly Thr Thr Pro Leu Val 

50 55 60 

Ala Thr Trp Leu Val Thr Lys Thr Gly Asp Pro Leu Ala Pro Ala Tyr 
65 70 75 80 

Tyr Leu Thr Ala He Ser Val He Gly Phe Leu Val He Thr Phe Leu 

85 90 95 

His Leu Ser Thr Ala Gly Lys Ser Leu Lys Gly Ser Tyr Pro Asn Val 

100 105 110 

Asp Asn Glu Gin Asp Arg Ala Tyr Tyr Ala Glu His Pro Lys Glu Ala 

115 120 125 

Leu Trp Trp Val Lys Glu Arg Lys Asn 
130 135 

(2) INFORMATION FOR SEQ ID NO: 403 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: . 

Met Ala Lys Gin Asp Val He Glu Leu Glu Gly Thr Val Leu Asp Thr 

15 10 15 

Leu Pro Asn Ala Met Phe Lys Val Glu Leu Glu Asn Xaa His Glu He 

20 25 30 

Leu Ala His Val Ser Gly Xaa He Arg Asn Glu Leu Gin Phe Val Phe 

35 40 45 

Tyr Pro Xaa Asp Lys Val Thr Val Glu Met Ser Pro Tyr Asp Leu Thr 

50 55 60 

Xaa Gly Thr Ser Tyr Leu Ser Leu 
65 70 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 

Met Glu Val Thr Val Gly lie Val Lys Gly Lys Glu Phe Leu Xaa Gly 

1 5 10 15 

Arg Met Xaa Xaa Gin Gly Phe Leu Leu Asp Gly Phe Pro Arg Thr lie 

20 25 30 

Glu Gin Ala Glu Ala Leu Asn Asn lie Met Ser Glu Leu Asp Arg Asn 

35 40 45 

lie Asp Ala Val He Asn He Glu Val Pro Glu Glu Glu Leu Met Asn 

50 55 60 

Arg Leu Thr Gly Arg Arg He Cys Glu Ser Cys Gly Thr Thr Tyr His 
65 70 75 80 

Leu Val Phe Asn Pro Pro Lys Val Glu Gly He Cys Asp He Asp Gly 

85 90 95 

Val Xaa Xaa Tyr Gin Arg Glu Asp Asp Asn Pro Glu Xaa Val Ala Arg 
100 105 110 

Ala Val 

(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

Met Arg Thr Leu He Thr He Ala Ser Thr He He Gly Thr Phe Leu 
15 10 15 
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Ser lie Gly Leu Thr Glu Leu lie He Tyr Met Thr Asp Gly Lys Gly 

20 25 30 

He Lys Tyr Glu Thr Met Asn Phe Leu Ser Leu Pro Pro Lys Asp He 

35 40 45 

Phe Leu Ala Ser Val Leu He Gly Ser Leu Gly Ala He Met Asp Val 

50 55 60 

Ala He Thr He Ala Ser Gly Met His Glu He Xaa Gin Arg Thr Xaa 
65 70 75 80 

His He Ser Met Arg Arg Trp Ala Leu Ala Trp Ala Lys His Pro Thr 

85 90 95 

Arg Tyr Tyr Gly Asn Asn Asp 
100 

(2) INFORMATION FOR SEQ ID NO: 406: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Gin Ala Leu Arg Leu Met Phe Glu 

1 5 
Glu He Lys Asn Val Lys Asp Asp Val 

20 25 
Lys Thr Gly Cys Gly Arg Leu Gin Phe 
35 40 

(2> INFORMATION FOR SEQ ID 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



Ala Thr Glu Glu Thr Lys Gin 
10 15 
Asp Xaa Phe Glu Arg Lys Ser 
30 

Leu Asn 
NO: 407: 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
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Met Ser Arg Gin His His Arg Phe Leu Phe Phe lie Asn Lys Leu Pro 

15 10 15 

Gly lie Thr lie Lys Ser Leu Leu Glu He Leu Glu He Ser Lys Gin 

20 25 30 

Gly Ser His Ala Thr Leu Gin Lys Leu Lys Glu Gin Gly Leu He He 

35 40 45 

Glu Lys Val Leu Glu Thr Asp Arg Arg Val Lys Lys Leu Tyr Ser Thr 

50 55 60 

Asp Lys Gly Asp Gin Leu He Ala Glu Leu Asn Lys Ala Gin Asp Glu 
65 70 75 80 

Leu Leu Gin Asn He Tyr Gin Gin Val Gly Ser Asp Trp Tyr Asp Val 

85 90 95 

Met Glu Ala Leu Ala Lys Arg Arg Pro Gly Phe Asp Phe He Lys His 

100 105 110 

Leu Lys Asp Glu Lys Glu Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 : 



Met He His His Ser Asn Glu Tyr Asn He Asp He Thr Ala Gin Asn 

15 10 15 

He Asn Lys Tyr Thr Ala Leu Gin Tyr He Phe Asp Ala Asp Val Lys 

20 25 30 

Tyr He Ala Phe Gly 
35 



(2) INFORMATION FOR SEQ ID NO:409: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

490 



WO 97/30070 



PCT/US97/02318 



(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 409: 

Met Leu Val Gly Gly Tyr lie lie Gly Pro Ser Glu Ala Tyr Thr His 

1 5 10 15 

Ala He Leu Lys Leu Asp Lys He Lys Tyr He Asn Asn Asn Thr Gin 

20 25 30 

Ala He Cys Lys Val Leu Lys Ser Tyr Lys 



(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Met Gly Gly Ser He Thr He Met Ser Lys Phe Asp Asp Asn Gin Ser 

15 10 15 

Asp Phe He Pro Ala Trp Phe Ala Gly Asn He Trp Gly Asp Glu Cys 

20 25 30 

Pro Phe Val He Ala Phe Leu Pro Trp Val Leu He He He Pro Tyr 

35 40 45 

Leu Leu Phe Lys Ser Asn Thr Leu Asn He He His Thr Gly Asp Asn 

50 55 60 

He Ala Arg Gly Leu Gly Val Arg Leu Ser Arg Glu Arg Leu He Leu 
€5 70 75 80 

Phe Phe He Ala Val Met Leu Ser Ser Ala Ala Val Ala Val Ala Gly 

85 90 95 

Ser He Ser Phe He Gly Leu Met Gly Pro His He Ala Lys Arg He 

100 105 no 

Val Gly Pro Arg His Gin Leu Phe Leu Pro He Ala He Leu Val Gly 

115 120 125 

Ala Cys Leu Leu Val He Ala Arg Ala He Gly Gin Asn Leu Tyr Tyr 



35 



40 



130 



135 



140 
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Asn Gin Val Gly Phe Gin Gin Val Leu Ser 
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(2) INFORMATION FOR SEQ ID NO: 411: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

/ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

Met Asp Xaa Ser Lys Thr Ser Xaa Ala Ser Gly Gly Asp Cys Xaa lie 

1 5 10 15 

Thr lie lie Asn Xaa Asp Val Asp Lys Glu Xaa lie Tyr Asp Glu Trp 

20 25 30 

Thr Lys His Gly lie Lys Pro Leu Lys Phe Asn lie Tyr His Gly Gin 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 412: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 



Met Val Lys He Ala He Phe Ala Ser Gly Ser Gly Ser Asn Phe Glu 

15 10 15 

Asn He Val Glu His Val Glu Ser Gly Lys Leu Glu Asn He Glu Val 

20 25 30 

Thr Xaa Leu Tyr Thr Asp His Gin Asn Ala Phe Cys He Asp Arg Ala 

35 40 45 

Asn Lys His Asp He Pro Val Tyr He Asn Glu Pro Lys Gin Phe Asp 

50 55 60 

Ser Lys Ala Ala Tyr Glu Gin His Leu Val Thr Leu Leu Thr Lys Asp 
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65 70 75 80 

Lys Val Glu Trp lie lie Leu Ala Trp Leu His Ala Ser Tyr Ser Gly 
85 90 95 

Pro Gly Leu His 
100 



(2) INFORMATION FOR SEQ ID NO: 413: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 

Met Lys Trp His Glu Arg Arg lie lie Val Asp lie Arg Asp Lys Gin 

15 10 15 

Leu Arg Tyr Gin Asp lie Ala lie Leu Tyr Arg Asp Glu Ser Tyr Ala 

20 25 30 

Tyr Leu Phe Asp Ser lie Leu Pro Leu Tyr Asn lie Pro Tyr Asn lie 

35 40 45 

Asp Thr Lys Arg Ser Met Thr His His Pro Val Met Glu Met He Arg 

50 55 60 

Ser Leu He Glu Val He Gin Ser Asn Trp Gin Val Asn Pro Met Leu 
65 70 75 80 

Arg Leu Leu Lys Thr Asp Val Leu Thr Ala Ser Tyr Leu Lys Ser Ala 

85 90 95 

Tyr Leu Val Asp Leu Leu Glu Asn Phe Val Leu Glu Arg Gly He Tyr 

100 105 110 

Gly Lys Arg Trp Leu Asp Asp Glu Leu Phe Asn Val Glu His Phe Ser 

115 120 125 

Lys Met Gly Arg Lys Ala His Lys Leu His Arg Arg 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 414: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:414: 

Met Ser lie Glu lie Gin Ala Asp lie Glu Phe Lys Leu Asp Glu Met 

1 5 10 15 

Val Asp Cys Ala Ala Ala Lys Met Ala Ala He Ser Asn Pro Ala Thr 

20 25 30 

Asn Gly Gly He Ser Val Lys Met Lys Tyr Gly Thr Thr Asp Asp Val 

35 40 45 

Leu Lys Pro Phe Gly Lys Thr Ala Ser Cys Val 
50 55 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Met Thr His Leu Leu Glu Thr Phe Glu Met Ser He Asp His Gin Glu 

1 5 10 15 

Asp Gly Leu Val Val He Ser Met Pro Val Thr Asp Lys Val Lys Gin 

20 25 30 

Pro Phe Gly Tyr Leu His Gly Gly Ala Ser He Ala Leu Gly Glu Thr 

.35 40 45 

Ala Cys Ser Leu Gly Ser Ala Asn Leu He Asp Thr Thr Lys Phe He 

50 55 60 

Pro Leu Gly Leu Glu Met Asn Ala Asn His He His Ser Ala Lys Asp 
65 70 75 80 

Gly Arg Val Thr Ala Thr Ala Glu He He His Arg Gly Lys Ser Thr 

85 90 95 

His Val Trp Asp He Lys He Lys Asn Asp Xaa Glu Gin Leu lie Thr 

100 105 110 

Val Met Arg Gly Tyr Xaa Leu Leu Phe Lys Pro Leu Lys 
115 120 125 
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(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Met Leu Ala Tyr Gly Thr Gly Thr Phe Thr Gly Lys Asp Phe Leu Lys 

15 10 15 

Ala Gly lie Pro Leu Thr lie Val Gly Tyr lie Leu Val He Val Phe 

20 25 30 

Ser Met Thr Tyr Trp Lys Trp Leu Gly Leu Leu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met Ser Arg Gly He Ser Gin Arg Glu Ala Glu Arg Leu Val He Gin 

1 5 10 15 

Gly Phe Leu Asp Pro Val Val Arg Glu Leu Pro He Glu Asp Val Lys 

20 25 30 

Arg Gin Leu Arg Glu Val He Glu Arg Lys Val Ser Lys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 418: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 97 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met Asn His Asn Val lie He Val He Ala Leu He He Val Val He 

15 10 15 

Ser Met Leu Ala Met Leu He Arg Val Val Leu Gly Pro Ser Leu Ala 

20 25 30 

Asp Arg Val Val Ala Leu Asp Ala He Gly Leu Gin Leu Met Ala Val 

35 40 45 

Arg Ala Leu Phe Ser He Leu Leu Asn He Lys Tyr Met He Val Val 

50 55 60 

He Met Met He Gly He Leu Ala Phe Leu Gly Thr Ala Val Phe Ser 
65 70 75 80 

Lys Phe Met Asp Lys Gly Lys Val He Glu His Asp Gin Asn His Thr 
85 90 95 

Asp 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 419: 

Met Thr He Ala Met Thr He Trp Asn Ala He Trp Thr Phe Xaa Gin 

15 10 15 

Thr Leu Trp Asn Thr He Val Thr Val Ala Thr Lys Val Trp Asn Ala 

20 25 30 

He Thr Thr Ala He Ser Thr Ala Leu Gin Ala Ala Trp Ser Phe He 

35 40 45 

Ser Asn He Trp Asn Thr He Trp Ser Phe Leu Ser Ser He Leu Thr 
50 55 60 
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Thr lie Trp Asn Lys Val Val Ser lie Phe Thr Gin Val Val Ser Thr 
65 70 75 80 

lie Ser Asp Lys Met Ser Gin Ala Trp Asn Phe lie Val Thr Lys Gly 

85 90 95 

Met Gin Trp Val Ser Thr lie Thr Ser Thr Leu lie Asn Phe Val Asn 

100 105 110 

Arg Val lie Gin Gly Phe Gly Asn Gly Cys 

115 120 

(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

Met Asn Val Thr Leu Ser Leu Ala Met Asp Lys Asp Phe Val lie His 

15 10 15 

Thr Pro Leu Met Trp Leu Asn Lys Ala Glu Thr Trp Lys Leu Ser Asp 

20 25 30 

Glu Leu Glu Val Leu Asp Tyr lie Arg Thr Lys Thr Leu Thr Cys Tyr 

35 40 45 

Asn Gly lie lie Gly Asp Gly Cys Gly Glu Cys Pro Ala Cys His Leu 

50 55 60 

Arg Gin Arg Gly Leu Asn Gin Tyr Leu Glu Ser Lys Gly Ala Xaa 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
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Met Leu Thr Thr Glu Lys Leu Val Glu Thr Leu Lys Leu Asp Leu lie 

15 10 15 

Ala Gly Glu Glu Gly Leu Ser Lys Pro lie Lys Asn Ala Asp lie Ser 

20 25 30 

Arg Pro Gly Leu Glu Met Ala Gly Tyr Xaa Xaa His Tyr Ala Ser Asp 

35 40 45 

Arg lie Gin Leu Leu Gly Thr Thr Glu Leu Ser Phe Tyr Asn Leu Leu 

50 55 60 

Pro Asp Lys Asp Arg Ala Gly Arg Met Arg Lys Leu Cys Arg Pro Glu 
65 70 75 80 

Thr Pro Ala lie He Val Thr Arg Gly Leu Gin Pro Pro Glu Glu Leu 

85 90 95 

Val Glu Ala Ala Lys Glu Leu Asn Thr Pro Leu He Val Ala Lys Asp 

100 105 110 

Ala Thr Thr Ser Leu Met Ser Arg Leu Thr Thr Phe Leu Glu His Ala 

115 120 125 

Leu Ala Lys Thr Thr Ser Leu His Gly Val Leu Val Asp Val Tyr Gly 

130 135 140 

Val Gly Cys Thr 
145 



(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

Met He His Gly Xaa Xaa Val Leu Gly Trp Gly Val Gly Gly He Glu 

1 5 10 15 

Xaa Glu Ala Gly Met Leu Gly Gin Pro Ser Tyr Phe Pro He Pro Glu 

20 25 30 

Val He Gly Val Arg Leu Val Xaa Ser Leu Pro Gin Gly Ala Thr Ala 

35 40 45 

Thr Asp Leu Cys Val Lys Ser Asn Ser Arg Ala Thr 
50 55 60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

Met His Asn Gin Val Thr Xaa Thr Leu Lys Leu Phe Gin Ala Pro Thr 

1 5 10 15 

Tyr His Asn Tyr Ser Thr Ser Thr Thr Ser Ser Ser Val Arg Leu Ser 

20 25 30 

Asn Gly Asn Thr Ala Gly Ala Thr Gly Ser Ser Ala Ala Gin lie Met 

35 40 45 

Ala Gin Arg Thr Gly Val Ser Ala Ser Thr Trp Ala Ala lie lie Ala 

50 55 60 

Arg Glu Ser Asn Gly Gin Val Asn Ala Tyr Asn Pro Ser Gly Ala Ser 
65 70 75 80 

Gly Leu Phe Gin Thr Met Pro Gly Trp Gly Pro Thr Asn Thr Gly Asp 

85 90 95 

Gin Gin lie Asn Ala Xaa Val Lys Ala Tyr Lys Ala Gin Gly Leu Gly 

100 105 110 

Ala Trp Gly Phe 
115 

(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 : 

Met Lys Tyr lie His Arg Arg His lie Thr Met Lys Pro Gin Leu Ser 
15 10 15 
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Trp Lys Val Gly Gly Gin Gin Gly Glu Gly lie Glu Ser Thr Gly Glu 

20 25 30 

Xaa Phe Ala Thr Ala Met Asn Arg Lys Gly Tyr Tyr Leu Tyr Gly Tyr 

35 40 45 

Arg His Phe Ser Ser Arg lie Lys Gly Gly His Thr Asn Asn Lys lie 

50 55 60 

Arg Val Ser Thr Thr Pro Val His Ala lie Ser Asp Asp Leu Asp lie 
65 70 75 80 

Leu lie Ala Phe Asp Gin Glu Thr lie Asp Val Asn His His Glu Met 

85 90 95 

Arg Glu Asp Ser lie lie Leu Ala Asp Ala Lys Ala Lys Pro Val Lys 

100 105 110 

Pro Glu Gly Cys His Ala Gin Leu lie Glu Leu Pro Phe Thr Ala Thr 

115 120 125 

Ala Lys Glu Leu Gly Thr Ser lie Arg Ala Lys His Gly Cys Asn Arg 

130 135 140 

Cys Tyr 
145 

(2) INFORMATION FOR SEQ ID NO: 425: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 

Met Gly Ser Arg Gly Leu Arg Lys Lys Gly Leu Arg Pro Phe Arg Asn 

1 5 10 15 

Lys Gly Gly His Ala Glu Ala Phe Phe Pro Ser Lys Leu Pro Leu lie 

20 25 30 

Asn Leu Arg Met Asn Asn Arg Asn His Arg Lys lie Val Val lie Asp 

35 40 45 

Gly Gin lie Gly Tyr Val Gly Gly Phe Asn Val Gly Asp Glu Tyr Leu 

50 55 60 

Gly Lys Ser Lys Lys Phe Gly Tyr Trp Arg Asp Thr His Leu Arg lie 
65 70 75 80 

Val Gly Asp Ala Val Asn Ala Leu Gin Leu Arg Phe lie Leu Asp Trp 
85 90 95 
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Asn Ser Thr Ser His Thr 
100 
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(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

{ C ) STRANDEDNESS : s ing 1 e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

Met Gly He Gly Arg Val Thr Gin Val Met Gly Pro Val He Ala Val 

15 10 15 

Arg Xaa Glu His Asn Glu Val Pro Lys He Asn Asn Ala Leu Val He 

20 25 30 

Asp Val Pro Lys Glu Glu Gly Thr He Gin Leu Thr Leu Glu Val Ala 

35 40 45 

Leu Gin Leu Gly Asp Asp Xaa Xaa Arg Thr He Ala Met Asp Ser Xaa 

50 55 60 

Asp Gly Val Gin Arg Gly Met Asp Val Lys Asp Thr Gly Lys Glu He 
65 70 75 80 

Ser Val Pro Val Gly Asp Glu Thr Leu Gly Arg Val Phe Asn Val Leu 

85 90 95 

Gly Glu Thr He Asp Leu Lys Glu Glu He Ser Asp Ser Val Arg Arg 

100 105 110 

Asp Pro He His Arg Gin Ala Pro Ala Phe Asp Glu Leu Ser Thr Glu 

115 120 125 

Val Gin He Leu Glu Thr Gly He Lys Val Val Asp Leu Leu Ala Pro 

130 135 140 

Tyr He Lys Gly Gly Lys He Gly Leu Phe Gly Gly Ala Gly Val Gly 
145 150 155 160 

Lys Thr Val Leu He Gin Glu Leu He Asn Asn He Ala Gin Arg Ala 



165 



170 



175 



Arg Trp Val 



Phe 



Leu 



Val 



Phe 



Arg 



Arg Cys 



Arg Leu Asn Gly Thr 



Pro 



180 



185 



190 



(2) INFORMATION FOR SEQ ID NO: 427: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 

Met Leu Asn Pro Pro Leu Xaa Gin Leu Thr Ser Gin lie Lys Ser Lys 

1 5 10 15 

Tyr Leu He Ala Thr Thr Ala Ala Lys Arg Ala Arg Glu He Asp Glu 

20 25 30 

Gin Pro Glu Thr Glu Leu Leu Ser Glu Tyr His Ser Phe Xaa Pro Val 

35 40 45 

Gly Arg Ala Leu Glu Glu He Ala Asp Gly Lys He Arg Pro Val He 

50 55 60 

Ser Ser Asp Tyr Tyr Gly Lys Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ JD NO: 428: 

Met His Pro Gin Leu Val Ser Gin Tyr Asp Leu Ala Lys Gly Asn He 

15 10 15 

Pro Thr Trp Asn Leu Phe Gly Leu Glu He Lys Gin Leu Asn Tyr Gin 

20 25 30 

Gly Gin Val Leu Pro Val Leu He Ala Ala Tyr Val Leu Ala Lys He 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

Met lie Ala Ser Ser Leu Val Asn Lys Lys Ser Tyr His Gin Asp Lys 

15 10 15 

Asn Thr lie Ala lie Asp Lys Pro Tyr Phe Gly Val Ser Leu lie val 

20 25 30 

Phe lie Val Leu Leu Ser Gly Leu Thr Phe lie Pro Val Leu Leu Leu 

35 40 45 

Gly Pro lie Gly Glu Phe Leu Thr Leu Lys 
50 55 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) ' STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

Met Asn lie lie Glu Gin Lys Phe Tyr Asp Ser Lys Ala Phe Phe Asn 

15 10 15 

Thr Gin Gin Thr Lys Val He Ser Phe Arg Lys Asp Gin Leu Lys Lys 

20 25 30 

Leu Ser Lys Ala He Lys Ser Tyr Glu Ser Asp He Leu Glu Ala Leu 

35 40 45 

Tyr Thr Asp Leu Gly Lys Asn Lys Gly Thr Lys Leu Met Leu Leu Lys 

50 55 60 

Leu Ala 
65 

(2) INFORMATION FOR SEQ ID NO: 431: 
(i) SEQUENCE CHARACTERISTICS: 

503 



WO 97/30070 



PCT/US97/02318 



(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

Met Xaa Leu lie Gly Lys He Xaa Gly Ser Val Asp Cys Ala Lys Glu 

15 io 15 

Trp Thr Xaa Trp Gly He Lys Tyr Gly Leu Phe He Gin Ser Val He 

20 25 30 

Asp Phe He He He Ala Cys Ala Leu Phe He Cys Val Lys He Ala 

35 40 45 

Asn Thr Leu Met Lys Lys Glu Glu Ala Glu Glu Glu Ala Val Val Glu 

50 55 60 

Glu Asn Val Val Leu Leu Thr Glu He Arg Asp Leu Leu Arg Glu Lys 
65 70 75 80 

Lys 



(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 

Met Asn Leu Leu Thr Phe Asp Glu Met Ala Arg He Ala Lys Val Tyr 

15 10 15 

Ala Glu Leu Gly Val Lys Lys He Arg He Thr Gly Gly Glu Pro Leu 

20 25 30 

Met Arg Arg Asp Leu Glu Pro Cys Thr Tyr Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 433: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 



Met Phe Asn Pro Phe Asp Glu Phe 

1 5 
Val Glu Val Ala Gly Thr Phe Pro 
20 

Asn Glu Thr lie Lys Gly Phe Met 

35 40 
Leu Lys Phe His Gin Met Ser Lys 

50 55 
Pro Tyr His lie Pro lie Thr Asn 
65 70 
Lys Thr Tyr Glu Val Val Val Arg 
85 



Pro His Thr He Glu He Gly Glu 

10 15 
Lys Glu Tyr Glu Arg Phe Lys Ser 
25 30 
Asp Thr Pro Thr Ser Ser Glu Thr 
45 

Asp Phe Asp Arg Asn Leu Tyr Thr 
60 

Lys Thr Leu Phe Asn Tyr Glu Gly 

75 80 
Thr Gly Arg Pro Arg Arg Thr Thr 
90 95 



(2) INFORMATION FOR SEQ ID NO: 434: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 : 

Met Thr Lys His Glu Gin He Leu Asp Tyr He Glu Ser Leu Ser He 

15 10 15 

Gly Ser Lys He Ser Val Arg Lys He Ala Lys Phe Leu Asn Val Ser 

20 25 30 

Glu Gly Thr Ala Tyr Arg Ala He Lys Asp Ala Asp Lys Met Gly Met 

35 40 45 

Val Ala Thr He Asp Arg Val Gly Thr Val Arg He Glu Lys Arg Asn 
50 55 60 
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Arg.Tyr Glu lie Glu His Leu Thr Phe Asn Glu He Phe Asn He He 
65 70 75 80 

Asp Gly Gin Val Leu Gly Cys His Xaa Gly He Thr Lys Met Val Ser 

85 90 95 

Lys Phe Ala He Gly Ala Met Glu Leu Lys Asp He Leu Lys Tyr He 

100 105 HO 

Gly Pro Lys Thr Leu Leu He Val Gly Asn Arg Glu Asp Val Pro Gly 

115 120 125 

Xaa Ser Xaa Xaa Lys Glu Val Arg Pro He Leu He Thr Trp Trp lie 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

Met Ser Phe Gin Arg Val Pro Cys Glu He Ser Ala Val Pro Val He 

15 10 15 

His Val Asp Pro Phe He Asn Gin Gin Xaa Arg Gin Lys Leu Asn Gin 

20 25 30 

He He Asn Asp Ser Arg Glu Xaa Arg Val Met Lys Met Ala Thr Asp 

35 40 45 

Gly Lys Ser Leu Ala Asp Leu Leu Ser 
50 55 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 
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Met Trp Ser Phe Lys Ser Met Phe Asn Thr Pro Glu Asp Pro Glu Lys 

1 5 10 15 

Asp Ala Tyr Glu Phe Tyr Gly Lys Val Gin Pro Gly lie Thr Thr Pro 

20 25 30 

Thr Gin Lys Thr Cys Asn Phe Val Ala Leu Asp Leu Lys Ser Lys Asp 

35 40 45 

Arg Asp Ala lie Lys Ala Met Phe Lys Lys Trp Thr Val Met Ala Asp 

50 55 60 

Arg Met Met Asp Gly Asp Thr Val Gly Lys Thr Ser Asn Asn Pro Leu 
65 70 75 80 

Met Pro Pro Val Asp Thr Gly Glu Ser lie Gly Leu Gly Ala Ser Lys 

85 90 95 

Leu Thr lie Thr Phe Gly lie Ser Lys Ser Leu Met Lys Lys Asn Gly 
100 105 110 

Leu Ser Lys 
115 

(2) INFORMATION FOR SEQ ID NO:437: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

Met Leu Thr His Phe Tyr Ala Xaa His lie Glu His Arg Arg Trp Gly 

1 5 10 15 

Ala Pro Xaa Pro Xaa lie Glu He Val Lys Leu Gly He Asn Lys Ala 

20 25 30 

Arg Gly He Glu Gin Val Arg Gin Phe Leu Xaa He Asp Arg Asn Asn 

35 40 45 

He He Ala Phe Gly Asp Glu Asp Asn Asp He Glu Met He Glu Tyr 

50 55 60 

Ala Arg His Gly Val Ala Met Glu Asn Gly Val Gin Glu Leu Lys Asp 
65 70 75 80 

Val Ala Asn Asn He Thr Phe Asn Asn Asn Glu Asp Gly He Gly Ser 
85 90 95 

He Phe Glu 
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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

Met He Glu Arg Phe Val His Glu His Asp Val He Met Phe Thr He 

15 10 15 

Cys Met Asp Val He Asp Ser Ala Phe Ala Pro Gly Val Ser Ala Pro 

20 25 30 

Xaa Val Leu Gly Leu Tyr Pro His Thr Val Leu Glu Leu Ala Lys Arg 

35 40 45 

He He Pro Ser Asp Lys Val Ser Ser Val Ser He Ala Glu Met Asn 

50 55 60 

Pro Thr Tyr Asp Ala Asp Asn Arg Xaa Xaa Lys Leu Val Pro Asn Leu 
65 70 75 80 

Xaa His His Phe Leu Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:439: 

Met Met Ser Phe Thr Met Asn Gin Gin Met Arg Gin Leu Leu Gin Lys 

15 10 15 

Tyr Val Glu Lys His Gin He Thr Asp Phe Met Phe Phe Met Ser Val 
20 25 30 
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Val Met Thr Leu Leu Ser Arg Tyr Ala 

35 40 
Gly Ser Val Met Ser Ala Arg Met His 

50 55 
Gly Met Phe Ala Asn Thr Leu Val Tyr 
65 70 
Lys Met Trp Thr Gin Phe Leu Gin Glu 
85 
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Arg Lys Asp Asp Val Val Val 
45 

Lys Gly Thr Glu Gin Met Leu 
60 

Ser Gly Gin Pro Ser Pro Asp 
75 80 

Val 
90 



(2) INFORMATION FOR SEQ ID NO: 440: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 



Met Val Ser Tyr Ala Gly Thr Ser. Ser Met Glu Val Val Xaa His lie 

1 5 10 15 

Arg Xaa Val Asp Val Phe Asn Asn Lys His Asp Leu Ala Pro Leu Ser 

20 25 30 

Tyr Leu Thr Phe Val Ala Leu Asp Asp Glu Gly Thr Pro Lys His Val 

35 40 45 

Pro Gly Val Tyr Pro Glu Asp Asp Val Val Lys Trp Phe Tyr Asp Thr 

50 55 60 

Ala Pro Gin Arg Val Glu Arg Arg Lys Ala Arg Arg lie Glu Ser Lys 
65 70 75 80 

Gin Thr Xaa Glu Tyr Leu Ala Gin Val Gin His He Arg Asp 

85 90 



(2) INFORMATION FOR SEQ ID NO:441: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 441: 



Met Ala Asn Ala His Asp Phe lie 

1 5 
Thr Glu Val Gly Glu Arg Gly Val 
20 

Arg Leu Ser He Ala Arg He Phe 

35 40 
Leu Asp Glu Ala Thr Ser Ala Leu 

50 55 
Ser Arg Ser lie Arg Cys Val Asp 
65 70 



Met Asn Leu Pro Gin Gly Tyr Asp 

10 15 
Lys Leu Ser Gly Gly Gin Lys Gin 
25 30 
Leu Asn Asn Pro Pro He Leu He 
45 

Asp Leu Glu Ser Glu Ser Xaa Asn 
60 



(2) INFORMATION FOR SEQ ID NO: 442: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 

Met Ala Ser Ser Leu Gly Gly Ala Phe Gly Val Ala He Ser Gly Ala 

15 10 15 

Val Tyr Val Gly Ala Val Ala Ala Thr Ser He His Thr Gly Ala Met 

20 25 30 

He Ala Leu Trp Val Asn Val Leu Met Gly He Met Ala Phe He Ala 

35 40 45 

He Leu Phe Gly Ala Xaa Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 443: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 



Met He Tyr Phe Asn He Gly Gin He He Ala Asn He He Cys Trp 

15 10 15 

Ala Leu He Ala Pro Thr Leu Asp He Leu He Tyr Asn Glu Pro Ala 

20 25 30 

Asn Lys Val Tyr Thr Gin Gly Val He Ser Ala Val Leu Asn He He 

35 40 45 

Ser Val Gly He He Gly Thr He Leu Leu Lys Ala Tyr Ala Ser Ser 

50 55 60 

Gin He Lys Lys Gly Ser Leu Arg Lys Glu 
65 70 



(2) INFORMATION FOR SEQ ID NO: 444: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION 

Met Lys Met Lys Lys Xaa Asp lie 

1 5 
Gly He Gly Ala Ala Val Phe Val 
20 

Pro Thr Gly Phe Pro Asn Thr Asn 

35 40 
Ala Leu He Ser Ala He Phe Gly 

50 55 
I^eu Val Gly His Ala He Lys Asp 
65 70 
Gly Gly Val Gly Leu Phe Val Gin 
85 



SEQ ID NO: 444: 

Ser Val Lys Thr Val Val Ala He 

10 15 
He Leu Gly Arg Phe Val Val He 
25 30 
He Glu Thr Ser Tyr Ala Phe Leu 
45 

Pro Phe Ala Gly Leu Met Pro Gly 
60 

Phe Pro Asp He Arg Val Val Leu 

75 • 80 
Glu Leu Ser Pro Val 
90 



(2) INFORMATION FOR SEQ ID NO: 445: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO:445: 

Met Leu Asn Ala Asp Leu Lys Gin Gin Leu Lys Gin Leu Leu Glu Leu 

1 5 10 15 

Met Glu Gly Asn Val Glu Phe Val Gly Ser Leu Gly Ser Asp Glu Lys 

20 25 30 

Ser Lys Glu Leu Lys Glu Leu Leu Thr Gly Asn Phe 
35 40 

(2) INFORMATION FOR SEQ ID NO: 446: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

Met Ser Leu Arg Asp Glu Ala Leu Glu Met His Lys Arg Asn Gin Gly 

15 10 15 

Lys Leu Glu Val Lys Pro Asn Val Lys Val Thr Asn Lys Glu Glu Leu 

20 25 30 

Ser Leu Ala Tyr Ser Pro Gly Val Val Glu Pro Cys Lys Asp lie Tyr 

35 40 45 

Glu Asp Lys Arg Lys Val Tyr Asp Tyr Thr lie Lys Gly Asn Thr Val 

50 55 60 

Ala Val He Thr Asp Gly Thr Ala Val Leu Gly Leu Gly Asn He Gly 
65 70 75 , 80 

Pro Glu Ala Ser He Pro Val Met Glu Gly Lys Ala Val Leu Phe Lys 

85 90 95 

Ser Phe Ala Gly He Asn Gly Val Pro He Ala Leu Asn Thr Thr Asp 

100 105 110 

Thr Glu Glu He He Lys Thr Val Lys Ser Leu Glu Pro Asn Tyr Gly 
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115 



120 



125 



Gly lie Asn Leu Glu Asp lie Ser Ala Pro Arg Cys Ser Glu lie Glu 

130 135 140 

Val Pro lie Glu Lys Arg Asn 



(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 

Met Asp lie Val Asn He Glu Ser Leu Ala Glu Gly Leu Asn Lys Trp 

15 10 15 

Met Lys Glu Leu Ser Lys Thr Thr Ser Leu Asp Pro Val Lys Pro Val 

20 25 30 

Asp Val Leu Leu Thr Glu Asn Asp He Thr He Lys Asn He Arg Asp 

35 40 45 

He Glu Ser Phe Lys Thr Leu Arg Xaa Lys lie Phe Leu Arg Pro He 

50 55 60 

Phe Glu Met Asp Asp Leu Ser Asp Phe Phe Ser 
65 70 75 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 
Met Thr Gin Tyr Pro Leu Trp Gin Gin Leu Asn Thr Leu Lys Gin Ala 



145 



150 



(2) INFORMATION FOR SEQ ID 



NO: 448: 



1 



5 



10 



15 
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Thr Trp Val Asp Leu Thr His Thr 
20 

Ser Glu Phe Glu Lys Gly Glu Val 

35 40 
His Gly Phe Tyr Val Gin Arg Trp 

50 55 
His lie Gly Cys Asn Gin Ser Thr 
65 70 



Phe Asp Pro Glu lie Pro Arg Phe 
25 30 
Ser Thr Leu Phe Thr Val Lys Asp 
45 

Asn lie Val Thr Gin Tyr Gly Thr 
60 

Leu Val Gly Glu Gin Thr Val lie 
75 80 



(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 91 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Met Gin Leu Phe Glu Thr Met Xaa He Asp Asn Gly His He Pro Arg 

15 10 15 

Leu Thr Tyr His Thr Asn Arg He Lys Cys Ser Ser Glu Arg Leu Asn 

20 25 30 

Phe Lys Phe Asp Glu His Ala Trp Arg Asn Glu Leu Asn Asp Val Thr 

35 40 45 

Thr Lys Tyr His Ser Gly Gin Tyr Arg Leu Lys He Val Leu Asn Ala 

50 55 60 

Glu Ser Lys Phe Glu Thr He Val Ser Pro Leu Pro Glu Lys Ser Ser 
65 70 75 80 

Phe Thr Ala Lys Phe Gin Val Leu Pro Gin Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 450: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 



Met Met lie Thr lie Leu Ala Gly Ala Ala Leu Ser lie Ser Gly Ala 

15 10 15 

He Val Gin Ser Val Thr Lys Asn Pro He Ala Glu Pro Gly He Leu 

20 25 30 

Gly He Asn Ala Gly Gly Gly Phe Ala He Ala Leu Phe He Ala He 

35 40 45 

Gly Lys He Asn Ala Asp Asn Phe Val 
50 55 

(2). INFORMATION FOR SEQ ID NO: 451: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



Met Pro Leu Asp Thr Cys Gly Ala Met Ser Pro Gly Met He Gly Tyr 

15 10 15 

Trp Leu Glu Thr Glu He Asn Arg He Leu Thr Glu Met Asn Ser Asp 

20 25 30 

Arg Thr Val Gly Thr He Val Thr Thr Cys Gly Ser Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 452: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 
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Met Gin Ser Met Glu Leu Leu Tyr Asn Gin Gin Phe Glu Val Xaa Lys 

1 5 10 15 

Ala He He Leu Ala Ala Thr Ser Arg Thr Ser Ser Tyr Ser Arg Ala 

20 25 30 

Phe Asn Glu He Ala Arg Gin Ala He His Leu Gly Gly Lys Glu Gly 

35 40 45 

Leu Ser He Ala Arg Gin Leu Gly Phe Leu Thr Tyr Arg Ser Ser Lys 

50 55 60 

Ser Tyr Asp Glu Arg Phe Thr Pro Asp Glu Xaa Val Ala Tyr Gin Gin 
65 70 75 80 

His Gin Gly Asn Lys Phe Lys Glu His Phe Asp Leu Asn Xaa Tyr Leu 

85 90 95 

Thr Leu Leu Asp Val Leu Asp Ser His Asn He Asp Arg Gly Arg Thr 

100 105 no 

Asp Val Thr His He Phe Lys Asn Leu Glu Thr Lys Val Leu Xaa Met 

115 120 125 

Gly Phe He Asp Asp Leu Leu Tyr Pro Asp Asp Gin Val Arg Ala Leu 

130 135 140 

Gly Glu Arg Phe Lys Tyr His Arg His Phe Phe Val Pro Asp Asn Val 
145 150 155 160 

Gly His Asp Gly Phe Leu Leu Asn Phe Ser Thr Trp Ala Pro Asn Leu 

165 170 175 

Tyr His Phe Leu Asn Leu Lys His Phe Lys Arg Lys 
180 185 

(2> INFORMATION FOR SEQ ID NO: 453: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 

Met Thr Asn Leu Asn Asp Tyr Phe Ser Glu Glu Thr Val Glu Met Met 

1 5 10 15 

Leu Ser Asn He Glu Val He Val Gly Asp Phe Glu Cys Met Asp Asp 

20 25 30 

Val Val Leu Pro Glu Asn Met Asp Thr He He His Ala Xaa Ala Arg 
35 40 45 
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Thr Asp His Phe Gly Asp Asp Asp 

50 55 
Gly Thr Val Asp Val He Xaa Val 
65 70 
He Tyr Val Ser Thr He Ser Val 
85 

Glu Asp Val Thr Phe Ser Glu Ala 
100 

Thr Ser Pro Tyr Thr Arg Ser Lys 
115 120 
Ser Cys Lys 
130 
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Glu Phe Glu Lys Val Asn Val Gin 
60 

Ala Gin Gin His His Ala Arg Leu 

75 80 
Gly Thr Tyr Phe Asp He Asp Thr 

90 95 
Asp Val Tyr Lys Gly Gin Leu Leu 
105 110 
Phe Tyr Ser Glu Leu Asn He Arg 
125 



(2) INFORMATION FOR SEQ ID NO: 454: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 454: 

Met His Xaa He Lys Ala Ala He Glu Glu Ala His Lys Xaa Asn Lys 

15 10 15 

Gin Leu Leu Val Asp Met He Ala Val Gin Asp Leu Arg Lys Thr Cys 

20 25 30 

Lys Thr Thr Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 455: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 
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Met He Thr He Asp Asp Leu He Glu Tyx Arg Lys Lys Leu Glu Pro 

1 5 10 15 

Glu lie Glu Phe Lys Pro Lys Ser Glu Asn Ala Tyr Arg Phe Arg Asn 
20 25 30 

He 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 : 

Met Ala Pro He Ala Ala Trp Gly He Thr Thr Thr Ala Phe He Asn 

15 10 15 

Phe Gly Tyr Leu Lys Gly Phe Glu Gly Val Phe He Tyr Pro Ala Leu 

20 25 30 

Leu Ser Leu He He Gly Ala He Ser Tyr Val Leu He Arg Asp Thr 

35 40 45 

Pro Gin Ser Gin Gly Leu Pro Pro He Glu He Tyr Lys Asn Asp Phe 

50 55 60 

Ala Thr Ser Asp Lys Lys His 
65 70 

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 
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Met Thr Tyr Gin Ser Cys Gin Asn Phe His Cys Ser Lys Ser Asp Thr 

1 5 10 15 

Ser lie Leu Lys Thr Lys Val Lys Ser Asp Lys Asn Asp Thr Asn Tyr 

20 25 30 

Arg Pro Val Leu Asn Gin Tyr Tyr Gly Glu Leu Leu Phe Leu Leu Asn 

35 40 45 

Ser Glu Pro Asn Asn Gin Lys Tyr Lys Xaa Asn Ser Asn Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 4 58: 



£i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



Met Glu Thr lie Glu Lys Gly Met 

1 5 
Leu lie Phe Gly Leu Gly Ala lie 
20 

Gly Gly Ala Thr Arg lie Ala Asp 

35 40 
Lys His Val Gin Trp Ala Met Leu 

50 55 
Cys lie Val Leu 
65 



Gly Xaa Thr Leu Gly His He Ala 

10 15 
Leu Gly Lys Leu Leu Ala Asp Gly 
25 30 
Thr Leu Xaa Gin Lys Phe Gly Gin 
45 

Gly Cys Ser He Tyr Arg Arg Tyr 
60 



(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

£ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 
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Met Leu Gly Val Val Pro Val lie Arg Ala Leu Arg Val Lys Ala Met 

15 10 15 

Ala lie Gin Ala Glu Thr Met Asp Ser lie Asp Arg Lys Leu Pro Gly 

20 25 30 

Leu Ser Glu Arg Glu Arg Lys lie He Ser Lys His Thr Gin Ser He 

35 40 45 

He Asn His Met Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 : 

Met Val Val He Phe He Pro Val Leu Ala Xaa Trp Arg Gly He Phe 

15 10 15 

His Gly Tyr Lys Ser Xaa Gly Xaa Ala Ala Val Ser Xaa Val Thr Glu 

20 25 30 

Gin He Ala Arg Ser Xaa Phe Thr Tyr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 



Met Pro Thr Gly Thr Asn He Tyr Ala Val Lys Gly Gly He Ala Asp 
15 10 15 
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Lys Val Trp Thr Asp Tyr Gly Gly Gly Asn Pro He Gin He Lys Thr , 
20 25 30 

Gly Ala 



(2) INFORMATION FOR SEQ ID NO: 462: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 



Met Thr Lys Thr Lys Ala He Asp He He Gly Ala Pro Ser Thr Phe 

15 10 15 

Gly Gin Arg Lys Leu Gly Val Asp Leu Gly Pro Thr Ala He Arg Tyr 

20 25 30 

Ala Gly Leu He Ser Arg Leu Xaa Gin Leu Ala Pro 
35 40 



(2) INFORMATION FOR SEQ ID NO: 463: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463: 



Met Gin He Lys Leu Pro Lys Pro Phe Phe Phe Glu Glu Gly Lys Arg 

15 10 15 

Ala Val Leu Leu Leu His Gly Phe Thr Gly Asn Ser Ser Asp Val Arg 

20 25 30 

Gin Leu Gly Arg Phe Leu Gin Lys Lys Gly Tyr Thr Ser Tyr Ala Pro 

35 40 45 

Gin Tyr Glu Gly His Ala Ala Pro Pro Glu Glu He Leu Lys Ser Ser 
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50 55 
Pro Phe Xaa Trp Phe Lys Asp Ala Leu 
65 70 
Glu Gin Gly Tyr Asp Glu lie Val Gly 
85 

Val Gly lie Leu Arg Phe Lys lie Lys 
100 105 
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60 

Asp Gly Tyr Asp Tyr Leu Val 

75 80 
Cys Trp Ser lie He Xaa Arg 
90 95 
Ala 



(2) INFORMATION FOR SEQ ID NO: 464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

Met Lys Gin Tyr Pro Met Thr Leu Ser Gly Gly Glu Ala Gin Arg Val 

1 5 10 15 

Ala Leu Ala Arg Ala Leu Ser Thr Lys Pro Asp Leu He Leu Leu Asp 

20 25 30 

Glu Pro Phe Ser Ser Leu Asp Asp Thr Thr Gin Asp Glu Ser He Thr 

35 40 45 

Leu Val Lys Arg He Phe Xaa Xaa Trp Xaa He Pro He He Phe Val 

50 55 60 

Thr His Ser Xaa Tyr Glu Ala Xaa His Met Ala His Glu He He Thr 
65 70 75 80 

He Gly 



(2) INFORMATION FOR SEQ ID NO: 465: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

Met Arg Tyr His Ser Leu lie Ala Asp Gly Ala Thr Phe Pro Asn Cys 

1-5 - 10 15 

Leu Lys lie Thr Ala Lys Asn Asp Glu Ala lie lie Met Ala Phe Glu 

20 25 30 

His lie Arg Phe Pro Val Phe Gly Val Gin Tyr His Pro Glu Ser He 

35 40 45 

Leu Ser Glu Tyr Gly Tyr Arg Gin Val 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 



Met Gly Val Asp Thr Glu Asp Asn 

1 5 
Arg Gly Lys Gly Pro Val Val Lys 
20 

Ala Lys Asn Val Phe Leu Ala Ser 

35 40 
He Ala Trp His Leu Ser Lys He 

50 55 
Asn Arg His Ala Cys Phe Gin Arg 
65 70 



Ser Asp Pro Lys Tyr He Thr He 

10 15 
Glu Leu Lys Lys His Ala Lys Lys 
25 30 
Asp Pro Asp Arg Glu Gly Glu Ala 
45 

Leu Glu Leu Glu Asp Ser Lys Glu 
60 

Asn Asn 



(2) INFORMATION FOR SEQ ID NO: 467: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

Met Val Leu His Asp lie Xaa Gin Ala He Arg Phe Ser Asp His Leu 

15 10 15 

He Ala Met Lys Glu Gly Asp He He Ala Thr Gly Ser Thr Glu Asp 

20 25 30 

Val Leu Thr Gin Glu He Leu Glu Lys Val Phe Asn He Asp Val Val 

35 40 45 

Leu Arg Lys Asp Pro Lys Thr Gly Lys Pro Leu Leu Val Thr Tyr Asp 

50 55 60 

Leu Cys Arg Thr Ser Leu Phe Leu He Lys 
65 70 



(2!) INFORMATION FOR SEQ ID NO:468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



Met Tyr Gly Arg Leu Met Asn Thr 

1 5 
Ala Thr Pro Glu Gin He Leu Val 
20 

Asp Asp He He Thr Glu Gin Asp 

35 40 
Asp Gin Leu Ser Ala Glu Val He 

50 55 
Ser Asp Ser Leu Thr Ala Lys Val 
65 70 
Lys 



Pro Ala Val Asn Xaa He Phe Glu 

10 15 
Asn Gly Glu Leu He Val His Arg 
25 30 
He Leu Ala His He Asn Leu He 
45 

Asp Thr Pro Ser Thr Ala Thr He 
60 

Glu Val Thr Leu Leu Glu Gly Ser 
75 80 



(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 199 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 69: 



Met Val Met Gly Val Glu Val Phe Pro His His Asp Val Lys Gin Xaa 

15 10 15 

Pro His lie Pro Ala Thr Glu lie Met Arg His Xaa Pro Thr His Pro 

20 25 30 

Asn Asn Gin His Lys Gly Tyr Ala Gly His His Leu Phe lie Lys Leu 

35 40 45 

Xaa Asn Tyr His Gin Thr Asp lie Ser Arg Ser lie Cys Thr lie Phe 

50 55 60 

Thr Lys lie Thr lie Leu lie Phe Val Asp Gly Thr Gly Lys Ser Leu 
65 70 75 80 

Glu Asp Leu Lys Gly lie lie Asp Gin Gly Gin Pro Val Leu lie Tyr 

85 90 95 

His Thr Asn Leu Gly Ser Lys Pro Leu Leu Arg Val Phe Arg Phe Asp 

100 105 110 

Asn Lys Pro Ala Lys Gin Val Ser Asn lie His Val Thr Val Leu He 

115 120 125 

Gly Tyr Asp Asp Tyr Tyr Tyr Tyr Tyr He Asp Pro Leu Trp Ser His 

130 135 140 

He Arg Arg Gly Leu Val Leu Pro Ala He lie Pro Asn Arg Lys Gin 
145 150 * 155 160 

He He Lys He Arg Lys Glu Lys Met Glu Tyr Ser Phe Asn Ser Pro 

165 170 175 

Gly Arg Lys Cys He Tyr Val Gin Pro His Ser Tyr Thr He Glu Asn 

180 185 190 

Gin Gin Gin Asn Lys His Thr 
195 

(2) INFORMATION FOR SEQ ID NO: 470: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

Met Thr Asn Tyr Lys Val Val Val Leu Asp Met Asp Asp Thr Leu Leu 

15 10 15 

Asn Ser Asp Asn Val lie Ser Glu Glu Thr Ala Asn Tyr Leu Thr Ala 

20 25 30 

He Gin Asp Glu Gly Tyr Tyr Val Val Leu Ala Ser Gly Arg Pro Thr 

35 40 45 

Glu Gly Met He Pro Thr Ala Arg Asp Leu Lys Leu Pro Glu His His 
50 55 60 

Ser 
65 

(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



Met Ala Val Lys His Ala Ser Ala Pro Lys Ala Tyr He Asn He Thr 

15 10 15 

Gly Leu Gly Phe Ala Lys Leu Thr Lys Glu Gly Ala Glu Leu Lys Tyr 

20 25 30 

Ser Asp He Thr Lys Thr Arg Gly Leu Gin Lys He Gly Val Glu Thr 

35 40 45 

Gly Gly Glu Leu Lys Thr Ala Tyr Ala Asp Gly Gly Pro He Glu Ser 

50 55 60 

Gly Asn Thr Asp Gly Glu Gly Lys He Ser Leu Pro Met His Ala Phe 
65 70 75 80 

Pro Lys Glu He Arg Lys He Val Phe Asn Glu Asp Tyr Asp Glu Gly 

85 90 95 

Trp Arg Leu Arg Thr Arg Lys Pro Gly Lys Pro His Gin Leu Arg Xaa 

100 105 110 

Leu Tyr Gly Ser Asp Lys Glu Arg Xaa Xaa Thr Gly Thr He 
115 120 125 
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(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 



Met Tyr Thr Thr Gly Cys Leu Glu lie Glu His Gly Asp Ala Asn Glu 

15 10 15 

Gly Val Leu Leu Leu Lys Gin Ala Leu Glu Val Asp Asn Ala Tyr Pro 

20 25 30 

Arg Thr Phe lie Asp Phe Lys Ala Asp Leu Tyr Ser 
35 40 



(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

Met Lys Glu Asp Leu Asp Pro Thr Phe Met Trp His Leu Ala Phe Ala 

1 5 10 15 

Tyr Gly Gin Glu Glu Arg Asp Lys Glu Ala Gin His Phe Phe Glu Leu 

20 25 30 

Ala Tyr Pro Thr Met Lys Thr Asn lie Asp Phe Met Ser Asp Tyr Tyr 

35 40 45 

Phe Tyr Leu He Glu He Gly Gin Lys Glu Gin Ala He Thr He Leu 

50 55 60 

Asn Gin Leu Leu Glu Leu Glu Pro Ser Asn Glu Asn Trp His Asp Glu 
65 70 75 80 

Leu Ser Arg Leu Gin Ser 
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85 

(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 

Met Asn Ser Gin Glu Leu Leu Ala lie Ala Val Asp Xaa lie Xaa Asn 

1.5 10 15 

Lys Lys Gly Glu Asp Thr lie Ser Leu Glu Met Lys Gly lie Ser Asp 

20 25 30 

Met Thr Asp Tyr Phe Val Val Thr His Gly Asn Asn Glu Arg Xaa Val 

35 40 45 

Gin Ala lie Ala Arg Ala Val Lys Glu Val Ala Asn Glu Gin Asn lie 

50 55 60 

Glu Val Lys Arg Met Glu Gly Tyr Asn Glu Ala Arg Trp lie Leu lie 
65 70 75 80 

Asp Leu Ala Arg Cys Cys Gly Thr Cys Phe Pro 
85 90 

(2) INFORMATION FOR SEQ ID NO:475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 5: 

lie Asp lie Pro Leu Asp Glu Ser Ser Phe Met Tyr Asp Thr Pro 

5 10 15 

lie lie Gin Asp His His Met Thr His Leu Val Ser Val Lys Glu 
20 25 30 

528 
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Leu Xaa lie lie Met Pro Lys Lys Glu Xaa Lys Gin Arg Val Tyr Gin 

35 40 45 

Leu Asn Glu Ala Gin Thr Leu Phe Phe Gly Gly Leu Arg Ala His Arg 

50 55 60 

Leu Cys lie Arg Trp 
65 

(2) INFORMATION FOR SEQ ID NO: 476: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 



Met lie Ser Gly Leu Gly Phe lie Thr lie Gly Pro Gly Ala Lys Val 

1 5 10 15 

lie Val Arg Val Pro Lys Asn Val Asp Val Val Leu Arg Asn Ser He 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



Met Ala He His Tyr Glu Thr Lys Ala Thr Asn Val Gly Gly Arg Lys 

15 10 15 

Gly His Val Tyr Thr Asp Asp Arg Ala Leu Asp He Asp He Val Pro 

20 25 30 

Pro Ala Gin Ala Asp Gly Lys Ala Thr Asn Pro Glu Gin Leu Phe Ala 
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35 



40 



45 



Ala Gly Tyr Ala Ser Cys Phe Asn Gly Ala Phe Asp Leu lie Leu Lys 

50 55 60 

Gin Asn Lys Cys Val Met Leu lie Gin Lys Val Thr Leu Thr Val Arg 
65 70 75 80 

Leu Glu Asp Asp Gin Thr Gin Lys Val lieu lie Xaa Trp Phe Asn 



(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

Met Arg Asn Val Lys Gin lie Ala Thr Lys Ser He He Ala He He 

1 5 10 15 

Ser Leu Cys He Leu Thr Tyr Thr Thr Met He Gly Ser Val Leu Ala 

20 25 30 

Asp Glu He Lys Tyr Pro Ser Ala Lys Phe Asn Gin Pro Glu Ala Lys 

35 40 45 

Asp Lys Thr Glu Leu Thr Thr Ser He Phe Asp Glu Lys He Lys Glu 

50 55 60 

Asn Lys Ala Leu Glu Leu Leu lie Phe Asn Gin Glu Asn Lys Asn Val 
65 70 75 80 

Thr Glu Glu Gin Gin Leu Val Asp Glu Lys Ala Gin Leu He Ser Asp 

85 90 95 

Met Thr Gly Lys He Tyr Leu Gin Val Lys Leu Lys Gly Gin He Asp 

100 105 110 

Lys Glu Gin Leu Val Phe Gin Asn Asp Lys Asn Glu Glu Phe Pro Phe 

115 120 125 

Val He Lys Asp Glu Lys Asp Asp Pro 
130 135 



85 



90 



95 



(2) INFORMATION FOR SEQ ID NO: 479: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 95 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 



Met Ser Arg Thr Lys Leu Gin Asp 

1 5 
Thr Glu Pro Lys Gin Val Val Val 
20 

Ala Leu Ala Gly lie Phe Gly lie 

35 40 
Leu Lys Glu Trp Glu Lys Asp Ser 

50 55 
Ser Leu Ser Ser Thr Leu lie Val 
65 70 
Tyr Met Lys Val Arg Pro Lys Lys 
85 



Phe Pro Ser Lys Glu Asn Thr Val 

10 15 
Asn Pro Leu Phe Ala Lys Pro Asn 
25 30 
Ser Tyr Ser Ser Val Asn Arg lie 
45 

Lys Gly Val Asp Asp Leu Tyr Tyr 
60 

lie Ser lie Pro Arg Phe Xaa Glu 

75 80 
Trp Met Phe Glu Val Lys Gin 
90 95 



(2) INFORMATION FOR SEQ ID NO: 480: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:480: 



Met Leu Glu Pro Thr lie Lys Pro He Glu Ala Val Asp Glu Cys Leu 

15 io 15 

Gly Glu Val Val Asp Lys He Leu Asp Met Asp Gly Tyr Ala He He 

20 25 30 

Thr Ala Asp His Gly Asn Ser Asp Gin Val Leu Thr Asp Asp Asp Gin 

35 40 45 

Pro Met Thr Thr His Thr Thr Asn Pro Val Pro Val He Val Thr Lys 

50 55 60 

Glu Gly Val Thr Leu Arg Glu Thr Gly Arg Leu Gly Asp Leu Ala Pro 
65 70 75 80 
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Thr Leu Leu Asp Leu Leu Asn Val Glu Gin Pro Val Asp Met Thr Gly 

85 90 95 

Glu Ser Leu lie Lys His 
100 



(2) INFORMATION FOR SEQ ID NO: 481: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 



Met Thr Xaa Arg He Ser Xaa He 

1 5 
Ala Ser Xaa Ala Gly Gly Cys Ser 
20 

Ala Thr Met Thr Trp Gin Gly Trp 

35 40 
Gly Val Trp Thr Met Ser His Phe 

50 55 
Val Gin Gin Xaa Ser Ala He Gly 
65 70 



Gly Gly Xaa Xaa Leu Gly Phe Gly 

10 15 
He Gly Asn Gly Leu Val Glu Thr 
25 30 
He Ala Leu Ala Ser Met He Xaa 
45 

Xaa Xaa Val Arg Pro Met Lys Xaa 
60 



(2) INFORMATION FOR SEQ ID NO: 482 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 



Met lie Gin Thr His He Asn Val Val Gly Leu Ser He Asn Thr Gly 

1 5 10 15 

Val Phe Ala Ala Gly Ala Asn Asp Asn His Glu Pro Lys Lys Gly He 



WO 97/30070 PCT/US97/Q2318 

20 25 30 

Lys Phe Ala Thr Pro Ala Asn Arg Thr Asn Lys Gly He Tyr Gly Lys 

35 40 45 

Ser He He Glu 
50 

(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

Met Leu Glu Thr Gin Glu Val Asp Ser Tyr Arg Thr Ala Thr Asn His 

15 10 15 

He Phe Lys Arg Lys Asn Gly Ala Ser Lys Asn He He Asp Glu Lys 

20 25 30 

Leu He Pro Lys Asp Tyr Trp Leu Ser Gin Ala Pro Lys Leu Asn Ser 

35 40 45 

Lys Gin Leu He Asp Asp Leu Lys Ala Gly Lys Asp He Pro Gly Val 

50 55 60 

Glu Leu Lys Val Thr Glu Ser Leu Val He Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 



Met Ser Glu He He Val Tyr Thr Gin Asn Asp Cys Pro Pro Cys Thr 
1 5 10 15 
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Phe Val Lys Asn Tyr Leu Asn Glu His Gin He Asp Phe Glu Glu Arg 

20 25 30 

Asn He Asn Asn Gin Gin Tyr Arg Asn Glu Met He Asp Phe Asp Ala 

35 40 45 

Phe Ser Thr Pro Phe He Leu Leu Asn Gly Asn Pro Met Tyr His Val 

50 55 60 

Asp Leu Asp Glu He Asn Lys Val Leu Asn He Gin Asp 
65 70 75 



(2) INFORMATION FOR SEQ ID NO:485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 : 

Met Lys Tyr Ala Asp Lys Lys Gly Val Asp lie Tyr Tyr He Ser Asp 

15 10 15 

Arg Asp Lys Glu Lys Asp Leu Lys Ala Thr Gin Lys Asn Leu Lys Gin 

20 25 30 

Gin Gly He Pro Gin Ala Lys Lys Ser His He Leu Leu Lys Gly Lys 

35 40 45 

Asp Asp Lys Ser Lys Glu Ser Arg Arg Gin Met Val Gin Lys Asp His 

50 55 60 

Lys Leu Val Met Leu Phe Gly Asp Asn Leu Leu Asp Phe Thr Asp Pro 
65 70 75 80 

Lys Glu Ala Thr Ala Glu Ser Arg Glu Ala Leu He Glu Lys His Lys 

85 90 95 

Asp Asp Phe Gly Lys Lys Tyr He He Phe Pro Asn Pro Met Tyr Gly 

100 105 no 

Ser Trp Glu Ala Thr lie Tyr Asn Asn Asn Tyr Lys Ala Ser Asp Lys 

115 120 125 

Ala Lys Asp Lys Leu Arg Lys Asn Ala lie Lys Gin Ser Asp Pro Lys 

130 135 140 

Thr Gly Glu Val Lys 
145 

(2) INFORMATION FOR SEQ ID NO: 486: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

Met Ser Leu Gin Asp Pro Thr Arg Lys Met Ser Lys Ser Asp Asp Asn 

1 5 10 15 

Ala Lys Asn Phe lie Ser Leu Leu Asp Glu Pro Asn Val Ala Ala Lys 

20 25 30 

Lys lie Lys Ser Ala Val Thr Asp Ser Asp Gly lie lie Lys Phe Asp 

35 40 45 

Arg Asp Asn Lys Pro Gly lie Thr Asn Leu lie Ser lie Tyr Ala Gly 

50 55 60 

Leu Thr Asp Met Pro lie Lys Asp lie Glu Ala Lys Tyr Glu Gly Glu 
65 70 75 80 

Gly Tyr Gly Lys Phe Lys Gly Asp Leu Ala Glu lie Val Lys Ala Phe 

85 90 95 

Leu Val Glu Phe Gin Xaa Lys lie Arg Lys Phe Leu 
100 105 

(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 

Met Leu Ser Asp Ala Gin Glu Gly Asp Val lie Ser Gly Lys Pro Trp 

1 5 10 15 

lie Trp Met Pro Pro Ala He Met He Thr Leu Thr He Leu Ser He 

20 25 30 

Asn Phe Val Gly Gly Gly Leu Lys Asp Ala Phe Asn Pro Arg Gly Arg 
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35 40 45 

Arg 



(2) INFORMATION FOR SEQ ID NO: 488: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

Met Gly Arg Asp Asn Val Leu lie Thr Pro His lie Gly Ser Ala Ser 

1 5 10 15 

Val Thr Thr Arg Asp Asn Met He Gin Leu Cys He Asn Asn He Glu 

20 25 30 

Ala Val Met Thr Asn Gin Val Pro His Thr Pro Val Asn 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 489: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 89: 

Met Lys Ser Lys Ser Lys Gin Pro Pro Asn Lys Tyr Val Glu Ala Phe 

15 10 15 

Lys Pro Tyr Leu Leu Thr Leu Leu Tyr Leu Ala He Phe He Thr Leu 

20 25 30 

Tyr Leu He Tyr Gly Ser Gly Asp Thr His Asn Asn Phe He Tyr Asn 
35 40 45 

Glu Phe 
50 
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(2) INFORMATION FOR SEQ ID NO: 490 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 

Met Thr Asp lie He Asn Lys Leu Gin Ala Phe Ala Asp Ala Asn Pro 

1 5 10 15 

Gin Ser He Ala Val Arg His Thr Thr Asp Glu Leu Thr Tyr Gin Gin 

20 25 30 

Leu Met Asp Glu Ser Ser Lys Leu Ala His Arg Leu Gin Gly Ser Lys 

35 40 45 

Lys Pro Met He Leu Phe Gly His Met Ser Pro Tyr Met lie Val Gly 

50 55 60 

Met He Gly Ala He Lys Ala Gly Cys Gly Tyr Val Pro Val Asp Thr 
65 70 75 80 

Ser Asn Ser Trp Lys Thr Val Leu Lys 
85 



(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 



Met Leu He Ala Asn Glu Thr Val Ala Glu His Phe Ser Lys Leu Asn 

15 10 15 

Val Pro Phe He Tyr Arg Val His Glu Gin Pro Lys Ser Asp Arg Leu 

20 25 30 

Arg Gin Phe Phe Asp Phe He Thr Asn Phe Gly He Met He Lys Gly 
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35 40 45 

Thr Gly Glu Asp lie His Pro Thr Thr Leu Gin Lys Val Gin Glu Glu 

50 55 60 

Val Glu Gly Arg Pro Glu Gin Met Val He Ser Thr Met Met Leu Arg 
65 70 75 80 

Ser Met Gin Gin Ala His Tyr Asp Asp Val Asn Leu Gly His Cys Gly 

85 90 95 

Leu Ser Ala Glu Tyr Tyr Thr His Xaa Thr Ser Pro He Arg Arg Tyr 

100 105 110 

Pro Asp Leu Thr Xaa His Arg Leu He Arg Lys Tyr Leu He Glu Lys 

115 120 125 

Ser Met Asp Asn Lys Glu Val Lys Arg Trp Glu Asp Lys Leu Pro Glu 

130 135 140 

Leu Ala Glu His Thr Ser Lys Arg Glu Arg Arg Ala He Glu Ala Glu 
145 150 155 160 

Arg Asp Thr Asp Glu Leu Lys Lys Ala Glu Tyr Met He Gin His He 

165 170 175 

Gly Asp Glu Phe Glu Gly He Val Ser Ser Val Ala Asn Phe Gly Xaa 

180 185 190 

Phe He Glu Leu Xaa Asn Thr He Glu Gly Met Xaa His He Ala Asn 

195 200 205 

Met Thr Asp Asp Tyr Tyr Arg Val Glu Glu Arg Gin Met Ala Leu He 
210 215 220 

Gly 
225 



(2) INFORMATION FOR SEQ ID NO: 492: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 



Met His Thr Leu Ser His Asn Lys He Lys Ser Gin Leu Arg Met Thr 

15 10 15 

Leu Pro Glu Tyr Met He Pro Val Asn Phe Met His He Glu Gin He 

20 25 30 

Pro He Thr He Asn Gly Lys Leu Asp Lys Lys Ala Leu Pro He Met 
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35 40 45 

Asp Tyr Val Asp Thr Asp Ala Tyr Val Ala Pro Ser Thr Asp Thr Glu 

50 55 60 

His Leu Leu Cys Gin lie Cys A * a A ^P He Leu His Val Asn Gin Val 
65 70 75 80 

Gly He His Asp Asn Phe Phe Glu Leu Gly Gly His Ser Leu Lys Ala 

85 90 95 

Thr Leu Val Val Asn Arg He Glu Ala Ser Thr Gly Lys Arg Leu Gin 

100 105 110 

He Gly Asp Leu Leu Xaa Lys Pro Thr Val Phe Glu Leu Ala Arg Ala 

115 120 125 

He Ala Lys Val Gin Glu Gin Asn Tyr Glu Val He Pro Arg Xaa Tyr 

130 135 140 

Lys Xaa Lys Val Asp Tyr Val Ala Trp Ser Ser Cys Pro Lys Ser Val 
145 150 155 160 

Leu Tyr Leu Leu Trp Lys Ser Xaa Pro He Lys He Arg Leu Tyr Asn 

165 170 175 

Arg Thr Phe Phe Met Ala Val He He Arg Thr 
180 185 

(2) INFORMATION FOR SEQ ID NO: 493: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 



Met Lys Asn Leu Asn Ser Phe Asn Tyr Val Arg Lys Gly Leu Glu Tyr 

15 10 15 

Glu Glu Lys Arg Gin Glu Glu Glu Leu Leu Asn Gly Gly Glu He Gly 

20 25 30 

Gin Glu Thr Arg Arg Phe Asp Glu Ser Thr Gly Lys Thr He Leu Met 

35 40 45 

Arg Val Lys Glu Gly Ser Asp Asp Tyr Arg Tyr Phe Pro Glu Pro Asp 

50 55 60 

He Val Pro Leu Tyr He Asp Asp Ala Trp Lys Glu Arg Val Arg Gin 
65 70 75 80 

Thr He Pro Glu Leu Pro Asp Glu Arg Lys Ala Lys Tyr Val Asn Glu 
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85 90 
Leu Gly Leu Leu His Thr Met Xaa Xaa Tyr 
100 105 



95 



(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 : 

Met Val Lys Ala Lys Leu Trp Gly Phe Leu Val Gly Gin He Met Lys 

15 10 15 

Ala Ser Lys Gly Gin Ala Asn Pro Gin Leu Val Asn Gin Leu Leu Lys 

20 25 30 

Gin Glu Leu Asp Lys Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 495: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:495: 

Met His Glu Asn Tyr Asp Val Leu He Ala Ala Gly Gly Asp Gly Thr 

1 5 10 15 

Leu Asn Glu Val Val Asn Gly He Ala Glu Lys Pro Asn Arg Pro Lys 

20 25 30 

Leu Gly Val He Pro Met Gly Thr Val Asn Asp Phe Gly Arg Ala Leu 

35 40 45 

His He Pro Asn Asp He Met Gly Ala Leu Asp Val He He Glu Gly 
50 55 60 
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His Ser Thr Lys Val Asp lie Gly Lys Met Asn Asn Arg Tyr Phe lie 
65 70 75 80 

Asn Leu Ala Ala Gly Gly Gin Leu Thr Gin Val Ser Tyr Glu Thr Pro 

85 90 95 

Ser Lys Leu Lys Ser He Val Gly Pro Phe Ala Tyr Tyr He Lys Gly 

100 105 110 

Phe Glu Met Leu Pro Gin Met Lys Ala Val Asp Leu Arg He Glu Tyr 

115 120 125 

Asp Gly Asn Val Xaa Gin Gly Glu Ala Leu Leu Phe Phe Xaa Gly Leu 

130 135 140 

Thr Asn Pro Met Ala Gly Phe Glu Lys Leu Val Pro Gly Arg 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

Met Asp Xaa Gly Asn Pro Val Xaa Xaa He Tyr Phe Xaa Glu Pro Asp 

15 10 15 

Leu His Phe Thr He He Asp Phe Asn Gin Glu Leu Leu Cys He Tyr 

20 25 30 

He Asp Phe Asp Ser Gly Leu Arg His Ser Asn Met Ala Thr Glu Ser 

35 40 45 

Gly He Ser Leu Arg He Asn He Ala Lys Ser Asp Phe Thr Lys Phe 

50 55 60 

He Asn Glu Leu Ala Ser Leu Tyr 
65 70 

(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 



Met Gin He Glu Leu Thr Asp Ala Ala Val Thr Trp Phe Lys Asn Glu 

15 10 15 

Leu Glu Leu Pro Glu Asn Asn Lys Val Leu Val Phe Phe Val Arg Tyr 

20 25 30 

Gly Gly Glu Phe Gin Leu Lys Gin Gly Phe Ser Pro Ala Phe Thr Val 

35 40 45 

Glu Pro Lys Glu Asp Val Asp He Gly Tyr Glu Gin Gin Tyr Asp Asp 

50 55 60 

Leu Asn Val Val Val Ala Glu Lys Asp Leu Trp Tyr Phe Glu Asp Asp 
65 70 75 80 

His He He Val Asn Val Xaa Asp His Glu Asp Glu He Ser Tyr Ser 
85 90 95 

Thr Lys 



(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:498: 

Met' Thr Lys He Ser Lys He He Asp Glu Leu Asn Asn Gin Gin Ala 

1 5 10 15 

Asp Ala Ala Trp He Thr Thr Pro Leu Asn Val Tyr Tyr Phe Thr Gly 

20 25 30 

Tyr Arg Ser Glu Pro Met Lys Asp Tyr Leu His Tyr 
35 40 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 38 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 

Met Val lie Thr lie Glu Pro Gly lie Tyx Val Pro Gly Val Ala Gly 

15 10 15 

Val Arg lie Glu Asp Asp lie Leu Val Thr Asn Glu Gly Tyr Glu Val 

20 25 30 

Leu Thr His Tyr Glu Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

Met Leu Arg Arg Xaa Gly Asn Pro Gly lie Ser Thr Cys Ala lie Val 

1 5 10 15 

lie Lys Glu Asp Lys Gin His Xaa Thr Tyr Thr His Glu Leu Gly Glu 

20 25 30 

Met Asp Asn His Thr Ala Glu Trp Ala Ala Cys He Tyr Ala Leu Glu 

35 40 45 

His Ala Arg Glu Leu Asn Val Gin Asn Ala Leu Leu Tyr Thr Asp Ser 

50 55 60 

Lys Leu He Ala Arg 
65 

(2) INFORMATION FOR SEQ ID NO: 501 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 



Met His Pro Arg Thr He Xaa He Asp Val He Ser Lys Leu His Gin 

1 5 10 15 

Asn Lys Asp He Gin Trp His Gin He Lys Ser Ser He He Glu Thr 

20 25 30 

Leu Gin Pro Tyr Leu Phe Glu Lys Thr Ala Arg Lys Pro Met He Leu 

35 40 45 

Pro Val He Met Lys Val Asn Glu Gin Lys Glu Ser Asn Asn Lys 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 



Met Ala Arg Leu Arg Thr He Xaa Gly Val Met Ser His Asp Leu Gin 

15 10 15 

Leu Pro He Gly He Leu Pro Gly Gly Thr Phe Asn Asp Phe Thr Xaa 

20 25 30 

Thr Leu Asn He Ala Pro Asn His Lys Gin Ala Ser Glu Gin Met He 

35 40 45 

Ser Ala Gin Val Gly Thr Tyr Asp Val He Lys He Asn Asn Gin Tyr 

50 55 60 

Ala Leu Asn Phe Val Gly Leu Gly Leu He Val Gin Asn Ala Glu Asn 
65 70 75 80 

Val Gin Asp Gly Ser Lys Asp He Phe Gly Lys Leu Ser Tyr He Gly 

85 90 95 

Ser Thr Val Lys Thr Leu Leu Asn Pro Thr Gin Phe Asn Tyr Gin Leu 

100 105 , 110 

Ser He Asp Asp Lys Thr Tyr Ser Gly Glu Thr Thr Met He Leu Thr 
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115 120 125 

Ala Asn Gly Pro Phe lie Gly Gly Ser Arg Xaa Pro Leu Thr Asp Leu 

130 135 140 

Ser Pro Gin Asp Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

Met Lys Phe Asn Glu lie Trp lie Asn Glu Tyr Leu Ala Leu Val Asn 

15 10 15 

Asp Asp Asn Pro lie His Asn Glu lie Val Pro Gly Gin Leu Val Ser 

20 25 30 

Gin Met Met Leu Met Ala Met Ser Leu Glu Thr Asn Gin Cys Gin lie 

35 40 45 

Asn Tyr Val Lys Pro lie Leu lie Asn Glu Asn lie Glu Phe lie Glu 

50 55 60 

Gin His Glu His Glu He He Ala He Asn Asp Asp Gly Glu He Lys 
65 70 75 80 

He Lys He Ser Leu Ser Thr Lys Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 
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Met Phe Gin Gin Ser Tyr Asn His Gly Ala Gly Cys Thr Phe Ala Ala 

1 5 10 15 

Ala Thr Pro Ala Tyr Leu Ala Asn Gly Lys Ser Pro Lys Glu Ala Val 

20 25 30 

lie Ser Thr Ser Glu Ser Phe Arg Arg Phe Cys Tyr 
35 40 



(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

Met His Ser Leu His Glu Leu Val Lys Leu Phe Thr Val His Gin Arg 

1 5 10 15 

He Phe He Glu Gin He Arg Glu Gly Ala His Leu Thr Ala Gly Lys 

20 25 30 

Phe Met Gin Gly Glu Phe Pro Val Met Met Phe Gly Leu Pro Gly Ala 

35 40 45 

Ala Leu Ala He Tyr His Thr Ala Asn Pro Glu Asn Lys Lys Val Val 

50 55 60 

Ala Gly Leu Met Gly Ser Ala Ala Leu Thr Ser Xaa Leu Xaa Gly He 
65 70 75 80 

Thr Glu Pro Leu Glu Phe Ser Phe Leu Phe Val Ala Pro Leu Leu Phe 

85 90 95 

Phe He His Ala Val Leu Asp Gly Leu Ser Phe Leu Thr Leu Tyr Leu 

100 105 HO 

Leu Asp Val His Leu Gly Tyr Thr Phe Ser Gly Gly Phe He Asp Tyr 

115 120 125 

Val Leu Leu Gly Val Leu Pro Asn Lys Thr Gin Trp Trp Leu Val He 

130 135 140 

Pro Val Gly Leu Val Tyr Ala Val He Tyr Tyr Phe Val Phe Arg Phe 
145 150 155 160 

Leu He Val Lys Leu Lys Tyr Lys Thr Pro Gly Arg Glu Asp Lys Gin 

165 170 175 

Ser Gin Ala Val Thr Ala Ser Ala Thr Glu Leu Pro Tyr Ala Val Leu 
180 185 190 
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Glu Ala Met Gly Gly Lys Ala Asn lie Lys His Leu Asp Ala Cys lie 

195 200 205 

Thr Arg Leu Arg Val Glu Val Asn Asp Lys Ser Lys Val Asp Val Pro 

210 215 220 

Gly Val Lys Asp Leu Gly Ala Ser Gly Val Leu Glu Val Gly Asn Asn 
225 230 235 240 

Met Gin Ala lie Cys Gly Ser 
245 



(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 



Met Phe Arg Pro lie Ser Asp Ala Gly Ala Ala lie Phe Glu Leu Xaa 

15 10 15 

Asp Asp Gin lie Glu Lys Ala Lys Gin Asn lie Ser Ala Ala Gly Tyr 

20 25 30 

Gin Pro Gly Glu Glu Glu Ser Gin Pro Ser Glu Asn Ser Val Asp Phe 

35 40 45 

Asn Arg Asp Gly Asp Tyr Asp Leu Leu lie lie Gly Ser Gly Gly Ala 

50 55 60 

Ala Phe Ser Ala Ala He Lys Ala Asn Glu Asn Gly Ala Lys Val Ser 
65 70 75 80 

Met Val Glu Arg Gly Thr Val Gly Gly Thr Cys Val Asn He Gly Cys 

85 90 95 

Val Pro Ser Lys Thr Met Leu Arg Ala Gly Glu He Asn Gly Leu Ala 

100 105 110 

Gin Asn Asn Pro Phe Thr Gly Leu Gin Thr Ser Thr Gly Ala Ala Asp 

115 120 125 

Leu Ala Gin Leu Thr Glu Gin Lys Asp Gly Leu Val Ser Gin Met Arg 

130 135 140 

Gin Glu Lys Tyr He Asp Leu He Glu Glu Tyr Gly Phe Asp Leu He 
145 150 155 160 

Arg Gly Glu Ala Ser Phe He Asp Asp Lys Thr lie Gin Val Asn Gly 
165 170 175 

547 



WO 97/30070 PCT/US97/02318 

His Asn He Thr Ser Lys Ser Phe Leu He Ala Thr Gly Ala Ser Pro 

180 185 190 

Ala Val Pro Glu He Pro Gly Met Asn Glu Val Asp Tyr Leu Thr Ser 

195 200 205 

Thr Ser Ala Leu Glu Leu Lys Glu Val Pro Gin Arg Leu Ala Val He 

210 215 220 

Gly Ser Gly Tyr He Ala Ala Glu Leu Gly Gin Met Phe His Asn Leu 
225 230 235 2 40 

Gly Thr Glu Val Thr Leu Met Gin Arg Ser Glu Arg Leu Phe Lys Thr 

245 250 255 

Tyr Asp Pro Glu He Ser Glu Ala He Asp Glu Ser Leu Thr Glu Gin 

260 265 270 

Gly Leu Asn Leu He Thr Gly Val Thr Tyr Gin Lys Val Glu Gin Asn 

275 280 285 

Gly Lys Ser Thr Ser He Tyr He Glu Val Asn Gly Gin Glu Gin Val 

290 295 300 

He Glu Ala Asp Gin Val Leu Val Ala Thr Gly Arg Lys Pro Asn Thr 

310 315 320 

Glu Thr Leu Asn Leu Glu Ser Ala Gly Val Lys Thr Gly Lys Lys Gly 

325 330 335 

Glu Val Leu Thr Asn Glu Tyr Leu Gin Thr Ser Asn Asn Arg He Tyr 

340 345 350 

Ala Ala Gly Asp Val Thr Leu Gly Pro Gin Phe Val Tyr Val Ala Ala 

355 360 365 

Tyr Glu Gly Gly He Val Ala Asn Asn Ala Leu Gly Leu Ala Lys Arg 

370 375 380 

Lys He Asp Leu Arg Phe Val Pro Gly Val Thr Phe Thr Asn Pro Ser 
385 390 395 400 

He Ala Thr Val Gly Leu Thr Glu Gin Gin Ala Lys Glu Lys Gly Tyr 

405 410 415 

Asp Val Lys Thr Ser Val Leu Pro Leu Asp Ala Val Pro Arg Ala Leu 

420 425 430 

Val Asn His Glu Thr Thr Gly Val Tyr 
435 440 



(2) INFORMATION FOR SEQ ID NO: 507: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 

Met Lys Asn Leu Lys Ser Phe Phe lie Lys Asn Thr Leu Val Leu Thr 

15 10 15 

Ser Thr Ala Leu Leu Phe Ser Ser Phe Glu Pro Val Val His Ala Ala 

20 25 30 

Glu Asn Lys Glu Val Val Lys Thr Val Glu Asn Asp Asn Val Glu Phe 

35 40 45 

Asn Glu Leu Leu Glu Gly Gin Glu Tyr lie Trp Glu Val Leu Ser Arg 

50 55 60 

Asp Asp Glu Gly Phe Lys Met Phe Leu Asp Glu Gin Arg Ala Phe Asn 
65 70 75 80 

Pro Asn Phe Asp Asn Asp Val Leu Leu Ala Lys Gin Glu Gly Ala Leu 

85 90 95 

Asn Ser Ser Val Phe Ala Asn Asn Ser Met Thr Leu Asn Ser Asn Lys 

100 105 110 

Pro Thr Pro Arg Gly Pro lie Gly Ala Thr Leu Lys Ala lie Lys Ala 

115 120 125 

Leu Ser Pro Ser Leu Arg His Gly Gly Asn Cys Tyr lie Leu Asp His 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

Met lie Asn Thr Gly Ser Ala Ala Ser He Val Tyr Gin Gin Gly Ala 

15 10 15 

He Tyr Ala Asn Ala Glu Phe He Gin He His Pro Thr Ala He Pro 

20 25 30 

Gly Asp Asp Lys Leu Arg Leu Met Ser Asp Gin His Xaa Cys 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 509: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

Met Xaa Lys Thr Trp Xaa lie Leu He Cys His He Lys He Gin His 

1 5 10 15 

Glu Leu Asp Val Lys Leu Gly Gly He He Glu He Tyr Xaa Lys Phe 

20 25 30 

Thr Gly Asp Asp Pro Arg Lys Val Pro Met Lys He Phe Pro Ala Val 

35 40 45 

His Tyr Ser Met Gly Gly Leu Tyr Val Asp Tyr Asp Gin Met Thr Asn 

50 55 60 

He Lys Gly Leu Phe Ala Ala Gly Glu Cys Asp Phe Ser Gin His Gly 
65 70 75 80 

Gly Asn Arg Leu Gly Ala Asn Ser Leu Leu Ser Ala He Tyr Gly Gly 

85 90 95 

Thr Val Ala Gly Pro Asn Ala He Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 

Met Thr Pro Asn Val Thr Val Val Arg Glu Asn Glu Lys Leu Leu Glu 

1 5 10 15 

Thr Asp Xaa Lys He Val Glu Leu Met Lys Arg Tyr Glu Asp He Asp 

20 25 30 

Met Glu Asp Thr Gin Thr Trp Ser Asn Gin Ala Val Phe Phe Thr Arg 
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35 40 45 

Gin Leu Trp Asn Met Leu Val Leu Ala Arg Val lie Thr He Gly Ala 

50 55 60 

Tyr Asn Arg Asn Glu Ser Arg Gly Ala His Tyr Lys Pro Glu Phe Leu 
65 70 75 80 

He Ser He Phe Val Arg Ser Thr 
85 

(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

Met Gly Leu Thr Asp Leu Arg Lys His He He Tyr Glu Asp Val Trp 

1 5 10 15 

Thr Pro Glu Asp He Glu Lys Asn Tyr Arg Ser Asn Arg Gly Ala He 

20 25 30 ' 

Tyr Gly Val Val Ala Asp Lys Lys Lys Asn Lys Gly Phe Thr Phe Pro 

35 40 45 

Lys Glu Ser Gin Tyr Phe Glu Asn Leu Tyr Phe Val Gly Gly Ser Xaa 

50 55 60 

Asn Pro Gly Gly Gly Met Pro Met Val Thr Leu Ser Gly Gin Gin Xaa 
65 70 75 80 

Ala Asp Lys Xaa Asn Ala Arg Arg Ser Glu Xaa He Gly Xaa 
85 90 

(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5l2: 

Met Glu Tyr He Asp Ala Ser His He Val He Ala Leu Leu He He 

1 5 10 15 

Leu Val He He Thr Leu Phe He Leu Leu Phe He Glu Leu Val Glu 

20 25 30 

Val Arg He Pro Tyr He Asp Leu Met Asn Val Ser Ala Thr Asn Met 

35 40 45 

Lys Ser Tyr Leu Ser Trp Lys Val Asn Pro Ala Gly Ser He Thr Leu 

50 55 60 

Met Met Ser He Ser Ala Phe Val Phe Leu Lys Ser Gly He His Phe 
65 70 75 80 

He Leu Ser Met Phe Asn Lys Ser He Ser Asp Asp Met Pro Met Leu 

85 90 95 

Thr Phe Asp Ser Pro Val Gly He Ser Val Tyr Leu Val He Gin Met 

100 105 no 

Leu Leu Gly Tyr Phe Leu Ser Arg Phe Leu He Asn Thr Lys Gin Lys 

115 120 125 

Ser Lys Asp Phe Leu Lys Ser Gly Asn Tyr Phe Ser Gly Val Lys Pro 

130 135 140 

Gly Lys Asp Thr Glu Arg Tyr Leu Asn Tyr Gin Ala Arg Arg Val Cys 
145 150 155 160 

Trp Phe Gly Ser Ala Leu Val Thr Val He He Gly He Pro Leu Tyr 

165 170 175 

Phe Thr Leu Phe Val Pro His Leu Ser Thr Glu He Tyr Phe Ser Val 

180 185 190 

Gin Leu He Val Leu Val Tyr He Ser He Asn lie Ala Glu Thr He 

195 200 205 

Arg Thr Tyr Leu Tyr Phe Asp Lys Tyr Lys Pro Phe Leu Asn Gin Tyr 
210 215 220 

Trp 
225 



(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Met Lys Tyr Phe lie Pro Ala Trp Tyr Asp Asp Gin Arg Trp Trp Gin 

15 10 15 

Asp Thr Thr Val Pro Tyr Tyr Gin Leu Gin Asn Lys Thr Glu Phe Asp 

20 25 30 

Asp Met lie Ser Leu Met Gly Met His Leu Glu Asn Asp Leu Asp Tyr 

35 40 45 

Gin Leu lie Val Leu Asn His Ala Pro Asn Leu Arg Thr Phe Leu His 

50 55 60 

Arg Tyr Asp Leu Tyr Glu Thr Lys Tyr Ser Ser Val Phe Asp Glu lie 
65 70 75 80 

Gin Gly Phe Ser His His Ala Pro Gin Ala He Asn Tyr His His Leu 

B5 90 95 

Lys Trp Xaa Asp Asp Val Gly Val Trp Val His Ala Arg He Tyr 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 

Met He Ala Gin Gin He His Glu Asp Asp His Val He Val Ala Ser 

15 10 15 

Asp Ala Arg His Asn Arg Gin He Ala Asn His He Pro Ala Lys Leu 

20 25 30 

Leu Ser Tyr Ser Phe Phe Lys Asn Arg Asn Glu Xaa Val Ser Asp Glu 

35 40 45 

Glu Tyr Gin Ser He Val Lys Asn Ala His Leu He Val Asp Ser Val 

50 55 60 

Gin Leu Glu Arg Asp Leu He Ser His Gin Glu Lys Tyr Gin Arg Glu 
65 70 75 80 

Asn Thr Met He Arg He Thr Pro Phe Glu Thr Arg Gin Ser Pro Asn 

85 90 95 

He Ser Ser Gin Leu Met Glu Thr Phe He Gly Val Trp He Asp Gly 
100 105 110 
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Met Ser Asp Ala Asp Leu Gin Gin Met Met Gin Arg Leu Val Asp Tyr 

115 120 125 

He Ala Gin Glu Asp Tyr Tyr Arg Leu He Leu Leu Ser Arg His Gin 

130 135 140 

Asn Asp He Pro Met Trp Leu Arg Glu Cys He Thr Ser Val Asn Glu 
I 45 150 155 160 

Glu Tyr Gin Ala Lys Gin Asn Ala Asp Val Asn Val Ser Ala Leu Met 

165 170 175 

Thr Pro Glu Asp Gin Asp Asp He lie Ala Val Lys Thr He His Ala 

180 185 190 

Glu His Asp Val Val Glu Ala Leu Arg Thr Leu Arg Leu Val He Asp 

195 200 205 

Met Ser Lys Glu Pro Asp Leu Tyr Leu Gin He Ser Ala He Ser Ala 

210 215 220 

Gly He Pro Gin He Asn Gly Gin Gin Thr Asp Tyr Val Ser Asp Tyr 
225 230 235 240 

Asp Asn Gly Arg He He Asn Thr Ala Gly 
245 250 

(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 63 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

Met Leu Asp Ser Pro Leu Glu He Pro Ala Lys Pro Phe Asp Ala Tyr 

1 5 10 15 

He Ser Val Thr Val Lys Ala Arg Gly Asn Gly Thr Val His Leu Gly 

20 25 30 

Pro Lys His Thr Arg Trp Ser Arg Xaa Xaa Xaa Xaa Gin Phe Leu Xaa 

35 40 45 

Gly Gly Asn Arg Phe Glu Asp Xaa Xaa Arg Xaa Glu Phe Asn Tyr 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 516: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

Met Xaa Pro Pro Leu Asn Val Asn Phe Ser Gly Tyr Arg Xaa Pro Glu 

1 5 10 15 

Gly Phe Glu Gly Tyr Tyr Met Met Xaa Arg Met Asn Ala Pro Phe Leu 

20 25 30 

Leu lie Ala Asp Pro Arg Val Xaa Gly Gly Ser Phe Tyr lie Gly Ser 

35 40 45 

Ser Glu Tyr Glu Gin Arg Asp Tyr Gin Cys Tyr 
50 55 



(2) INFORMATION FOR SEQ ID NO: 517: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 



Met lie Glu Xaa Leu Asp Ala Arg Val Asn Val He He Leu Glu His 

15 10 15 

Leu Val Gly Pro He Asp Phe Lys Gin Asp He Leu Ala Val Lys Val 

20 25 30 

Leu Ala Gin Leu Phe Ser Lys He Lys Pro Asp Val He His Leu His 

35 40 45 

Ser Ser Lys Ala Gly Thr Val Gly Arg He Ala Lys Phe He Ser Lys 

50 55 60 

Ser Lys Asp Thr Arg Val Val Phe Thr Ala His Gly Trp Ala Phe Thr 
65 70 75 80 

Glu Gly Val Lys Pro Ala Lys Lys Phe Leu Tyr Leu Val He Glu Lys 

85 90 95 

Leu Met Ser Cys He Thr Asp Ser He He Cys Val Ser Asp Phe Asp 
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100 105 HO 

Lys Gin Leu Ala Leu Lys Tyr Arg Phe Asn Arg Leu Lys Leu Thr Thr 

115 120 125 

He His Asn Gly He Ala Asp Val Pro Ala Val Xaa Gin Thr Leu Lys 

130 135 140 

Ser Gin Ser His Asn Asn He Gly Glu Val Val Gly Xaa Leu Pro Asn 
145 150 155 160 

Lys Gin Asp Leu Gin He Asn Ala Pro Thr Lys His Gin Phe Val Met 

165 170 175 

He Ala Arg Phe Ala Tyr Pro Lys Leu Pro Gin Asn Leu He Ala Ala 

180 185 190 

He Glu He Leu Lys Leu His Asn Ser Asn His Ala His Phe Thr Phe 

195 200 205 

He Gly Asp Gly Pro Thr Leu Asn Asp Cys Gin Pro Pro Ser Cys Thr 

210 215 220 

Ser Trp Val Lys Lys Met Met Ser His Phe Trp Ala Met Ser Leu Met 
225 230 235 240 

Arg Val He Tyr Tyr His Asn Thr He Arg Leu Phe 
245 250 

(2) INFORMATION FOR SEQ ID NO: 518: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 



Met Pro Thr Gly Leu Pro Val He 

1 5 
Glu Leu Val Ala Asp Asn Gly He 
20 

Thr He Ala Lys Val Leu Glu Lys 

35 40 
Lys Met Ser Asn Gin Ser Arg Lys 

50 55 
Glu Lys Met He Lys Glu Val Glu 
65 70 
Gin 



Ala Ser His Val Gly Gly He Ser 

10 15 
Cys Met Met Asn Thr Gin Pro Glu 
25 30 
Tyr Leu He Asp Ser Asp Tyr He 
45 

Arg Tyr Leu Glu Cys Phe Thr Glu 
60 

Asp Val Tyr Asn Gly Lys Ser Thr 
75 80 
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(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 

Met Lys Arg Leu Phe Asp Val Val Ser Ser lie Tyr Gly Leu Val Val 

15 10 15 

Leu Ser Pro He Leu Leu He Thr Ala Leu Leu He Lys Met Glu Ser 

20 25 30 

Pro Gly Pro Ala He Phe Lys Gin Lys Arg Pro Thr He Asn Asn Glu 

35 40 45 

Leu Phe Asn He Tyr Lys Phe Arg Ser Met Lys He Asp Thr Pro Asn 

50 55 60 

Val Ala Thr Asp Leu Met Asp Ser Thr Ser Tyr He Thr Lys Thr Gly 
65 70 75 80 

Lys Val He Arg Lys Thr Ser He Asp Glu Leu Pro Gin Leu Leu Asn 

85 90 95 

Val Leu Lys Gly Glu Met Ser He Val Gly Pro Arg Pro Ala Leu Tyr 

100 105 110 

Asn Gin Gly Leu Arg He Asn Arg Lys Thr Tyr Lys Ser Glu Arg Ala 
115 120 125 

Tyr Glu 
130 

(2) INFORMATION FOR SEQ ID NO: 520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

Met Gly Phe lie Asn Pro Asn Xaa Gin Pro Gly Met Thr Val Ala Val 

1 5 10 is 

Met Gly Cys Gly Ser lie Gly Leu Leu Ala He Gin Trp Ala Arg He 

20 25 30 

Phe Gly Ala Ala His He He Ala He Asp He Asp Ala His Lys Leu 

35 40 45 

Asp He Ala Thr Ser Leu Gly Ala His Gin Thr He Asn Ser Lys Glu 

50 55 60 

Glu Asn Leu Glu Lys Phe He Glu Asn His Tyr Ala Asn Gin lie Asp 
65 70 75 80 

Leu Ala He Glu Ser Ser Gly Ala Lys Val Thr lie Gly Gin He Leu 

85 90 95 

Thr Leu Pro Lys Lys Gly Gly Glu Val Val Leu Leu Gly He Pro Tyr 

100 105 no 

Asp Asp He Glu He Asp Arg Val His Phe Glu Lys He Leu Arg Asn 

115 120 125 

Glu Leu Thr Val Cys Gly Ser Trp Asn Cys Leu Ser Ser Asn Phe Pro 

130 135 140 

Gly Lys Glu Trp Thr Ala Thr Leu His Tyr Met Lys Thr Lys Asp He 
145 150 155 160 

Asn Val Lys Pro He He Ser His Phe Leu Pro Leu Arg Lys Arg Pro 

165 170 175 

Gly Asp He Leu He Asn 
180 

(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

Met He Lys Arg Asp Thr Pro Met He Pro Leu His Gin Thr Glu Glu 

1 5 10 15 

Glu Glu Phe Tyr Thr Phe He Gly Gin Phe Tyr Ser Leu Asn Gin His 
20 25 30 
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lie Leu Pro Lys Glu Val His Val Pro Arg Asn Leu Asp Lys Glu Met 

35 40 45 

He Gin Ser Val Val Asp Thr Lys He Val Gin Pro Ala Arg Gly Pro 

50 55 60 

Lys Lys Asp Met Val Asp Leu Ala Ala His Asn Ala Lys Val Ser Leu 
65 70 -75 60 

Asn Asn Lys Phe Glu Leu He Ser Arg Asp Glu Ser Arg Thr He Lys 

85 90 95 

Ala He Glu Glu Leu Gly Thr Gin Met Gly lie Gin Thr Pro He Arg 

100 105 110 

He Glu Ala Phe Asp Asn Ser Asn He Gin Gly Val Asp Pro Val Ser 

115 120 125 

Ala Met Val Thr Phe Val Asp Gly Lys Pro Asp Lys Lys Asn Tyr Arg 

130 135 140 

Lys Tyr Lys He Lys Thr Val Lys Gly Pro Asp Asp Tyr Lys Ser Met 
145 150 155 160 

Arg Glu Val Val Arg Arg Arg Tyr Ser Arg Val Leu Asn Glu Gly Leu 

165 170 175 

Pro Leu Pro Asp Leu He He Val Asp Gly Gly Lys Gly His Met Asn 

180 185 190 

Gly Val He Asp Val Leu Gin Asn Glu Leu Gly Leu Asp He Pro Val 

195 200 205 

Ala Gly Leu Gin Lys Asn Asp Lys His Gin Thr Ser Glu Leu Leu Leu 

210 215 220 

Trp Arg 
225 

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met Tyr Arg Glu Arg Trp Ser His Asp Lys Glu Val Arg Thr Thr His 

1 5 10 15 . 

Gly Val Asn Cys Thr Gly Ser Cys Ser Trp Lys Val Phe Val Lys Asn 
20 25 30 
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Gly Val He Thr Trp Glu Asn Gin Gin Thr Asp Tyr Pro Ser Cys Gly 

35 40 45 

Pro Asp Met Pro Glu Tyr Glu Pro Arg Gly Cys Pro Arg Gly Ala Ser 

50 55 60 

Phe Ser Trp Tyr Glu Tyr Ser Pro Leu Arg He Lys Tyr Pro Tyr He 
65 70 75 80 

Arg Gly Lys Leu Trp Asp Leu Trp Thr Glu Ala Leu Glu Glu Asn Tyr 

85 90 95 

Gly Asn Arg Val Ala Ala Trp Ala Ser He Val Glu Asn Glu Asp Lys 

100 105 no 

Ala Lys Gin Tyr Lys Gin Ala Arg Gly Met Gly Gly His Val Arg Ser 

115 120 125 

Asn Trp Lys Asp Val Thr Glu He He Ala Ala Gin Leu Leu Tyr Thr 

130 135 140 

He Lys Lys Tyr Gly Pro Asp Arg He Ala Gly Phe Thr Pro He Pro 
145 150 155 160 

Ala Met Ser Met He Ser Tyr Ala Ala Gly Ala Arg Phe He Asn Leu 

165 170 175 

Leu Gly Gly Glu Met Leu Ser Phe Tyr Asp Trp Tyr Ala Asp Leu Pro 

180 185 190 

Pro Ala Ser Pro Gin He Trp Gly Glu Gin Thr Asp Val Pro Glu Ser 

195 200 205 

Ser Asp Trp Tyr Asn Ala Ser Tyr He He Met Trp Gly Ser Asn Val 

210 215 220 

Pro Leu Thr Arg Thr Pro Asp Ala His Phe Met Thr Glu Val Arg Tyr 
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 523: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

Met Gin Thr Gly Leu Xaa Xaa Arg Ala Val Ala Lys Ser Xaa Thr Phe 

15 10 15 

Ser Thr Gly His Xaa Xaa Asp Ser Glu Glu Asn Asp Val Asp Val Asn 
20 25 30 
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Xaa Leu Val Asn Gly Gly Thr Leu Ala lie Tyr Met Gly Val Lys Arg 

35 40 45 

Leu Gly Lys He He Ala Gin He Gin Gin Tyr Thr Asp He Asp Tyr 

50 55 60 

Pro He Ala He Val Phe Gin Ala Ser Cys Phe Asn Glu Phe Val Val 
65 70 75 80 

Lys Gly Arg Leu Ser Asn He Val Gly Lys Leu Glu His Tyr Ala He 

85 90 95 

Glu Ala Lys Pro Gly He Cys He He Gly Glu Val Val Gly Tyr Thr 

100 105 no 

Glu Asn Val Ser Thr Thr Ser Asn Pro Thr Gin Gin Phe Tyr Val Val 

115 120 125 

Ser Gly Ser Arg His Asp Ala Leu Met Leu Cys Glu His Leu Tyr Asp 

130 135 140 

Glu Gly Tyr Gly Cys Leu Leu Asn Pro Asn Asp Thr Ser Asn Gly Thr 
145 150 155 160 

Tyr His Ser Gly Arg Gin Tyr Asp Tyr Tyr Asp Ala Phe He Lys Gin 

165 170 175 

Gin Glu Asn Val Thr Tyr He Ser Thr Asp Arg Ala Asp Ala Asn Thr 

180 185 190 

Val Leu Cys His 
195 



(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

Met Ser Val Asp Lys Tyr Gly Lys Val Tyr Leu He Gly Ala Gly Pro 

1 5 10 15 

Gly Asn Pro Asn Tyr Leu Thr Lys Lys Ala Glu Arg Leu He Arg Glu 

20 25 30 

Ala Asp Val He Leu Tyr Asp Arg Leu Val Asn Pro Leu He Leu Gin 

35 40 45 

Tyr Ala Asn Leu Thr Thr Glu He He Asp Val Gly Lys Lys Pro Tyr 
50 55 60 
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Ala Lys His He Gin Gin Glu Lys He Asn Glu Cys He Val Glu Ala . 
65 70 75 80 

Ala Arg Arg Tyr Asn Lys Val Gly Glu Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO: 52 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

Met Tip Cys Phe Asp Ala Leu Tyr Arg Glu Thr Gly He Tyr Ala Glu 

15 10 15 

Arg Thr Ala Pro Trp Leu Arg Arg Leu Gly Phe Glu Asn Val Lys Glu 

20 25 30 

Val Leu Leu Asp Pro Glu Arg Gin Asn Glu Leu Phe Glu Arg He Met 

35 40 45 

Asp Ala Lys Lys Ala Val Glu Ala Glu Pro Trp Glu Ala He Xaa Ser 

50 55 60 

Asn Ala Gin Ala Arg Lys He Phe Glu Val Glu Lys Val 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 526 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

Met Asn Asp Lys Thr Ser Asn Asp Leu Tyr Gly Lys He Lys His Cys 

15 10 15 

Asn Glu Phe He Asn His Ser Asn Asp Ser Asn Leu Ser Ser Ser His 
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20 25 30 

Asp Val Asp Glu Ser Ser Thr Lys Gin Lys His lie Lys Asn Lys Thr 

35 40 . 45 

Thr lie Asp His Asn Asp Asp Leu Phe Lys His Val Lys Asp lie Leu 

50 55 60 

Arg Lys Gin Gly Gin lie 
65 70 

{2} INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

Met Asn Tyr Val Arg Ser Ser Asn Lys Ser His Gly Asn Gin Asn Gin 

15 10 15 

He Glu Gly Ala Lys Ser Glu Gly Lys Lys Val Val Val He Glu Asp 

20 25 30 

Leu He Ser Thr Gly Gly Ser Ser Val Thr Ala Val Glu Ala Leu Lys 

35 40 45 

Gin Ala Gly Ala Glu Val Leu Gly Val Val Ala He Phe Thr Tyr Gly 

50 55 60 

Leu Lys Lys Ala Asp Asp Thr Phe Ser Asn He Gin Leu Pro Phe Tyr 
65 70 75 80 

Thr Leu Ser Asp Tyr Asn Glu Leu He Glu Val Ala Glu Asn Glu Gly 

85 90 95 

Lys He Ser Ser Glu Asp He Gin Thr Leu Val Glu Trp Arg Asp Asn 
100 105 110 

Leu Ala 



(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

Met Lys Ala Cys Arg Val Leu Val Asn Thr Pro Ser Ala Val Gly Gly 

15 10 15 

He Gly Asp Met Tyr Asn Glu Leu He Pro Ser Leu Thr Leu Gly Cys 

20 25 30 

Gly Ser Tyr Gly Arg Asn Ser He Ser His Asn Val Ser Ala Thr Asp 

35 40 45 

Leu Leu Asn He Lys Thr He Ala Lys Arg Arg Asn Asn Thr Gin He 

50 55 60 

Phe Lys Val Pro Ala Gin He Tyr Phe Glu Glu Asn Ala He Met Ser 
65 70 75 80 

Leu Thr Thr Met Asp Lys He Glu Lys Val Met He Val Cys Asp Pro 

85 90 95 

Gly Met Val Glu Phe Gly Tyr Thr Lys Thr Val Glu Asn Val Leu Arg 

100 105 110 

Gin Arg Thr Glu Gin Pro Gin He Lys He Phe Ser Glu Val Glu Pro 

115 120 125 

Asn Pro Ser Thr Asn Thr Val Tyr Lys Gly Leu Glu Met Met Val Asp 

130 135 140 

Phe Gin Pro Asp Thr He He Ala Leu Gly Gly Gly Ser Ala Met Asp 
145 150 155 160 

Ala Ala Lys Ala Met Trp Met Phe Phe Glu His Pro Glu Thr Ser Phe 

165 170 175 

Phe Gly Ala Lys Gin Lys Phe Leu Asp He Gly Lys Arg Thr Tyr Lys 

180 185 190 

He Gly Met Pro Glu Asn Ala Thr Phe He Cys He Pro Thr Thr Ser 

195 200 205 

Gly Thr Gly Ser Glu Val Thr Pro Phe Ala Gly He Thr Asp Ser Glu 

210 215 220 

Thr Asn Val Lys Tyr Pro Xaa Ala Asp Phe Gly Leu Thr Thr Asp Val 
225 230 235 240 

Gly He He Asp Leu Asn Leu Val Met Ser Val Pro Xaa Ser Val Thr 

245 250 255 

Ala Gly Thr Gly Met Val Val Leu Thr His Ala Met Glu Ser Tyr Val 

260 265 270 

Ser Cys Asn Gly Phe Arg Leu Ser Arg Gly Trp Gly Leu Gin Ala He 
275 280 285 

Asn 
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(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 

Met Xaa Asn Leu Lys Asn Ser Leu Asn Asp Lys Asp Thr Thr Leu Ala 

15 10 15 

Ser Pro Xaa Leu Cys Lys Met Gin Asp Pro Glu Lys Xaa Asn Ala Tyr 

20 25 30 

Asn Glu Ala Val Arg Asn Ala Glu Asn lie Leu Asn Lys Ser Thr Gly 

35 40 45 

Thr Asn Val Pro Lys Asp Gin Val Glu Ala Ala Met Asn Gin Val Asn 

50 55 60 

Thr Thr Lys Ala Ala Leu Asn Gly Thr Gin Asn Leu Glu Lys Ala Lys 
65 70 75 80 

Gin His Ala Asn Thr Ala lie Asp Gly Leu Ser His Leu Thr Asn Ala 

85 90 95 

Gin Lys Glu Ala Leu Lys Gin Leu Val Gin Gin Ser Thr Thr Val Ala 

100 105 HO 

Glu Ala Gin Gly Asn Glu Gin Lys Ala Asn Asn Val Asp Ala Ala Met 

115 120 125 

Asp Lys Leu Arg Gin Ser He Ala Asp Asn Ala Thr Thr Lys Gin Asn 

130 135 140 

Gin Asn Tyr Thr Asp Ala Ser Pro Asn Lys Lys Asp Ala Tyr Asn Asn 
145 150 155 160 

Ala Val Thr Thr Ala Gin Gly He He Asp Gin Thr Thr Asn Pro Ser 

165 170 175 

Leu Asp Pro Thr Val He Asn Gin Ala Ala Gly Gin Val Ser Thr Ser 

180 185 190 

Lys Asn Ala Leu Asn Gly Asn Glu Asn Leu Glu Ala Ala Lys Gin Leu 

195 200 205 

Ala Thr Gin Ser Leu Gly Ser Leu Asp Xaa Leu Asn Asn Ala Gin Lys 

210 215 220 

Gin Ala Val Thr Asn Gin He Asn Gly Ala His Thr Gly Asp Glu Ala 
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225 230 235 240 

Asn Gin lie Lys Gin Asn Ala Gin Asn Leu Asn Thr Ala Met Gly Asn 
245 250 255 

Leu 



(2) INFORMATION FOR SEQ ID NO: 530: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

Met Lys His Leu Leu Gly Thr Lys Ser Gly Leu Leu Ala Thr Pro Asn 

1 5 10 15 

Glu Asp Glu Lys Pro Glu Glu lie Thr Trp Arg Glu Glu Thr Thr Gly 

20 25 30 

Lys Leu Asp Leu Val Val Ser Leu Asp Phe Arg Met Thr Ala Thr Pro 

35 40 45 

Leu Tyr Ser Asp lie Val Leu Pro Ala Ala Thr Trp Tyr Glu Lys His 

50 55 60 

Asp Leu Ser Ser Thr Asp Met His Pro Tyr Val His Pro Phe Asn Pro 
65 70 75 80 

Ala lie Asp Pro Leu Trp Glu Ser Arg Ser Asp Trp Asp lie Tyr Lys 

85 90 95 

Thr Leu Ala Lys Ala Phe Ser Glu Met Ala Lys Asp Tyr Leu Pro Gly 

100 105 110 

Thr Phe Lys Asp Val Val Thr Thr Pro Leu Ser His Asp Thr Lys Gin 

115 120 125 

Glu He Ser Thr Pro Tyr Gly Val Val Lys Asp Trp Ser Lys Gly Glu 

130 135 140 

He Glu Ala Val Pro Gly Arg Thr Met Pro Asn Phe Ala He Val Glu 
145 150 155 160 

Arg Asp Tyr Thr Lys He Tyr Asp Lys Tyr Val Thr Leu Gly Pro Val 

165 170 175 

Leu Glu Lys Gly Lys Val Gly Ala His Gly Val Ser Phe Gly Val Ser 

180 185 190 

Glu Gin Tyr Glu Glu Leu Lys Ser Met Leu Gly Thr Trp Ser Asp Thr 
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195 200 205 

Asn Asp Asp Ser Val Arg Ala Asn Arg Pro Arg lie Asp Thr Ala Arg 

210 215 220 

Asn Val Ala Asp Ala lie Leu Ser lie Ser Ser Ala Thr Asn Gly Lys 
225 230 235 240 

Leu Ser Gin Lys Ser Tyr Glu Asp Leu Glu Glu Gin Thr Gly Met Pro 

245 250 255 

Leu Lys Asp lie Ser Ser Glu Arg Ala Ala Glu Lys lie Arg Phe 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

Met Ala Ala lie Lys Pro lie Thr Thr Tyr Lys Gly Lys lie Val Pro 

15 10 15 

Leu Phe Asn Asp Asn lie Asp Thr Asp Gin lie lie Pro Lys Val His 

20 25 30 

Leu Lys Arg lie Ser Lys Ser Gly Phe Gly Pro Phe Ala Phe Asp Glu 

35 40 45 

Trp Arg Tyr Leu Pro Asp Gly Ser Asp Asn Pro Asp Phe Asn Pro Asn 

50 55 60 

Lys Pro Gin Tyr Lys Gly Ala Ser lie Leu lie Thr Gly Asp Asn Phe 
65 70 75 80 

Gly Cys Gly Ser Ser Arg Glu His Ala Ala Trp Ala Leu Lys Asp Tyr 

85 90 95 

Gly Xaa His lie lie lie Ala Gly Ser Phe Ser Asp lie Phe Tyr Met 

100 105 110 

Asn Cys Thr Lys Asn Ala Met Leu Pro lie Xaa Leu Glu Lys Ser Ala 

115 120 125 

Arg Glu His Leu Ala Gin Tyr Glu Glu lie Glu lie Asp Leu Pro Asn 

130 135 140 

Gin Thr Val Ser Ser Pro Asp Lys Arg Phe His Phe Glu lie Asp Glu 
145 150 155 160 

Thr Leu Glu Lys 
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(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

Met Trp His Arg Gin Ala Leu Ser Cys Leu Leu Val lie Leu He Leu 

15 10 is 

Met Ser Gly Val Ala He He Leu Arg Glu Gin Xaa Xaa Val Lys Xaa 

20 25 30 

Ser Asn Leu Asn Asn Gin Ser His Leu Ser He Xaa Lys Gly Val Leu 

35 40 45 

Xaa Met Ala Gin Thr Leu Ala Gin Xaa Lys Xaa He Ser Gin Ser His 

50 55 60 

Thr Xaa Asp Val Ser Gin Ser His His Lys Thr Pro Ala Asp Thr Xaa 
65 70 75 80 

Leu Thr Val Leu He Cys Pro His Lys Tyr 
85 90 

(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 

Met Met Ala Arg Arg Lys Val He Arg Val Arg He Lys Gly Lys Leu 

1 5 10 15 

Met Thr Leu Arg Glu Val Ser Glu Lys Tyr His He Ser Pro Glu Leu 
20 25 30 
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Leu Arg Tyr Arg Tyr Lys His Lys Met Arg Gly Asp Glu Leu Leu Cys _ 

35 40 45 

Gly Arg Lys Asp Ser Lys Ser Lys Glu Glu Val Gly lie Tyr Glu Glu 

50 55 60 

Ser Asn Lys Arg 
65 

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

Met lie Cys Xaa Leu Gin Glu lie Gly lie Ser Ser Asn Tyr Xaa Pro 

1 5 10 15 

Thr Gly Cys Xaa Ser Gly lie Tyr Val Phe Ser Glu Ser Xaa Val Pro 

20 25 30 

Gly Thr Asp Ala Leu Gin Ala Xaa Tyr Leu Asp Asp Xaa Val Met Glu 

35 40 45 

Cys Asp Xaa Thr Pro Thr Arg Ala Asp Ala Leu Arg Met lie Gly Thr 

50 55 60 

Ala Tyr Glu Val Ala Ala Leu Tyr Asn Thr Xaa Leu Thr Thr Pro Glu 
65 70 75 80 

Thr Thr Ser Asn Glu Leu Xaa Leu Ser Ala Asn Asp Xaa Leu Thr Val 

85 90 95 

Thr Xaa Xaa Asn Glu Asp Thr Ser Thr lie Leu 
100 105 

(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: 



Met Asp Trp Met Glu Gin Glu Xaa Arg Pro Trp Tyr Tyr Tyr His Asn 

1 5 10 15 

Leu Leu Gin Gin Gin Ala Ala Trp Glu Gly His Arg Val Asn lie He 

20 ' 25 30 

Asp Thr Pro Gly His Val Asp Phe Thr Val Glu Val Glu Arg Ser Leu 

35 40 45 

Arg Val Leu Asp Gly Ala Val Thr Val Leu Asp Ala Gin Ser Gly Val 

50 55 60 

Glu Pro Gin Thr Glu Thr Xaa Trp Arg Gin Ala Thr Thr Tyr Gly Val 
65 70 75 80 

Pro Arg He Val Phe Val Asn Lys Met Asp Lys Leu Gly Ala Asn Phe 

85 90 95 

Glu Tyr Ser Val Ser Thr Leu His Asp Arg Leu Gin Ala Asn Ala Ala 

100 105 no 

Pro He Gin Leu Pro He Gly Ala Glu Asp Glu Phe Glu Ala He He 

115 120 125 

Asp Leu Val Glu Met Lys Cys Phe Lys Tyr Thr Asn Asp Leu Gly Thr 

130 135 140 

Glu He Glu Glu He Glu He Pro Glu Asp His Leu Asp Thr Ser 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 536: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 6: 



Met Glu Lys Tyr Leu Gly Asp Glu Glu He Ser Val Ser Glu Leu Lys 

1 5 10 15 

Glu Ala He Arg Gin Ala Thr Thr Asn Val Glu Phe Tyr Pro Val Leu 

20 25 30 

Cys Gly Thr Ala Phe Lys Asn Lys Gly Val Gin Leu Met Leu Asp Ala 

35 40 45 

Val He Asp Tyr Leu Pro Ser Pro Leu Asp Val Lys Pro He He Gly 
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50 55 60 

His Arg Ala Ser Asn Pro Glu Glu Glu Val lie Ala Lys Ala Asp Asp 
65 70 75 80 

Ser Ala Glu Phe Ala Ala Leu Ala Phe Lys Val Met Thr Asp Pro Tyr 

85 90 95 

Val Gly Lys Leu Thr Phe Phe Arg Val Tyr Ser Gly Thr Met Thr Ser 

100 105 110 

Gly Ser Tyr Val Lys Asn Ser Thr Lys Gly Lys Arg Glu Arg Val Val 
115 120 125 

Val Tyr 
130 

(2) INFORMATION FOR SEQ ID NO: 537: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 

Met Asn Met Lys Thr lie Ala Lys Thr Ser Leu Ala Leu Gly Leu Leu 

1 5 10 15 

Thr Thr Gly Ala lie Thr Val Thr Thr Gin Ser Val Lys Ala Glu Lys 

20 25 30 

lie Gin Ser Thr Lys Val Asp Lys Val Pro Thr Leu Lys Ala Glu Arg 

35 40 45 

Leu Ala Met He Asn He Thr Ala Gly Ala Asn Ser Ala Thr Thr Gin 

50 55 60 

Ala Ala Asn Thr Arg Gin Glu Arg Thr Pro Lys Leu Glu Lys Ala Pro 
65 70 75 80 

Asn Thr Asn Xaa Glu Lys Thr Ser Ala Ser Lys He Glu Lys He Ser 

85 90 95 

Gin Pro Lys Gin Glu Glu Gin Lys Ser Leu Asn He Ser Ala Thr Pro 

100 105 110 

Ala Pro Lys Gin Xaa Gin Ser Gin Thr Ala Thr Glu Ser Thr Thr Pro 

115 120 125 

Lys Thr Lys Val Thr Thr Pro Pro Ser Thr Asn Thr Pro Gin Pro Met 

130 135 140 

Gin Ser Thr Lys Ser Asp Thr Pro Gin Ser Pro Thr He Lys Gin Ala 
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i* 5 150 155 160 

Gin Thr Asp Met Thr Pro Lys Tyr Glu Asp Leu Arg Ala Tyr Tyr Thr 

165 170 175 

Lys Pro Ser Phe Xaa Phe Glu Lys Gin Phe Gly Phe Leu Leu Lys Pro 

180 185 190 

Trp Thr Thr Val Arg Phe Met Xaa Val He Pro Asn Arg Phe He Tyr 

195 200 205 

Lys He Ala Leu Val Arg Lys Arg 
210 215 



(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 8: 



Met Lys Asn Lys Ser Val Ser Leu Val Phe Trp Val Ser Leu Val He 

1 5 10 15 

Cys Thr He Phe Val Ala Phe Gly Ala He Phe Pro Lys Gin Leu Glu 

20 25 30 

Lys Leu Thr Gin Asn He Thr Thr Phe He Ala Leu His Phe Ser Trp 

35 40 45 

Tyr Tyr Leu Leu Leu Val Leu Val He Leu Phe Val Cys Val Tyr He 

50 55 60 

Leu Phe Ser Arg Tyr Ala Asn He Thr Leu Gly Glu Glu Gly Glu Asp 
55 70 75 80 

Pro Glu Phe Ser Leu Pro Ser Trp Phe Ala Met Leu Phe Ser Ala Gly 

85 90 95 

Met Gly He Gly Leu Val Phe Trp Thr Thr Ala Glu Pro He Ser His 

100 105 no 

Ala Phe Lys Leu Thr Pro He His Lys Ala Gly Thr Gin Ser Ala He 

115 120 125 

Asn Asp Ala Met Gin Phe Ser Phe Phe His Trp Gly He His Ala Trp 

130 135 140 

Ala Val Tyr Gly He Val Ala Leu Val Phe Ala Tyr Phe Ser Phe His 
i45 150 155 160 

Lys Gly Tyr Pro Gly Leu Val Ser Ala Thr Leu Thr Pro Leu Leu Gly 
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165 170 175 

Glu Lys Ala Met Arg Gly Pro Leu Gly Gly Ala He Asp Val Leu Ala 

180 185 190 

Val He Ala Thr Val Thr Gly Val Ala Ala His 
195 200 

(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:539: 

Met Val Leu Val Arg Asn He Asn Leu Ala Leu Arg Ala Gin Tyr Leu 

1 5 10 15 

Xaa Xaa Ser Xaa Val Asp Tyr Phe Val Tyr Xaa Gly Asp He Val Leu 

20 25 30 

Thr Asp Xaa He Thr Gly Arg Xaa Leu Pro Xaa Thr Lys Leu Gin Ala 

35 40 45 

Gly Leu His Xaa Ala He Glu Ala Lys Glu Gly Met Glu Val Ser Thr 

50 55 60 

Asp Lys Ser Val Met Pro Thr Asn Tyr Pro Ser Arg He Tyr Leu Asn 
65 70 75 80 

Phe Leu Asn Gin Phe Ser Gly Met Thr Ser Tyr Arg Lys He Arg Arg 
85 90 95 

He Arg Val Leu 
100 

(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 540: 

Met Asp lie Pro Asn Asn Leu Leu He Ala Gin Asn Val Pro Lys Glu 

1 5 10 15 

Ala Gin Met He Ala Glu Ala Gly Gin He Gly Ser Met Thr Val Ala 

20 25 30 

Thr Ser Met Ala Gly Arg Gly Thr Asp He Lys Leu Gly Glu Gly Val 

35 40 45 

Glu Ala Leu Ala Gly Leu Ala Val He He His Glu His Met Glu Asn 

50 55 60 

Ser Arg Val Asp Arg Gin Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp 
65 70 75 80 

Pro Gly Ser Ser Cys He Tyr He Ser Leu Asp Asp Tyr Leu Xaa Lys 

85 90 95 

Arg Trp Ser Asp Ser Asn Leu Ala Glu Asn Asn Gin Leu Tyr Ser Xaa 

100 105 no 

Asp Ala Gin Arg Leu Ser Gin Ser Asn Leu Phe Asn Arg Lys Val Lys 

115 120 125 

Gin He Val Val Lys Ala Gin Arg He Ser Glu Arg Thr Arg Gly 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541; 

Met Arg Ala Leu Leu He Phe Tyr Met Tyr Phe Ala Val Thr Asp Asn 

15 10 15 

Gly Leu Gly He Asp Lys Thr Thr Ala Met Ser He Met Ser Val Tyr 

20 25 30 

Gly Ser Leu He Tyr Met Thr Ser Asn Thr Arg Arg Asn Gly Leu Leu 

35 40 45 

Thr Glu Leu Gin Ala Leu Glu Ala Leu Leu Tyr 
50 55 



(2) INFORMATION FOR SEQ ID NO: 542: 

574 



WO 97/30070 



PCT/US97/02318 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 



Met Phe Xaa He He He Gly Ser 

1 5 
Asn He Gly Gly Arg Leu Tyr Pro 
20 

Gly Cys Val He Phe Tyr Met Ser 

35 40 
Pro He He Val Gin His Cys Val 

50 55 
Phe Leu He Ala Ala Val Gly Met 
65 70 
Leu Xaa Xaa Arg Lys Asn Leu Gly 
85 



Gly Leu Met Lys Pro Asn He Pro 

10 is 
Glu Asn Asp Arg Arg Met Asp Ala 
25 30 
Val Asn Met Gly Ala Leu Leu Ser 
45 

Asn Val Lys Asn Phe His Gly Gly 
60 

Ala Leu Gly Leu Val Trp Tyr Val 

75 80 
Ser Gly Gly He Glu Thr Xaa 
90 95 



(2) INFORMATION FOR SEQ ID NO: 543: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

Met Leu Phe Ala Xaa He Phe Glu Glu Xaa He Gly Lys Leu Glu Arg 

1 5 10 15 

Tyr Phe Asn Asp Val Pro Xaa Ala Val Ala His Val Lys Val Lys Thr 

20 25 30 

Tyr Ser Xaa Ser Ala Thr Lys He Glu Val Thr He Pro Leu Lys Asn 

35 40 45 

Val Thr Leu Arg Ala Glu Glu Arg Asn Asp Asp Leu Tyr Ala Gly He 
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50 55 60 

Asp Leu He Asn Asn Lys Leu Glu Arg Gin Val Arg Lys Tyr Lys Thr 
65 70 75 80 

Arg He Asn Arg Lys Ser Arg Asp Arg Gly Asp Gin Glu Val Phe Val 

85 90 95 

Ala Glu Leu Gin Glu Met Gin Glu Thr Gin Val Asp Asn Asp Ala Tyr 

100 105 110 

Asp Asp Asn Glu He Glu He He Arg Ser Thr Arg He Pro Gly 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

Met Leu Leu He Met Phe Thr Asn He Val Leu Glu Leu Gly Gly Lys 

15 10 15 

Asn Pro Asn He He Phe Asp Asp Ala Asp Xaa Glu Leu Ala Val Asp 

20 25 30 

Gin Ala Leu Asn Gly Gly Tyr Xaa His Ala Xaa Gin Val Cys Ser Ala 

35 40 45 

Gly Ser Arg He Leu Val Gin Asn Ser He Lys Asp Lys Phe Glu Gin 

50 55 60 

Ala Leu He Asp Arg Val Lys Lys He Lys Leu Gly Asn Gly Phe Asp 
65 70 75 80 

Ala Asp Thr Glu Met Gly Pro Val He Ser Thr Glu His Arg Asn Lys 

85 90 95 

lie Glu Ser Tyr Met Asp Val Ala Lys Ala Glu Gly Ala Thr He Ala 

100 105 110 

Val Gly Gly Lys Arg Pro Asp Arg Asp Asp Leu Lys Asp Gly Leu Phe 

115 120 125 

Phe Glu Pro Thr Val He Thr Asn Cys Asp Thr Ser Met Arg lie Val 

130 135 140 

Gin Glu Glu Val Phe Gly Pro Val Val Thr Val Glu Gly Phe Glu Thr 
145 150 155 160 

Glu Gin Glu Ala He Gin Leu Ala Asn Asp Ser He Tyr Gly Leu Ala 
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165 170 175 

Gly Ala Val Phe Ser Lys Asp He Gly Lys Ala Gin Arg Val Ala Asn 

180 185 190 

Lys Leu Lys Leu Gly Thr Val Trp lie Asn Asp Phe His Pro Tyr Phe 

195 200 205 

Ala Gin Ala Pro Trp Val Asp Thr Asn Asn Gin Val Ser Val Glu Asn 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 : 

Met He Ala Asp Gly Val Asn Asp Ala Pro Ala Leu Ala Ala Ser Xaa 

15 10 15 

Val Gly Xaa Ala Arg Leu Gly Xaa Gly Thr Asp Thr Xaa Xaa Glu Thr 

20 25 30 

Ala Asp He Xaa 
35 

(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

Met Asn Leu Glu Lys Gly Asn He Glu Arg Lys Lys His Gly Val His 

15 10 15 

Val Asn Glu Tyr Leu Gin Ser Val Ser Asn Pro Asn Val Tyr Ala Ala 
20 25 30 
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Gly Asp Ala Ala Ala Thr Asp Gly Leu Pro Leu Thr Pro Val Ala Ser 

35 40 45 

Ala Asp Ser His Val Val Ala Ser Asn Leu Leu Lys Gly Asn Ser Lys 

50 55 60 

Lys lie Glu Tyr Pro Val He Pro Ser Ala Val Phe Thr Val Pro Lys 
65 70 75 80 

Met Ala Ser Val Gly Met Ser Glu Glu Glu Ala Lys Asn Ser Gly Arg 

85 90 95 

Asn He Lys Val Lys Gin Lys Asn He Ser Asp Trp Phe Thr Tyr 
100 105 HO 



(2) INFORMATION FOR SEQ ID NO: 547: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 



Met Thr Arg Val Val Leu Ala Ala 

1 5 
Phe Gly Gly Ala Phe Lys Asp Val 
20 

Leu He Glu His He He Lys Glu 

35 40 
Asp Glu Val He He Gly Asn Val 

50 55 
Pro Ala Arg He Ala Ala Met Lys 
65 70 
Ala Phe Thr Val Asn Lys Val Cys 
85 

He Ser He Ser He Tyr Xaa 
100 



Ala Tyr Arg Thr Pro He Gly Val 

10 15 
Pro Ala Tyr Asp Leu Gly Ala Thr 
25 30 
Thr Gly Leu Asn Pro Ser Glu He 
45 

Leu Gin Ala Gly Gin Gly Gin Asn 
60 

Gly Gly Leu Pro Glu Thr Ala Pro 

75 80 
Gly Leu Trp Val Lys Val Xaa Ser 
90 95 



(2) INFORMATION FOR SEQ ID NO: 548: 



(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 

Met Gly Arg Ser Ala Ala Ala Lys Ala Lys Gin Gly Ala Leu Phe His 

15 10 15 

Glu Arg Pro Asp Asp Val Ala Ala Lys Val Leu Gin Gly Val Leu Lys 

20 25 30 

Arg lie Asp Gly Lys Phe Asn Lys Xaa Met lie Glu Asp Val lie Val 

35 40 45 

Gly Thr Ala Phe Pro Glu Gly Leu Gin Gly Gin Asn lie Ala Arg Thr 

50 55 60 

lie Ala Leu Arg Ala Gly Leu Ser Asp Thr Val Pro Gly Gin Thr Val 
65 70 75 80 

Asn Arg Tyr Cys Ser Ser Gly Leu Gin Thr lie Ala lie Ala Ala Asn 

85 90 95 

Gin lie Met Ala Gly Gin Gly Asp lie Leu Val Ala Gly Gly Val Glu 

100 105 110 

Leu Met Ser Ala Val Pro Met Gly Xaa Asn Glu Pro Thr Asn Asn Pro 

115 120 125 

Thr Leu Gin Tyr Asp Asp lie Gly Ala Ser Tyr Pro Met Gly Leu Thr 

130 135 140 

Ala Glu Asn Val Ala Ser Gin Phe Asp Val Ser Arg Ser Lys Met Pro 
145 150 155 160 

Arg Cys Leu Cys Cys Gin Lys Ser Ser Thr Cys Leu 
165 170 

(2) INFORMATION FOR SEQ ID NO: 549: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 



Met Gly Ala Lys Leu Thr Val Asn Ala Asp lie Thr Lys Leu Asn Gly 
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15 10 15 

Leu Gin Ser Ala lie Asp Lys Phe Asn Asp Glu Asp Leu Glu Pro Met 

20 25 30 

Val Leu Phe Val Asn Pro Leu Asp Ala Gly Lys Leu Arg Ser Glu Met 

35 40 45 

His Gin Leu Thr Leu His Val Gin Pro Asn 
50 55 



(2) INFORMATION FOR SEQ ID NO: 550: 



<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

Met Tyr Lys Val lie Glu Arg Phe Glu Asp Ala Gin Asp Asn Gly His 

15 10 15 

Glu Tyr Gin Val Gly Asp lie Tyr Pro Arg Asp Gly Leu Glu Val Ser 

20 25 30 

Glu Glu Arg Phe Thr Glu Leu Ser Thr Thr Asn Asn Arg Arg Asn Leu 

35 40 45 

lie Ala lie Lys Leu Val Glu Asp Asp Thr Thr Glu Gin Ser Glu Ala 

50 55 60 

Ser Ala Asp Glu Gin Lys Ser Leu Ser Asp Met Lys Val Ala Glu Leu 
65 70 75 80 

Xaa Glu Leu Ala Lys Lys Arg Glu lie Lys Gly Tyr Ser Asp Met Lys 

85 90 95 

Lys Asp Glu Leu lie Lys Ala Leu Glu Gly Val Lys Leu Trp Thr Gin 

100 105 110 

Lys Thr Ser Lys 
115 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

580 



WO 97/30070 



PCT/US97/02318 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

Met Gly Lys Ser His Ala Cys Tyr Gin Gly Val Thr His Ala Cys Thr 

1 5 10 15 

Asn Arg He Ala Phe Val Asp Ala Asp Val Thr Phe Leu Arg Lys Asp 

20 25 30 

Ala Val Glu Thr Leu He Asn Gin Tyr Gin Leu Gin Gly Glu Lys Gly 

35 40 45 

Leu Leu Ser Val Gin Pro Tyr His He Thr Lys Arg Phe Tyr Glu Gly 

50 55 60 

Phe Ser Ala He Phe Asn Leu Met Thr Ser Arg Trp Tyr Glu Cys He 
65 70 75 80 

Xaa Tyr Leu Arg Arg Arg Cys Gly Leu Thr Ser Met His Leu Asp Arg 



(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

Met Arg Leu Phe He Asn Leu Lys Gin He He lie Arg Lys He Ala 

15 10 15 

Gly Lys Ser Lys Ser Gly He Gin Met Val Tyr Gin His Leu Met Asn 

20 25 30 

Gly Val Ser Phe Met Xaa Pro Phe He Gly Val Gly Gly Leu Leu He 

35 40 45 

Xaa He Ala Leu Thr Leu Gly Gly Glu Thr Thr Ser Lys Gly Leu Val 

50 55 60 

He Pro Asp Asp Ser Leu Trp Glu He His 
65 70 



85 



90 



95 



(2) INFORMATION FOR SEQ ID 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 



Met Lys He Val Ala He Thr Ser Cys Pro Asn Gly He Ala His Thr 

15 io is 

Tyr Met Ala Gin Glu Lys Leu Glu Gin Val Ala Lys Glu Met Gly Val 

20 25 30 

Asp He Lys Val Glu Thr Gin Gly Gly Val Gly Ala Glu Asn Val Leu 

35 40 45 

Thr Thr Gin Asp He Glu Glu Ala Asp Gly Val He He Ala Ala Asp 

50 55 60 

Lys Gin Val Asp Leu Ser Arg Phe Val Gly Lys Arg Leu He Asn Glu 
65 70 75 80 

Asn Val Arg Ser 



(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554: 



Met Asn Lys Leu Lys Leu Val Gly Ala Asp Val Trp Ala Ser He Glu 

1 5 io 15 

Ser Ala Phe Ala Pro Val Met Glu Glu Leu He Lys Lys Leu Ser Val 

20 25 30 

Ala Val Asp Trp Phe Ser Ser Leu Ser Asp Gly Phe Lys Arg Ser He 

35 40 45 

Val He Phe Gly Gly He Ala Ala Ala He Gly Pro Val Val Phe Gly 
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50 55 60 

lie Arg Cys lie His Lys His Asn Trp Gin Arg Ser Asn Cys Phe Ser 
65 70 75 80 

Ser He He Ser 



(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 272 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 : 

Met Leu Tyr Val He Asn He Leu Asp Ala Tyr Arg Asn Ala Glu Arg 

15 10 15 

Phe Asn Arg Asn Glu Glu He Lys Asp Arg Gin Ala Arg Met Val Ala 

20 25 30 

Thr Trp Asp Lys Thr Phe Pro Tyr Leu Leu He Ser Pro Gly Thr Phe 

35 40 45 

Leu Leu He Phe Val Val Val Phe Pro Leu He Phe Met Phe Gly Val 

50 55 60 

Ala Phe Thr Asn Tyr Asn Leu Tyr Asn Ala Pro Pro Arg His Thr Leu 
65 70 75 80 

Glu Trp Val Gly Leu Asp Asn Phe Lys Thr Leu Phe Thr He Gly Val 

85 90 95 

Trp Arg Lys Thr Phe Phe Ser Val He Thr Trp Thr Leu Val Trp Thr 

100 105 110 

Leu Val Ala Thr Thr Leu Gin He Ala Leu Gly Leu Phe Leu Ala He 

115 120 125 

He Val Asn His Pro Val Val Lys Gly Lys Lys Phe He Arg Thr Val 

130 135 140 

Leu He Leu Pro Trp Ala Val Pro Ser Phe Val Thr He Leu He Phe 
145 150 155 160 

Val Ala Leu Phe Asn Asp Glu Phe Gly Ala He Asn Asn Asp He Leu 

165 170 175 

Gin Pro Leu Leu Gly Val Ala Pro Ala Trp Leu Ser Asp Pro Phe Trp 

180 185 190 

Ala Lys Val Ala Leu He Gly He Gin Val Trp Leu Gly Phe Pro Phe 
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195 



200 



205 



Val Phe Ala Leu Phe Thr Gly Val Leu Gin Ser He Ser Ser Asp Trp 

210 215 220 

Tyr Glu Ala Ala Asp Met Asp Gly Ala Ser Ser Trp Gin Lys Phe Arg 
225 230 235 240 

Asn He Thr Phe Pro His Val He Tyr Ala Thr Ala Pro Leu Leu He 

245 250 255 

Met Gin Tyr Ala Gly Tyr Phe Asn Xaa Xaa Xaa Leu He Tyr Leu He 



(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 556: 

Met Leu Leu He Pro Glu Pro Trp Leu Tyr Asn Glu Ala Asn Asp Ala 

1 5 10 15 

Asn Val Arg Ala Phe Tyr Glu Phe Tyr Ser Tyr Leu Met Glu Pro Trp 

20 25 30 

Asp Gly Pro Thr Met He Ser Xaa Cys Asn Gly Asp Lys Leu Gly Ala 

35 40 45 

Leu Thr Asp Arg Asn Gly Leu Arg Pro Gly Arg Tyr Thr He Thr Lys 

50 55 60 

Asp Asn Phe He Val Phe Ser Ser Glu Val Gly Val Val Asp Val Pro 
65 70 75 80 

Glu Ser Asn Val Ala Phe Lys Gly Gin Leu Asn Pro Gly Lys Leu Leu 

85 90 95 

Leu Val Asp Phe Lys Gin Asn Lys Val He Glu Asn Asn Asp Leu Lys 

100 105 110 

Gly Ala He Ala Gly Glu Leu Pro Tyr 



260 



265 



270 



115 



120 



(2) INFORMATION FOR SEQ ID 



NO: 557: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 62 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii), MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

Met Thr Arg Lys Gly Tyr Gly Glu Ser Thr Gly Lys lie lie Leu He 

15 10 15 

Gly Glu His Ala Val Thr Phe Gly Glu Pro Ala He Ala Val Pro Phe 

20 25 30 

Asn Ala Gly Lys He Lys Val Leu lie Glu Ala Leu Glu Ser Gly Asn 

35 40 45 

Tyr Ser Ser He Lys Ser Asp Val Tyr Asp Gly He Val He 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

Met He Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gin Met 

15 10 15 

Val Leu Gin Asp He Asn He Thr He Pro Asp Gly Glu Asn Val Gly 

20 25 30 

He Leu Gly Glu Ser Ala Cys Gly Lys Ser Thr Leu Ala Ser Leu Val 

35 40 45 

Leu Gly Leu Phe Lys Pro Val Lys Gly Glu He Tyr Leu Ser Asp Asn 

50 55 60 

Ala Val Leu Thr He Phe Gin His Pro Leu Thr Ser Phe Asn Pro Asp 
65 70 75 80 

Trp Thr He Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly 

85 90 95 

Leu Thr Asp Asn Thr Ala Gin Asp Gin Leu Leu Leu Gin His Leu Ser 
100 105 110 
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Thr Phe Glu Leu Asn Ala Gin Leu Leu Thr Lys Leu Pro Ser Glu Val 

115 120 125 

Ser Gly Gly Gin Xaa Gin Arg Phe Asn Val Met Arg Ser Leu Leu Ala 

130 135 140 

Gin Pro Arg Val Leu lie Cys Asp Glu lie Thr Ser Asn Leu Asp Val 
145 150 155 160 

He Ala Xaa Gin Asn Val He Asn He Xaa Lys Ala Gin Thr Val Xaa 

165 170 175 

Glu Leu Lys Ser Phe Tyr Arg Tyr Phe Ser 
180 185 

(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 : 

Met Glu Asp Leu Arg Gly Arg Phe Ser Glu Thr Ser Arg Glu Gly Phe 

15 10 15 

Leu Leu Ala Gin Asp He Ser Ser Tyr Ser Leu Thr He Val Ala His 

20 25 30 

Glu Ala Lys Lys Leu Met Pro Glu Gly Gly Ser He Val Ala Thr Thr 

35 40 45 

Tyr Leu Gly Gly Glu Phe Ala Val Gin Asn Tyr Asn Val Met Gly Val 

50 55 60 

Ala Lys Ala Ser Leu Glu Ala Asn Val Lys Tyr Leu Ala Leu Asp Leu 
65 70 75 80 

Gly Pro Asp Asn He Arg Val Asn Ala He Ser Ala Gly Pro He Arg 

85 90 95 

Thr Leu Ser Ala Lys Gly Val Gly Gly Phe Asn Thr He Leu Lys Glu 

100 105 110 

He Glu Glu Arg Ala Pro Leu Lys Arg Asn Val Asp Gin Val Glu Val 

115 120 125 

Gly Lys Thr Ala Ala Tyr Leu Leu Ser Asp Leu Ser Ser Gly Val Thr 

130 135 140 

Gly Glu Asn He His Val Asp Ser Gly Phe His Ala He Asn Xaa He 
145 150 155 160 
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lie Gin Gin Leu Gly Phe Thr Val Tyr His lie Leu Trp Ser Lys Ser 

165 170 175 

Phe Xaa Gly Phe lie Asn Asn Xaa Ala Asp Gly Lys Leu Leu Asp lie 
180 185 190 

Ser Thr 



(2) INFORMATION FOR SEQ ID NO: 560: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: prote: 

(xi) SEQUENCE DESCRIPTION 

Met Lys His Ser Asp He Val Val 

1 5 
Thr Asn Asp Asn Arg Gin Leu Asp 
20 

Gin Gin Ala Asn Lys Glu Ala Asp 

35 40 
Gin Gin Tyr Lys Gly Lys Arg His 

50 55 
Leu Asn Tyr Gin Leu Val Lys Ala 
65 70 
Gly Gin Leu Asn Val Leu Glu Glu 
85 



SEQ ID NO: 560: 

Thr Val His Asp He Asp Gin Tyr 

10 15 
Gin Arg Leu Asn Asp Leu Gin Gly 
25 30 
Lys Gin Arg Leu Ser Gin Gin He 
45 

Gin Leu Asp Asn Asp Val Glu Ser 
60 

Thr Glu Ala Phe Glu Lys Tyr Thr 

75 80 
Arg Xaa Glu Lys Ser He 
90 



(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 

587 



WO 97/30070 



PCT/US97/02318 



Met Glu Leu Leu Glu Asn lie Ser Asn Glu He Ser Glu Ala Xaa Asp 

1 5 10 15 

Thr Tyr Lys Ser Leu Lys Ser Lys His Xaa Arg Thr Gin Cys Cys His 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 562 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 



Met Val Asp Val Thr His Ser Thr 

1 5 
Thr Ala Lys Ala Ala Leu Ala Val 
20 

Val His Pro Asp Pro Ser Val Ala 

35 40 
Asp Leu Asp Xaa Phe Gin Ala Phe 

50 55 
Asp Leu Tyr Asn Ala Lys Lys Leu 
65 70 



Gly Arg Lys Asp He Met Leu Pro 

10 15 
Gly Ala Asp Gly Val Met Ala Glu 
25 30 
Leu Ser Asp Ala Gly Gin Gin Met 
45 

Tyr Asp Glu Leu Lys Pro Leu Ala 
60 

Lys 



(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 : 
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Met Asp Leu Asn Gin lie Xaa Ala Val Val Phe Asp Leu Glu Gly Thr 

15 10 15 

Leu Leu Asp Arg Val Lys Ser Arg Glu Lys Xaa lie Glu Glu Gin Tyr 

20 25 30 

Glu Arg Xaa His Asp Tyr Leu Xaa His Val Gin Leu Ala Asp Phe Lys 

35 40 45 

Lys Ala Phe lie Glu Leu Asp Asp Asp Glu Asp Asn Asp Lys Pro Asp 

50 55 60 

Leu Asp Lys Glu lie lie Lys Arg Phe His Val Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 



Met Thr Val Asn Asp Leu Ala Asp 

1 5 
lie Val Arg Phe Ser Arg Lys Met 
20 

Lys lie Ala lie Ser Arg Tyr Leu 

35 40 
His Leu Glu Leu lie Glu Asn Glu 

50 55 
Met lie Ala Arg Ala Thr Glu Tyr 
65 70 



Val Thr Asn Val Ser Thr Ala Ser 

10 15 
Thr His Gin Gly Phe Gin Glu Leu 
25 30 
Pro Glu Asp lie Ala Thr Asn Pro 
45 

Ser Val Glu Thr Leu Lys Asn Lys 
60 

Gly Cys Asp Leu 
75 



(2) INFORMATION FOR SEQ ID NO: 565: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



{ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

Met Leu Thr Glu Asn Gin Ser Val Asn Ala Lys Gly Lys Tyr Glu Lys 

1 5 10 15 

Met Tyr Arg Leu Tyr Asp Thr Asn Gin Leu His Gin Tyr Tyr Ser Gly 

20 25 30 

Pro Ser Phe Asp Leu Thr Asn Leu Ser Gly Leu Gin Ser Arg Leu Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 : 

Met Asn Arg Leu His Gly Xaa Xaa Val Xaa He Gly Tyr Gly Asp Asn 

15 10 15 

Thr He He Asn Lys Leu Asp Val Glu Xaa Pro Xaa Gly Lys Val Thr 

20 25 30 

Ser He He Gly Pro Asn Gly Cys Gly Lys Ser Thr Leu Leu Lys Ala 

35 40 45 

Leu Ser Arg Leu Leu Ala Ala Xaa Xaa Gly Glu Val Phe Leu Asp Gly 

50 55 60 

Trp Lys He Phe Asn Thr Gin Ser Thr Lys Glu He Ala Lys Lys He 
65 70 75 80 

Xaa He Leu Pro Gin Ser Pro Glu Val Ala Arg Trp Leu Asn Leu Leu 
85 90 95 

Gly Asn 



(2) INFORMATION FOR SEQ ID NO: 567 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

590 



WO 97/30070 



PCT/US97/02318 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 

Met Xaa Pro Asn Pro Pro Gin Glu Leu Leu Gin Gin Leu Gly Phe Ser 

15 10 15 

Ser Glu Phe Thr His Gin Thr Tyr Thr Phe Ser Asn Met Glu Leu Pro 

20 25 30 

Trp Val Ser Phe He Val His Phe Ser Phe Ser He Val He Ala He 

35 40 45 

He Tyr Cys He Leu Val Lys Lys Tyr Ala Tyr lieu Ala He Gly Gin 

50 55 60 

Gly Ala Val Phe Gly He Ala He Trp Val Leu Phe His Leu He He 
65 70 75 80 

Met Pro He Met His Thr Val Pro Ala Val Trp Asp Gin Pro Tyr Gin 

85 90 95 

Glu His Leu Ser Glu Phe Phe Gly His He Val Trp Met Met Thr He 

100 105 110 

Glu Leu Val Arg Gin His Phe Val Tyr Arg Tyr Lys Leu Asn 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 



Met Met Asp Tyr Asn Thr Thr Gly Asp Gly Leu Leu Thr Gly He Gin 

15 10 15 

Leu Ala Ser Val He Lys Met Thr Gly Lys Ser Leu Ser Glu Leu Ala 

20 25 30 

Gly Gin Met Lys Lys Tyr Pro Gin Ser Leu He Asn Val Arg Val Thr 

35 40 45 

Asp Lys Tyr Arg Val Glu Glu Asn Val Asp Val Lys Glu Val Met Thr 
50 55 60 
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Lys Val Glu Val Glu Met Asn Gly Glu Gly Arg lie Leu Val Arg Pro 
65 70 75 80 

Ser Gly Thr Glu Pro Leu Val Arg Val Met Val Glu Ala Ala Thr Asp 

85 90 95 

Glu Asp Ala Glu Arg Phe Ala Gin Gin He Ala Asp Val Val Gin Asp 

100 105 no 

Lys Met Gly Leu Asp Lys 
115 

(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

Met He Gin Gly Ala Glu Asn Val Ala Lys Met Tyr Asp Val Ser Arg 

15 10 15 

Glu Leu Gin Asp Glu Phe Ala Tyr Arg Ser His Gin Leu Thr Ala Glu 

20 25 30 

Asn Val Lys Asn Gly Asn He Ser Gin Glu He Leu Pro He Thr Val 

35 40 45 

Lys Gly Glu He Phe Asn Thr Asp Glu Ser Leu Lys Ser His He Pro 

50 55 60 

Lys Asp Asn Phe Gly Arg Phe Lys Pro Val He Lys Gly Gly Thr Val 
65 70 75 80 

Thr Ala Ala Asn Ser Cys Met Lys Asn Asp Gly Ala Val Leu Leu Leu 

85 90 95 

He Met Glu Lys Asp Met Ala Tyr Glu Leu Gly Phe Glu His Gly Leu 

100 105 , HO 

Leu Phe Lys Glu Trp Cys Tyr Gly Arg Cys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 570 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

Met Pro Ala Tyr Val Val Asp Thr Thr Lys Gin lie lie Lys Ala Leu 

15 10 is 

Ser Gly Asn Lys Val Thr Val Phe Gly Leu Thr Tyr Lys Gly Asp Val 

20 25 30 

Asp Asp lie Arg Glu Ser Pro Ala Phe Asp lie Tyr Glu Leu Leu Asn 

35 40 45 

Xaa Xaa Pro Asp He Xaa Val Cys Ala Tyr Asp Pro His Val Xaa Leu 

50 55 60 

Asp Phe Val Glu His Asp Met Ser His Ala Val Lys Asp Ala Ser Leu 
65 70 75 80 

Val Leu He Leu Ser Asp His Ser Xaa Xaa Lys Asn Leu Ser Asp Ser 

85 90 95 

His Phe Asp Lys Met Lys His Lys Val He Phe Asp Thr Xaa Asn Val 

100 105 HO 

Val Lys Ser Ser Phe Xaa Asp Val Ser Tyr Tyr Asn Tyr Gly Asn He 

115 120 125 

Xaa Asn Phe He Asp Lys 
130 

(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

Met Gly Leu Val He Glu His Val Thr Lys Arg Phe Gly Lys Met Thr 

15 10 15 

Ala Val Asn Asp He Ser Leu Glu Leu Glu Ser Gly Lys Met Leu Gly 

20 25 30 

Phe Leu Gly Arg Asn Gly Ala Gly Lys Thr Thr Thr Phe Arg Met He 
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35 40 45 

Leu Gly Leu Ser Glu Pro Thr. Glu Gly His lie Thr Tyr Asn Gly Lys 

50 55 60 

Lys Leu Asp Lys Thr Met Tyr Asn Arg He Gly Tyr Leu Pro Glu Glu 
65 70 75 80 

Arg Gly Leu His Ala Gly Asn 
85 

(2) INFORMATION FOR SEQ ID NO: 572: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:572: 



Met Lys He Leu Xaa Tyr He Gly Tyr Leu Leu Leu Gly Gly He Val 

1 5 10 15 

Gly Gly He He Gly Gly He Leu Xaa Asn Phe Asp Gly Leu Gly lie 

20 25 30 

Glu Asn Leu Thr Phe Ala Thr Tyr Asn Asn Val Val Val He Ser He 

35 40 45 

Val Ala Thr Met He He He Leu Val Glu Ala He Val Leu Met Asn 

50 55 60 

Gin Xaa Xaa Ala Leu Lys Tyr Lys Arg Leu Val Asp Glu Glu Val His 
65 70 75 80 

He Asp Ala Thr Asp Gin Tyr Glu Leu Leu Ala Asn Arg Tyr Val Leu 

85 90 95 

Asn Gly Ser He Leu Ser Val He Xaa Thr He He Ala Phe Val Val 

100 105 110 

Leu Leu He Phe Val Val Gly Gin Ala Glu Ala Asn Ala Met Leu Phe 

115 120 125 

Phe Leu Leu Pro Phe Phe Xaa Ser Ala Xaa Xaa Asn Thr Gin Phe Thr 

130 135 140 

Leu Phe Asn Arg Lys Phe Asp Asp Arg Met Pro Lys He Ala Asp Lys 
145 150 155 160 

Asn Tyr Thr Glu Lys Arg Leu Glu He Leu Asp Glu Gly Glu Arg His 

165 170 175 

He Arg He Asn Cys He He 
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(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 

Met Asp Asn Ser Asn Lys Ser Thr Asp Tyr Leu Lys Gin Ser Asp Glu 

15 10 15 

Phe Ala Asp Leu Tyr Arg Asp Lys Leu Asn Asp Ala Asn Lys Leu Ser 

20 25 30 

Lys Tyr Asn Phe Thr Gin Lys Tyr Gly Val Ser Pro Asn Asn Tyr Lys 

35 40 45 

Asn Lys Thr He Val Ala Asp Gly Gly Asn Ser Glu Gly Gly Ala Gly 

50 55 60 

Ala Lys Tyr Gin Gly Ala Lys His Pro Asn Glu Lys Val Val Ala Thr 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

Met Arg Gin Arg Val Met He Ala Met Ala Leu He Leu Lys Pro Gin 

15 10 15 

He Leu He Ala Asp Glu Pro Thr Thr Ala Leu Asp Ala Ser Thr Gin 

20 25 30 

Asn Gin Leu Leu Gin Leu Met Lys Ser Leu Tyr Glu Tyr Thr Glu Thr 
35 40 45 
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Ser lie lie Phe lie Thr His Asp Leu Gly Ala Val Tyr Gin Phe Cys 

50 55 60 

Asp Asp Val He Val Met Lys Asp Gly Ser Val Val Glu Ser Gly Thr 
65 70 75 80 

Val Glu Val Phe Leu Asn Arg His Asn He Pro He Gin Asn Ala 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 

Met Leu Ser Xaa Leu Val Lys 

1 5 
Ala Gly Phe Asn Asn He Asp 
20 

Asp Val Thr Asn Thr Xaa Lys 
35 

Thr He Gly Leu Val Leu Ala 

50 55 
Gin Leu Ser Arg Thr Thr Gly 
65 70 
Arg Gly Arg Glu Val Ser Gly 
85 

Glu He Gly Ser Ala Val Ala 
100 

Val Leu Tyr Thr Gly Pro Asn 
115 

Gly Ala Lys Tyr Val Arg Phe 
130 135 
Tyr His Tyr Gin Arg Cys Leu 
145 150 

(2) INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 

5% 



Asp Xaa Lys He He Ala Asn Tyr Gly 

10 15 
He Glu Tyr Xaa Arg Xaa Lys Ser He 

' 25 30 
Ala Ser Xaa His Ala Xaa Ala Asp Leu 
40 45 
Val Ala Arg Arg He Val Glu Gly Asp 
60 

Phe Asp Gly Trp Ala Pro Leu Phe Phe 

75 80 
Lys Thr He Gly He He Gly Leu Gly 

90 95 
Arg Arg Ala Arg Ala Phe Asp Met Asp 

105 HO 
Arg Lys Glu Glu Xaa Glu Arg Glu He 
120 125 
Arg Tyr Thr He Lys Glu Cys Arg Phe 
140 
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(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 



Met Ser Gly Gly lie Ala Xaa Val lie Pro Ser Asp Val Glu Ala Phe 

15 10 15 

Val Glu Asn Asn Gin Leu Asp Thr Leu Ser Phe Thr Lys lie Lys His 

20 25 30 

Gin Glu Glu Lys Ala Phe lie Lys Gin Met Leu Glu Glu His Val Ser 

35 40 45 

His Thr Asn Ser Thr Arg Ala lie His Val Leu Lys His Phe Asp Arg 

50 55 60 

lie Glu Asp Val Val Val Lys Val He Pro Lys Asp Tyr Gin Leu Met 
65 70 75 80 

Xaa Gin Lys He His Leu His Lys Ser Leu His Asp Asn Glu Asp Glu 

85 90 95 

Ala Met Leu Ala Ala Phe Tyr Asp Asp Ser Lys Thr He Asp Ala Lys 

100 105 110 

His Lys Pro Ala Val Val Tyr 
115 



(2) INFORMATION FOR SEQ ID NO: 577: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 577: 



Met Gin Leu Asp Ala Lys Thr Pro Ser Lys He Ser Leu Gin Tyr Lys 

15 10 15 

Thr Ala Val Gly Gin Lys Glu Glu Val Ala Lys Asn Thr Glu Lys Val 

20 25 30 

Val Ser Asn Val Leu Asn Asp Phe Asn Lys Asn Leu Val Glu He Tyr 
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35 40 45 

Leu Thr Ser lie lie Asp Asn Leu His Asn Ala Gin Lys Asn Val Gly 

50 55 60 

Ala lie Met Thr Arg Glu His Gly Val Asn Ser Lys Phe Ser Asn Tyr 
65 70 75 80 

Leu Leu Asn Pro lie Asn Asp Phe Pro Glu Leu Phe Thr Asp Thr Leu 

85 90 95 

Val Asn Ser lie Ser Ala Asn Lys Asp lie Thr Lys Trp Phe Gin Thr 

100 105 no 

Tyr Asn Lys Ser Leu Leu Ser Ala Asn Ser Xaa Thr Phe Arg Val Asn 

115 120 125 

Thr Asp Tyr Xaa Val Ser Thr Leu lie Glu Lys Gin Asn Ser Leu Phe 

130 135 140 

Asp Glu His Asn Thr Ala Lys Asp Lys Met Leu Pro Asp Tyr Lys Ser 
145 150 155 160 

Pro Lys Asp Met Trp Asn Leu lie Thr lie Ser Met His 
165 170 



(2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 



Met Asn His lie Met Glu Ser Val Thr Asn Ala lie Tyr Pro He Gly 

1 5 10 15 

Met Met Leu Leu He Xaa Gly Gly Gly Gly Thr Phe Lys Gin Val Leu 

20 25 30 

Xaa Asp Gly Gly Val Gly Asn Thr He Ala Lys Met Phe Glu Gly Thr 

35 40 45 

Glu Met Ser Pro He Leu Leu Ala Trp Xaa Val Ala Ala Arg Ala Thr 

50 55 60 

Tyr Arg He Arg Phe Gly Tyr Ser Ser Cys Asp Phe Asn Tyr Arg Tyr 
65 70 75 80 

Cys Leu Thr He He Thr He He Arg Cys Lys Cys Cys He Ser Cys 

85 90 95 

Thr Cys Asp Arg Cys Arg 

598 
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(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOIjOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

Met Lys Ala Ala Tyr Lys Asn Lys Gin Val Asp Gly Phe Thr Thr Asn 

1 5 10 15 

Pro Ser lieu Met Pro Lys Ala Ala Val Ser Arg Leu Gin Ser Phe Cys 



(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

Met Asp Arg Ala Lys Pro Met Asp Glu Xaa lie Xaa Asn Gly Glu Asp 

15 10 15 

Tyr Ala Ser Leu lie Xaa Xaa Ala Lys Xaa Lys Gly Leu Ser Asp lie 



20 



25 



30 



20 



25 



30 



Pro 



Asn Thr Lys lie Phe Lys Tyr Arg 
35 40 



(2) INFORMATION FOR SEQ ID NO: 581: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 amino acids 



(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 581: 

Met Met Gin Val Arg His Pro His Val Val Ala His Leu Tyr Asn Lys 

15 10 is 

Lys Asn Lys Ser Tyr lie Asp lie Lys Thr Asp Phe Phe Val Pro Asn 

20 25 30 

Asn Val Gin Val Gin Ser Leu Thr Glu Ala lie Arg Ser Asp lie Lys 

35 40 45 

Lys Asn Val Glu Tyr Phe Thr Glu Met Pro Val Arg Lys Leu Glu Val 

50 55 60 

Asn Val Arg Asp Gin Lys Thr Ser Gly Pro Arg Val Leu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 582 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

Met Glu Ala Asn His Ser Lys Ala Lys Val Pro lie Gly Asn Gin lie 

15 10 15 

Val Thr Val Glu Arg Gly Gin Arg Leu Thr Ser lie Leu Xaa Leu Ser 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

Met Gly Gly Asp Tyr Asp Ala Leu Glii Thr Leu Met Ala Ser Arg Pro 

15 10 15 

Asp lie Leu Asn His Asn lie Glu Thr Val Arg Arg Leu Thr Pro Arg 

20 25 30 

Val Arg Ala Arg Ala Thr Tyr Asp Arg Thr Leu Glu Phe Leu Arg Arg 

35 40 45 

Ser Xaa Glu Leu Gin Pro Asp lie Pro Thr Lys Ser Ser lie Met Val 

50 55 60 

Gly Leu Gly Glu Thr He Glu Glu He Tyr Glu Xaa Met Asp Asp Leu 
65 70 75 80 

Arg Ala Asn Asp Val Asp He Leu Thr He Gly Gin Tyr Leu Gin Pro 

85 90 95 

Ser Arg Lys His Leu Lys Val Xaa Lys Tyr Tyr Thr Pro Leu Glu Phe 

100 105 110 

Gly Lys Leu Arg Lys Val Ala Met Asp Lys Gly Phe Lys Gin Cys Gin 

115 120 125 

Leu Asp Leu 
130 

(2) INFORMATION FOR SEQ ID NO: 584 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

Met He Asn Met Lys He Leu Val Val Cys Gly His Gly Leu Gly Ser 

15 10 15 

Ser Phe Met Val Glu Met Asn Ala Gin Glu Ala Leu Arg Gin Leu Asn 

20 25 30 

Ala Pro Ser Asp He Glu Val Glu His Ser Asp He Met Thr Ala Ser 

35 40 45 

Pro Glu Met Ala Asp Leu Phe He Cys Gly Arg Asp Leu Ala Arg Lys 
50 55 60 
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Cys Arg Thr Pro Lys Gly Cys Leu Ser Xaa 
65 70 

(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 

Met lie Leu Pro He Leu Ala Thr Leu Ser Val Ala Val Gly Val His 

1 5 10 15 

Pro Leu Leu Leu Met Ala Pro Ala Ala Met Ala Ala Asn Cys Ala Tyr 

20 25 30 

Met Leu Pro Val Gly Thr Pro Pro Asn Ala He He Phe Gly Ser Gly 

35 40 45 

Lys He Ser He Lys Gin Met Ala Ser Val Gly Phe Trp Val Asn Leu 

50 55 60 

He Ser Ala He He He He Leu Val Val Tyr Tyr Val Met Pro He 
65 70 75 80 

Val Leu Gly He Asp He Asn Gin Pro Leu Pro Leu Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 

Met Asn Ala His He Tyr Leu Glu Leu Ala Gly Asp Tyr Gin Ala Leu 

1 5 10 15 

Ala He Leu Pro Leu Trp His His Asp Asp Thr Tyr Leu Phe Asp Leu 
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20 

Leu Leu Arg Lys lie Glu Asp Met 

35 40 
Lys Val Lys Gin Thr Gin Leu Leu 

50 55 
Thr Thr Leu 
65 



25 30 
lie Leu Pro Lys Lys Ser Val Ser 
45 

Thr Thr Glu Gly Asn Tyr Lys Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 587: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 

Met Phe lie lie Glu Leu lie Lys Gly lie lie Leu Gly Val Val Glu 

15 10 15 

Gly Xaa Thr Glu Phe Ala Pro Val Ser Ser Thr Gly His Met lie Leu 

20 25 30 

Val Asp Asp Met Trp Leu Lys Ser Ser Glu Phe Leu Gly Ser Gin Ser 

35 40 45 

Ala Phe Thr Phe Lys lie Val lie Gin Leu Gly Ser Val Phe Ala Ala 

50 55 60 

Ala Trp Ala Phe Arg Ala Lys Arg Phe Leu Glu lie Leu His lie Gly 
65 70 75 80 

Lys His Lys His Val Glu Gly Glu Asn Asp Gin Gin Arg Arg Ser Lys 

85 90 95 

Pro Arg Arg Leu Asn Leu Leu His Val Leu Val Gly Met Val Pro Ala 

100 105 110 

Gly lie Leu Gly Leu Leu Phe Asp Asp Phe He Glu Glu His Leu Phe 

115 120 125 

Ser Val Pro Thr Val Met He Gly Leu Phe Val Gly Ala He Tyr Met 

130 135 140 

He He Ala Asp Lys Tyr Ser Val Lys Val Lys Asn Pro Gin Thr Val 
145 150 155 160 

Asp Gin He Asn Tyr Phe Gin Ala Phe Val He Gly He Ser Gin Ala 

165 170 175 

Val Ala Met Trp Pro Gly Phe Ser Arg Ser Gly Ser Thr He Ser Thr 
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160 



185 



190 



Gly Val lie Asn Glu lie Lys Ser 
195 200 



(2) INFORMATION FOR SEQ ID NO: 588: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE 



TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

Met Lys Lys Trp Ala Asn Gly Leu Pro Leu Thr Asp Lys Arg He Ser 

15 10 15 

Pro He Asn Gly Thr He Glu Gly Leu Pro Pro Val Tyr Met Phe Gly 

20 25 30 

Gly Gly Arg Glu Met Thr His Pro Asp Met Lys Leu Phe Glu Gin Met 

35 40 45 

Met Leu Gin His His Gin Tyr He Glu Phe Tyr Asp Tyr Pro Lys Met 

50 55 60 

Val His Asp Phe Pro He Tyr Pro He Arg Gin Ser His Lys Ala He 
65 70 75 80 

Lys Gin He Ala Lys Ser He Asp Glu Asp Val Thr Gin Asn Asn 



(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 



85 



90 



95 



Met Asn Tyr Gin Asn Ala Ser Tyr Leu Ser Ala Phe Thr Ala Gly Leu 
1 5 10 15 
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Gly lie Tyr Phe lie Met Lys Gly Ser Val Lys His I#ys Trp lie Tyr 

20 25 30 

Val Leu Phe Thr lie lie Asp lie Pro lie Val Phe He Pro Gly Arg 

35 40 45 

Arg Gly Ser Ala He Leu Leu He Leu Tyr Gly Phe He Cys He Tyr 

50 55 60 

Thr Tyr Tyr Val 
65 

(2) INFORMATION FOR SEQ ID NO: 590: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

Met Lys Val Ser Ala Lys Asn Val Met Lys Gin Lys His Phe His Lys 

1 5 10 15 

Gly He Glu Gin Leu Val Gin Glu Gly Ala He Gin Tyr Tyr Lys Thr 

20 25 30 

Leu His Thr Asn Gin He He Leu Gly Ala Val Gly Gin Leu Gin Phe 

35 40 45 

Glu Val Phe Glu His Arg Met Lys Asn Glu Tyr Asn Val Asp Val Val 

50 55 60 

Met Glu Pro Val Gly Arg Lys He Ala Arg Trp He Glu Asn Glu Asp 
65 70 75 80 

Gin He Thr Asp Lys Met Asn Thr Ser Arg Ser He Leu Val Lys Asp 

85 90 95 

Arg Tyr Asp Asp Leu Val Phe Leu Phe Glu Asn Glu Phe Ala Thr Arg 

100 105 110 

Trp Phe Glu Glu Lys Phe Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 



Met Ser Asp Tyr Asn Gly Met Asn Lys He Asp Met Met Asn Gin He 

15 10 15 

Lys Val Asp Thr Met Leu His Gly Tyr His Ala Gly Phe Leu Phe Ala 

20 25 30 

Leu Leu He Thr Val Val Ser Phe Phe Leu His Arg He He Gin 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

Met Asn Glu Met Leu Met Ala Asn Tyr Leu Phe Asn Thr Ser Asp Asn 

15 10 15 

Pro Arg Phe Lys Arg Trp Val He Gly Ser lie Leu Ser Arg Thr Tyr 

20 25 30 

Tyr His Asn Met Val Thr His Leu Leu Glu Ala Ala Tyr Gin Arg Glu 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO: 593: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

Met Leu Arg Glu Ala Tyr Asp Gin Arg Trp lie Asp Phe Ala Arg Asn 

15 10 15 

Lys Gly Lys Asp Thr Gly Ala Phe Cys Xaa Ser Pro Tyr Phe Thr His 

20 25 30 

Ser Tyr Val Phe lie Ser Trp Thr Gly Lys Met Ala Glu Ala Phe Val 

35 40 45 

Leu Ala His Glu Leu Gly His Ala Gly His Phe Thr Leu Ala Gin Lys 

50 55 60 

His Gin Pro Tyr Leu Glu Ser Glu Ala Ser Met Tyr Phe Cys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 

Met Asn Gin Asp Leu Tyr Gin Asp Thr Leu Cys Gly Gin Arg Val Gly 

1 5 10 15 

Asn Tyr Lys Asp Tyr Gly Met Tyr Trp He Lys Asn Glu Val Lys Pro 

20 25 30 

Asp Ala He He Glu Phe His I*eu Asp Ser Ala Ser Pro Gin Ala Ser 

35 40 45 

Gly Gly His Val He He Ser Asp Arg Phe Pro Ala Asp Asp He Asp 

50 55 60 

Lys Ala Leu Ser Ser Ala Leu Asp Lys Thr Val Gly Lys lie Arg Gly 
65 70 75 80 

Val Thr Pro Arg Gly Asp Leu Leu Asn Ala Asn Val Ser Ala Asp Leu 

85 90 95 

Asn Leu Asn Tyr Arg Leu He Glu Leu Gly Phe He Thr Ser Thr Lys 

100 105 HO 

Asp Leu Asn Tyr He Xaa Asn Asn Leu Asp Ser Phe Xaa Lys Arg He 
115 120 125 
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Ala Glu Ala lie He Gly Arg Gin lie Asp Ala Pro Ser Ser Asn Pro 

130 135 140 

Ser Ala Asp 
145 

(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 95: 

Met Trp Phe He Arg Pro Phe Tyr Ala Lys Glu Thr Thr Ala Asn Lys 

1 5 10 15 

Leu Arg Ser Ala Val Thr Pro Val Lys Gin Asp Lys Leu Ser Gin Gly 

20 25 30 

Lys Lys He Met Leu Val Ala Gly His Gly He Gly Ala Tyr Ser Thr 

35 40 45 

Asp Pro Gly Ala Val Ala Asn Gly Glu Asn Glu Arg Asp Phe Xaa Arg 

50 55 60 

Lys Asn He He Pro Arg Val 
65 70 

(2) INFORMATION FOR SEQ ID NO: 596; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 : 

Met Met Gin Gin Leu Ser Leu Lys His Arg Leu Asn Asn Gly Asp Ser 

1 5 10 15 

Val Tyr Gly He Phe Asn Ser He Pro Asp Pro Leu Met He Glu Val 
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20 



25 



30 



He Ala Ala Ser Gly Tyr Asp Phe Val Val He Asp Thr Glu His Val 

35 40 45 

Ala He Asn Asp Glu Thr Leu Ala His Leu He Arg Ala Ala Glu Ala 

50 55 60 

Ala His He He Pro He Val Arg Val Thr Ala Val He Asp Arg Asp 
65 70 75 80 

He He Lys Val Leu Asp Met Gly Ala Arg Gly He He Val Pro His 

85 90 95 

Val Lys Asp Arg Glu Thr Val Gly Ala Tyr Trp 



(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597: 

Met Glu Thr Leu Lys Asn Lys Val Asp Gin Val Asn Thr Asp Lys Ala 

15 10 15 

Gin Pro Asn Tyr Thr Glu Ala Ser Thr Asp Lys Lys Glu Ala Val Gly, 

20 25 30 

Ser Ser Val Thr Ser Cys Thr Xaa His Tyr Arg Ser Asn 



(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 



100 



105 



35 



40 



45 
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Met Thr Lys Leu Lys Val Gly Val He Gly Val Gly Gly He Ala Gin 

1 5 10 15 

Asp Arg His He Pro Ala Leu Leu Lys Leu Lys Asp Thr Val Ser Leu 

20 25 30 

Val Ala Val Gin Asp He Asn Thr Val Gin Met He Asp Val Ala Lys 

35 40 45 

Arg Phe Asn He Pro Gin Ala Val Glu Thr Pro Ser Glu Leu Phe Lys 

50 55 60 

Leu Val Asp Ala Val Val He Cys Thr Pro Asn Lys Phe His Ala Asp 
65 70 75 80 

Leu Ser He Glu Ala Leu Asn His Gly Val His Val Leu Cys Glu Lys 

85 90 95 

Pro Met Xaa Met Thr Thr Glu Glu Cys Asp Arg Met He Glu Ala Ala 

100 105 no 

Asn Lys Asn His Lys Leu Leu Thr Val Ala Tyr His Tyr Arg His Thr 

115 120 125 

Asp Val Ala Met Thr Ala Lys Lys Ala He Glu Ala Gly Val Val Gly 

130 135 140 

Lys Pro Leu Val Ala Pro Leu Ser Xaa Arg Cys Val Gly Val Lys Xaa 
145 150 155 160 

Leu Gly Gly Ala Phe Leu Pro Asn Lys Thr Val Ala Arg Trp Arg 
165 170 175 

(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

Met Val Pro His Glu Val Leu Gly Lys Thr Tyr Asn Gin Leu Ser Lys 

15 10 15 

Gin Pro Asn Gin He Asn Asp Trp Gly Thr Phe Asp His Thr Lys Phe 

20 25 30 

Asp Val Asp Asp His Val Thr Ser Tyr Met Thr Phe Ala Asn Arg Ala 

35 40 45 

Ser Met Gin Phe Glu Cys Ser Trp Ser Ala Asn He Lys Glu Asp Lys 
50 55 60 
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Val His Val Ser Leu Ser Gly Glu Asp Gly Gly lie Asn Leu Phe Pro 
65 70 75 80 

Phe Glu lie Tyr Glu Pro Arg Phe Gly Thr He Phe Glu Ser Lys Ala 

85 90 95 

Asn Val Glu His Asn Glu Asp He Ala Gly Glu Arg Gin Ala Arg Asn 

100 105 110 

Phe Val Asn Ala Cys Leu Gly He Glu Glu He Val Val Lys Pro Glu 

115 120 125 

Glu Ala Arg Asn Val Asn Ala Leu He Glu Ala He Tyr Arg Ser Asp 

130 135 140 

Leu Asp Asn Lys Ser He Gin Leu 
145 150 



(2) INFORMATION FOR SEQ ID NO: 600: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 



Met Xaa lie Glu Glu Lys Pro Asp Gly Phe Met Ala Phe Val His Ala 

15 10 15 

Leu Tyr Arg Ala Asp Asp He Val Asp Lys Asp Met Ser Lys Ala Leu 

20 25 30 

Asp Ala Leu Met Ser He Asp Phe 
35 40 



(2) INFORMATION FOR SEQ ID NO: 601: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 
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WO 97/30070 



PCT/US97/02318 



Met Leu Lys Phe Glu Arg Val lie Asp lie Phe Asn Lys Lys Asn Asn 

15 10 15 

Gly Ala Trp Asn Phe Cys Pro Gin Asn Ala Gly Tyr Trp Glu His He 

20 25 30 

Pro Lys Ser He Thr Lys Leu Ser Asp Leu Lys He Val Gly Leu Asp 

35 40 45 

Phe Tyr He Thr Thr Glu Glu Ser Lys Arg Phe Thr Asp Phe Pro Lys 

50 55 60 

Asp Phe Lys Gly He Ala Gly Trp He Leu Glu Val Lys Ser Asn Thr 
65 70 75 80 

Pro Gly Asn Thr Thr Gin Val Leu Arg Arg Asn Asn Phe Pro Ser Ala 

85 90 95 

His Gin Phe Xaa Val Arg Asn Phe Gly Thr Arg Leu Val Ala Leu Val 

100 105 110 

Asn Gly Ser Leu Phe Glu Gly Lys Val Val Glu 
115 120 

(2) INFORMATION FOR SEQ ID NO: 602: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 602: 

Met Val Met Glu Asn Leu Ala Ser Lys Leu Asn His Gin Ser Lys His 

15 10 15 

Arg Leu Asp Gin Leu Asp His Leu Asn Asn Ala Gin Lys Gin Gin Leu 

20 25 30 

Gin Ser Gin He Thr Gin Ser Ser Asp He Ala Ala Val Asn Gly His 

35 40 45 

Lys Gin Thr Ala Glu Ser Leu Asn Thr Ala Met Gly Asn Leu He Asn 

50 55 60 

Ala He Ala Asp His Gin Ala Val Glu Gin Arg Gly Asn Phe He Asn 
65 70 75 80 

Ala Asp Thr Asp Lys Gin Thr Ala Tyr Asn Thr Ala Val Asn Glu Ala 

85 90 95 

Ala Ala Met He Asn Lys Gin Thr Gly Gin Asn Ala Asn Gin Thr Glu 
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100 105 110 

Val Glu Gin Ala lie Thr Lys Val Gin Thr Thr Leu Gin Ala Leu Asn 

115 120 125 

Gly Asp His Asn Leu Gin Val Ala Lys Thr Asn Ala Thr Xaa Ala lie 

130 135 140 

Asp Asp Leu Thr Ser Leu Asn Asp Pro Gin Lys Thr Gly Leu Lys Asp 
145 150 155 160 

Gin Val Thr Gly Gly Thr Leu Xaa Thr Gly Gly Gin Gin lie Gly Thr 

165 170 175 

Lys Trp Xaa Phe Gly 
180 



(2) INFORMATION FOR SEQ ID NO: 603: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 



Met Ser Ala Thr lie His Thr Pro Gly Gly Gly Arg Ser Leu Asn Gly 

1 5 10 15 

Ser His Ala Ser Leu Val Val Ser lie Asn Asp Val Phe Tyr Val Thr 

20 25 30 

Asp Val Gly Phe Gly Asp Leu Pro Leu His Ala lie Pro lie Thr Ser 

35 40 45 

Ser Glu His Thr Gin Pro lie Thr Asp lie Ser Gly Thr Phe Arg Ala 

50 55 60 

lie Phe Asn Asn Glu Asp Lys Asp lie Phe Tyr Val Gin Lys Phe Glu 
65 70 75 80 

Asn Asp His Trp His Thr Lys Tyr Glu Ala Glu Phe Lys Pro Lys Gin 

85 90 95 

lie Glu Asp Phe Asn Ser Asn lie Glu Tyr Asn Gin Thr His Pro Asp 

100 105 110 

Ser He Phe Val Gin His Leu Leu He Thr Met Pro Gin Ser Phe Gly 

115 120 125 

Arg Ala Thr Met Ser Glu Asn His Leu Thr ^eu Thr Arg Asn Gly Ser 

130 135 140 

Ser Glu Lys Phe Asp Val Thr Lys Asp Asn Tyr Lys His Phe Leu Glu 

613 
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145 150 155 160 

Lys Tyr Phe Gly Leu Asn Val Thr lie Asn Arg He Glu Lys Gin 
165 170 175 

(2) INFORMATION FOR SEQ ID NO: 604: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 

Met Pro Lys Phe Leu Lys Ser Phe Phe Asn Glu His Pro Phe He Asp 

1 5 10 15 

Val Ser Leu His Val Lys Asn Glu Asn He Glu Lys Asp He Asn Asn 

20 25 30 

His Thr Tyr Asp He Gly He Ser Arg Asn Gin Pro Lys Leu Arg Glu 

35 40 45 

Val His Ser Glu Lys Val Cys Glu Gly Lys He Val Leu He Ala Pro 

50 55 60 

Asn Lys Glu Asn Asn His Leu Leu Thr Glu Ala Ser Leu Phe Glu Lys 
65 70 75 80 

Tyr Lys He He Ser Asp Asn His Pro Glu Tyr Trp Ser Ser Leu Lys 

85 90 95 

Asn Asn He Leu Asn He Tyr Gly Lys Ala Gin Phe Leu Ser He Asn 

100 105 HO 

Asp Val His Thr Ser He Lys Leu He Glu Met Asn Gin Gly He Ser 

115 120 125 

Phe Leu Pro He Tyr He Asn Asn Arg 
130 135 



(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



614 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

Met lie Leu Lys Tyr Met Asp lie Thr Glu Lys Val Pro Met Ser Glu 

1 5 10 15 

Ser Glu Val Thr Gin Leu Leu Lys Gly Lys Gly He Leu Glu Asn Arg 

20 25 30 

Gly Lys Val Phe Leu Glu Ala Gin Glu Lys Tyr Glu Val Xaa Val He 

35 40 45 

Tyr Leu Val Ser His Ala Leu Val Glu Thr Gly Asn Gly Lys Ser Glu 

50 55 60 

Leu Ala Lys Gly He Lys Asp Gly Lys Lys Arg Tyr Tyr Asn Phe Phe 
65 70 75 80 

Gly He Gly Ala Phe Asp Ser Ser Ala Val Arg Ser Gly Lys Ser Tyr 

85 90 95 

Ala Glu Lys Glu Gin Trp Thr Ser Pro Asp Lys Ala He He Gly Gly 

100 105 HO 

Ala Lys Phe He Arg Asn Glu Tyr Phe Glu Asn Asn Gin Leu Asn Leu 

115 120 125 

Tyr Gin Met Arg Trp Asn Pro Glu Asn Pro Ala Gin His Gin Tyr Ala 

130 135 140 

Ser Asp He Arg Trp Ala Asp Lys He Ala Lys Leu Met Asp Lys Ser 
14 5 150 155 160 

Tyr Lys Gin Phe Gly He Lys Lys Asp Asp He Arg Gin Thr Tyr Tyr 
165 170 175 

Lys 



(2) INFORMATION FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDE0NESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606: 

Met Arg Glu Lys Phe Xaa His Tyr Cys Glu His Glu Val Leu Ser Ser 
1 5 io is 

615 



WO 97/30070 



PCT/US97/0Z318 



lie lie Asn Gly Ser Tyr He He Val Lys Thr Ser Pro Gly Phe Ala 

20 25 30 

Gin Gly He Asn Tyr Phe He Asp Gin Leu Asn He Glu Glu He Leu 

35 40 45 

Gly Thr Val Ser Gly Asn Asp Thr Thr Leu He Leu Thr Ala Ser Asn 

50 55 60 

Asp Met Ala Glu Tyr Val Tyr Ala Lys Leu Phe Lys 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

Met Lys Gly Thr Asp Ser Asp Ala Asn Leu Asp Ala Val Ala He Ala 

15 10 15 

Lys Lys Ala Tyr Ala He Tyr Lys Thr Ala He Val He Thr Gly Lys 

20 25 30 

Glu Asp Val He Val Gin Asp Asn Lys Ala He Val Leu Ala Asn Gly 

35 40 45 

Ser Pro Leu Leu Ala Arg Val Thr Gly Ala Gly Cys Leu Leu Gly Gly 

50 55 60 

Val He Ala Gly Phe Leu Phe Arg Glu Thr Glu Pro Asp He Glu Ala 
65 70 75 80 

Leu He Glu Ala Val Ser Val Phe Asn He Ala Ala Glu Val Ala Ala 

85 90 95 

Glu Asn Glu Asn Cys Gly Gly Pro Gly Asn Val Phe Thr He Val Ala 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 608 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 
<B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

Met Glu Thr Lys Thr Leu Gin Met Pro Lys Asp Lys Ala Asn Ala Phe 

15 10 15 

Val Asn Ser Leu Asn Gly Leu Asn Gin Gin Gin Gin Asp Leu Ala His 

20 25 30 

Lys Ala lie Asn Asn Ala Asp Thr Val Ser Asp Val Thr Asp He Val 

35 40 45 

Asn Asn Gin He Asp Leu Asn Asp Ala Met Glu Thr Leu Lys His Leu 

50 55 60 

Val Asp Asn Glu He Pro Asn Ala Glu Gin Thr Val Asn Tyr Gin Asn 
65 70 75 80 

Ala Asp Asp Asn Ala Lys Thr Asn Phe Asp Asp Ala Lys Arg Leu Ala 

85 90 95 

Asn Thr Leu Leu Asn Ser Asp Asn Thr Asn Val Asn Asp He Asn Gly 

100 105 110 

Ala He Gin Ala Val Asn Asp Ala lie His Asn Leu Asn Gly Asp Gin 

115 120 125 

Arg Leu Gin Asp Ala Lys Asp Lys Ala He Gin Ser lie Asn Gin Ala 

130 135 140 

Leu Ala Asn Lys Leu Lys Glu He Glu Ala Ser Asn Ala Thr Asp Gin 
145 150 155 160 

Asp Lys Leu lie Ala Glu Lys He Lys Gin Glu Glu Leu Ala Asn Gin 

165 170 175 

His His Gin Gin His 
180 



(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 : 



Met Thr Gin Ala Arg Pro Gly Asn His Tyr Met Leu 

617 
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1 5 10 is 

Pro Asn Gly Gin Phe He Asp Arg Leu Leu Val Lys Asn Gly Gly His 

20 25 30 

Gly Thr His Asn Ala Tyr Arg Tyr He Val Gin 
35 40 

(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 : 

Met Thr Glu Pro Asn Gly Gin Ser He Gin Val Asn Leu Asn Asn Ala 

15 10 15 

Gin Gly Asp Leu Gly Tyr Leu Thr Ala Gly Asn Tyr Tyr Ala Thr Arg 

20 25 30 

Val Pro Asp Leu Pro Gly Ser Val Glu Ser Tyr Glu Gly Tyr Leu Ser 

35 40 45 

Val Phe Val Lys Asp Asp Thr Asn Lys Leu Phe Asn Phe Thr Pro Tyr 

50 55 60 

Asn Ser Lys Lys He Tyr Thr Arg Ser He Thr Asn Gly Arg Leu Glu 
65 70 75 80 

Gin Gin Trp Thr Val Pro Asn Glu His Lys Ser Thr Val Leu Phe Asp 

85 90 95 

Gly Gly Ala Asn Gly Val Gly Thr Thr He Asn Leu Thr Glu Pro Tyr 

100 105 no 

Thr Asn Tyr Ser He Leu Leu Val Ser Gly Thr Tyr Pro Gly Gly Val 

115 120 125 

He Glu Gly Phe Gly Leu Thr Ala Leu Pro Asn Ala He Gin Leu Thr 

130 135 140 

Lys Pro Asn Val Val Asp Ser Asp Gly Xaa Gly Gly Gly He Tyr Glu 
145 150 155 160 

Cys Leu Leu Ser Lys Thr Ser Ser Thr His Phe Lys Asn Arg Gin Arg 

165 170 175 

Cys Val Phe Arg Phe Arg Gin Asn He Arg Phe Trp Asn Glu Cys Gin 

180 185 190 

Gin Ser Tyr Tyr Asn 

618 
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195 



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 

Met Met Lys Met lie Tyr Pro Thr Phe Lys Asp lie Lys Thr Phe Tyr 

15 10 15 

Val Trp Gly Tyr Tyr Lys Asn Glu Gin He Lys Trp Tyr Val Asp Lys 

20 25 30 

Gly Leu He Asp Lys Glu Glu Tyr- Ala Leu He Thr Gly Glu Lys Tyr 

35 40 45 

Pro Glu Thr Lys Asp Glu Lys Ser Gin Val 
50 55 



(2) INFORMATION FOR SEQ ID NO: 612: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 

Met Phe Gly Phe Thr Lys Arg His Glu Gin Asp Trp Arg Leu Thr Arg Leu Glu 
Glu Asn Asp Lys Thr Met Phe Glu Lys Phe Asp Arg He Glu Asp Ser Leu Arg 
Ala Gin Glu Lys He Tyr Asp Lys Leu Asp Arg Asn Phe Gly Arg He Lys Ala 
Arg Gin Gly Arg Arg 

(2) INFORMATION FOR SEQ ID NO: 613: 



(i) SEQUENCE CHARACTERISTICS: 

619 
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(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

Met Gly Leu Pro Asn Pro Lys Thr Arg Lys Pro Thr Ala Ser Glu Val 

15 10 15 

Val Glu Trp Ala Lys Ser Asn lie Gly Lys Arg lie Asn He Asp Asn 

20 25 30 

Tyr Arg Gly Ser Gin Cys Trp Asp Thr Pro Asn Phe He Phe Lys Arg 

35 40 45 

Tyr Trp Gly Phe Val Thr Trp Gly Asn Ala Lys Asp Met Ala Asn Tyr 

50 55 60 

Arg Tyr Pro Lys Gly Phe Arg Phe Tyr Arg Tyr Ser Ser Gly Phe Val 
65 70 75 80 

Pro Glu Pro Gly Asp He Ala Val Trp His Pro Gly Asn Gly He Gly 

85 90 95 

Ser Asp Gly His Thr Ala He Val Val Gly Pro Ser Asn Lys Ser Tyr 

100 105 HO 

Phe Tyr Ser Val Asp Gin Asn Trp Val Asn Ser Asn Ser Trp Thr Gly 

115 120 125 

Ser Pro Gly Arg Leu Val Arg His Pro Tyr Val Ser Val Thr Gly Phe 

130 135 140 

Val Arg Pro Pro Tyr Ser Lys Asp Thr Ser Lys Pro Ser Ser Thr Asp 
145 150 155 160 

Thr Ser Ser Ala Ser Lys Ser Gin 
165 

(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

620 
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Met Glu Thr Leu Glu Leu Gin Gly Ala Lys Leu Arg Tyr His Gin Val 

15 10 15 

Gly Gin Gly Pro Val Leu lie Phe lie Pro Gly Ala Asn Gly Thr Gly 

20 25 30 

Asn lie Phe Leu Pro Leu Ala Glu Gin Leu Lys Asp His Phe Thr Val 

35 40 45 

Val Ala Val Asp Arg Arg Asp Tyr Gly Glu Ser Glu Leu Thr Glu Pro 

50 55 60 

Leu Pro Asp Ser Ala Ser Asn Pro Asp Ser Asp Tyr Arg Val Lys Arg 
65 70 75 80 

Asp Ala Gin Asp lie Ala Glu Leu Ala Lys Ser Leu Ser Asp Glu Pro 

85 90 95 

Val Tyr lie Leu Gly Ser Ser Ser Gly Ser lie Val Ala Met His Val 

100 105 110 

Leu Lys Asp Tyr Pro Glu Val Val Lys Lys lie Ala Phe His Glu Pro 

115 120 125 

Pro lie Asn Thr Phe Leu Pro Asp Ser Thr Tyr Trp Lys Asp Lys Asn 

130 135 140 

Asp Asp lie Val His Gin lie Leu Thr Glu Gly Leu Glu Lys Gly Met 
145 150 155 160 

Lys Thr Phe Gly Glu Thr Leu Asn lie Ala Pro He Asp Ala Lys Met 

165 170 175 

Met Ser Gin Pro Ala Asp Thr Glu Glu Gly Arg He Glu Gin Tyr Lys 

180 185 190 

Arg Thr Met Phe Trp Ser Glu Phe Glu He Arg Gin Tyr Thr His Ser 

195 200 205 

Asp He Thr Leu Asp Asp Phe Thr Lys Tyr Ser Asp Lys He Thr Leu 

210 215 220 

Leu Asn Gly Thr Asp Ser Arg Gly Ser Phe Pro Gin Asp Val Asn Phe 
225 230 235 240 

Tyr He Asn Lys Glu Thr Gly He Pro He Val Asp He Pro Gly Gly 

245 250 255 

His Leu Gly Tyr He Gin Lys Pro Glu Gly Phe Ala Asp Val Leu Leu 

260 265 270 

He Cys Gly Cys 
275 



(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

621 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 



Met Gly Arg Ala Gin Asp Gly Phe Leu Asp Gin Asp Lys Tyr Asp He 

1 5 10 is 

He Phe Lys Thr Ala Glu Asn Leu Gly Thr He Pro He Tyr Leu His 

20 25 30 

Pro Ala Pro Val Asn Ser Asp He Tyr Gin Ser Tyr Tyr Lys Gly Asn 

35 40 45 

Tyr Pro Glu Val Thr Ala Ala Thr Phe Ala Cys Phe Gly Tyr Gly Trp 

50 55 60 

His He Asp Val Gly He His Ala He His Leu Val Leu Ser Gly He 
65 70 75 80 

Phe Asp Arg Tyr Pro Lys Leu Asn Met He lie Gly His Trp Gly Glu 

85 90 95 

Phe He Pro Phe Phe Leu Glu Arg Met Asp Glu Ala Leu Phe Ala Glu 

100 105 no 

His Leu Asn His Ser Val Ser Tyr Tyr Phe Lys Asn Ser Phe Tyr He 

115 120 125 

Thr Pro Ser Gly Met Leu Thr Lys Pro Gin Phe Asp Leu Val Lys Lys 

130 135 140 

Glu Val Gly He Asp Arg He Leu Tyr Ala Ala Asp Tyr Pro Tyr He 
145 150 155 160 

Glu Pro Glu Lys Leu Gly Val Phe Leu Asp Glu Leu Gly Leu Thr Asp 

165 170 175 

Glu Glu Lys Glu Lys He Ser Tyr Thr Asn Gly Ala Lys Leu Leu Gly 

180 185 190 

Leu Ser Ser Asn Asn 
195 



(2) INFORMATION FOR SEQ ID NO: 616: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 



Met Thr Glu Gin Ser Val Lys Asn 

1 5 
Lys Pro Lys Gin Lys Thr Val Lys 
20 

Ser Asp Ser Lys Pro Tyr Glu Glu 

35 40 
Asn Leu Asn Val lie Ala Cys Leu 

50 55 
Asn lie Lys Gly Glu Lys Thr Thr 
65 70 
Leu Arg Arg Ser Met Trp Ser Met 
85 

Gin Arg Gin Ser Trp Ser Leu Arg 
100 



Thr Pro Gin His Glu Thr Gin Ser 

10 15 
Leu lie He Lys Arg Gin Asp Thr 
25 30 
Thr Phe Glu He Pro Tyr Arg Glu 
45 

Met Glu He Arg Arg Asn Pro Val 
60 

Pro Val Val Trp Asp He Glu lieu 

75 80 
Phe Tyr Gly Tyr Gin Trp Ala Gly 

90 95 
Leu Val Ser Pro Asn 
105 



(2) INFORMATION FOR SEQ ID NO: 617: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 



Met Pro Glu Ala His Lys Lys Val Asn Ser Asn Ser Phe He Glu Xaa 

15 10 15 

Asp Asp Glu Arg Leu Val Arg Asp Phe Leu His Asp Ala Phe Phe Phe 

20 25 30 

He Asn He Ala Glu He He Leu Phe He Glu Lys Gin 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 618: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

623 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 

Met Pro Xaa Xaa lie Tyr He Asn Met Leu He Asp Glu Glu Asp Ala 

15 10 15 

He Pro Phe Asn Ala Leu Tyr Ala Ser Asp Lys Asp Gly Leu Pro Phe 

20 25 30 

He Glu Lys Trp lie Lys Gin Tyr Gly Ser Gly Ser Leu Asp Gin Ser 
35 40 45 

Asn Phe 
50 

(2) INFORMATION FOR SEQ ID NQ:619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 



Met Ser Val Val Leu Pro His Gly Val Leu Phe Arg Gly Ala Ala Xaa 

1 5 10 is 

Gly Val He Arg Arg Tyr Leu He Glu Xaa Lys Ser Xaa Leu Glu Xaa 

20 25 30 

Val Xaa Gly Leu Pro Xaa Asn He Phe Tyr Gly Xaa Ser He Pro Thr 

35 40 45 

Cys He Leu Val Phe Lys Lys Cys Arg Gin Gin Asp Asp Asn Val Leu 

50 55 60 

Phe He Asp Ala Ser Asn Asp Phe Xaa Lys Gly Lys Asn Gin Asn His 
65 70 75 80 

Leu Ser Asp Ala Gin Val Glu Arg He He Asp Thr Tyr Lys Arg Lys 

85 90 95 

Ala Thr He Asp Lys Tyr Ser Tyr Ser Ala Thr Leu Gin Glu He Ala 

100 105 no 

Asp Asn Asp Tyr Asn Leu Lys Ala Tyr Arg Ala He Cys 
115 120 125 

624 
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(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 

Met Ser Asn Thr Gin Thr Lys Asn Val Pro Glu Leu Arg Phe Pro Gly 

15 10 15 

Phe Glu Gly Glu Trp Glu Glu Lys Lys Leu Gly Asn Leu Thr Thr Lys 

20 25 30 

lie Gly Ser Gly Lys Thr Pro Lys Gly Gly Ser Glu Asn Tyr Thr Asn 

35 40 45 

Lys Gly lie Pro Phe Leu Arg Ser Gin Asn lie Arg Asn Gly Lys Leu 

50 55 60 

Asn Leu Asn Asp Leu Val Tyr lie Ser Lys Asp lie Asp Trp Met Arg 
65 70 75 80 

Leu Lys Asn Ser Arg Thr Tyr Tyr Gly Asp Val Pro Phe Xaa Tyr Tyr 

85 90 95 

Arg Ser lie Asn Xaa 
100 

(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 

Met Arg Met Lys Arg Phe Leu Thr lie Val Gin lie Leu Leu Val Val 

15 10 15 

lie He He He Phe Gly Tyr Lys He Val Gin Thr Tyr He Glu Asp 
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20 25 30 

Lys Gin Glu Arg Ala Xaa Tyr Glu Lys Leu Gin Xaa Lys Phe Xaa Met 

35 40 45 

Leu Met Xaa Lys His Gin Glu His Val Arg Pro Gin Phe Glu Ser Leu 

50 55 60 

Glu Lys lie Asn Lys Asp lie Val Gly Trp He Lys Leu Ser Gly Thr 
65 70 75 80 

Ser Leu Xaa Tyr Pro Val Leu Gin Gly Lys Thr Asn His Asp Tyr Leu 

85 90 95 

Asn Leu Asp Phe Glu Arg Glu His Arg Arg Lys Gly Ser He Phe Met 

100 105 no 

Asp Phe Arg Asn Glu Leu Xaa Asn Leu Asn His Asn Thr He Leu Tyr 

115 120 125 

Gly His His Val Gly Asp Asn Thr Met Phe Asp Val Leu Glu Asp Tyr 

130 135 140 

Leu Lys Gin Ser Phe Tyr Glu Lys His Lys He He Gly Phe Asp Asn 
14 5 150 155 160 

Lys Tyr Gly Lys Tyr Gin Leu Gin Val Phe Ser Ala Tyr 
165 170 

(2) INFORMATION FOR SEQ ID NO: 622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 : 

Met Asp Gly Met Asp Gin Leu Met Lys He Val Lys Lys Asp Phe Thr 

1 5 10 15 

Gly He Ser Phe Leu Asn Leu Val Asp Phe Asp Ala Leu Tyr Gly His 

20 25 30 

Arg Arg Glu Lys Pro Gly Tyr Ala Gin Ala He Lys Asp Phe Asp Asp 

35 40 45 

Xaa Leu Ala Arg Ser Leu Phe Ser Asn Leu Xaa Gly Arg Arg Phe Ser 

50 55 60 

Asn Tyr Tyr Lys Gin Thr Met Val Met Thr Pro Thr Ala Pro Gly Xaa 
65 70 75 80 

Gly Thr Asn Thr Arg Glu Tyr He Pro Val He He Val Lys Ser Arg 
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85 90 95 

Xaa Leu Lys Val Xaa Gin Cys Thr 
100 

(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

Met Asp Val Phe Ser Trp Ser Asn Gly Tyr Glu Lys Arg Tyr Gly Leu 

15 10 15 

Phe Tyr Val Asp Phe Glu Thr Gin Glu Arg Tyr Pro Lys Lys Ser Ala 

20 25 30 

Tyr Trp Tyr Lys Glu Leu Ala Glu Thr Lys Glu He Lys 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 624: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 



TTCGAGCAAT ATGGATTAAA TGTTGAGAAC CCAAAAAACT TTTAAGCAAG AGGAGCGTAT 60 

AAAATGCAAT TTTCATTACT AATATATATA GTCGTAATTT GTGCGGTTAT GTATTTCTTG 120 

ATGATCAGAC CACAACAAAA ACGTGCGAAA CAGCATCGTG AGTTGATTAA TAACATTCAA 180 

TCTGGTCAAA GAATTACAAC TATTGGTGGT ATTAAAGGTA CTGTTAAAGC AGTAGATGAA 240 

ACAACTGTTG TTATTACAGT TAATGGTCAT GGTACTGAAT TAACTTTCGA AAAACCTGCT 300 

ATTAAACAAG TTGGACCCNT CATAACGAGA AGTGGTATTA AAAAGGATGA AACGAATAGC 360 

GTNTCATCCC TCCTTAAGTT ATNAGTAGNA TTGTAGGAAT NGCTGATTTN NATNCCAAAA 420 

AAMGAAATTG NGANACATAA TANNNGNNAA NTTTAAOTCA AATNTATCTA TGACTAGTCT 480 
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TNNNACTCCC AGCTCTAGNA TGTTATTNTG TATTTGGTTA ATAATNTCTC GNCAGNNNNG 540 
ATTTANCTCG CGNTATAAAT TAGNATACAA GTNTTCTAAT ANCATACGAC AA 592 



<2) INFORMATION FOR SEQ ID NO: 625: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 



CGGGCAGNCT 


TTANAATTAT 


TTNAATGATG 


ATTAGCAATG 


CATATACGAT 


TAATTCGATT 


60 


ATGGNTTATA 


TCTNNATTGA 


TTAANTCACT 


TATATTTCAC 


AAGTGTGTTG 


CTTTGGAGCG 


120 


TCAACGATGC 


TATCACATAT 


TACGGCATAT 


GAAATTTTAT 


CTGAAATTCG 


NAAAAAATTA 


180 


GCTCAAAAAT 


TAATGCGCCT 


CCCTTTAGGN 


GTAGTGGAAT 


CTAAGAAAAT 


AGGTGAATTA 


240 


AAAAATATAT 


TTGNCGATAA 


GGTTGAAACA 


ATAGAATTAC 


CTTTAGCACA 


TATGATTCCT 


300 


GAAGTTATNG 


GAAACTTACT 


TGTGGCAGCT 


GCTATCTTCT 


TATACATAAT 


GNTCATNGAT 


360 


TGGCGTATTG 


GATAGGTGNC 


TTANTAGATA 


ACGATACCAA 


TNTCAATTTN 


CGCTTCTAAA 


420 


AAAGTNATGT 


CTTGGATTTT 


AATGAGANAA 


TTTGCTGGAC 


AAANTGAAAT 


CCAATTTNTN 


480 


TTTTTNATTT 


GNCGNAATGG 










500 



(2) INFORMATION FOR SEQ ID NO: 62 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 : 



GAATTCGCGA CATAAGGAAG TCTTCATAAT CCATCGTTAA TCCAAATAAG ATACCTATGG 60 

TAATATACCG GGAAANANGN TAGNATTGGT CCGGNCGTTG CANTACCAAT CAGACCTTNC 120 

ATAAAACCAT CTTGCATTAC TAATGTTGTA AATCCTAGCG TTGCCATTAN TGANAAGACA 180 

NATCCTAAAA CTGCTNTTAN CGGTATNAGA ATTGNTCGGC AGACAATCAN TATTANGAAT 240 

CATGCCATTN CANCACNTCC TGTCGTATAT AATGGTGTCG TCTCAATTTN CTTTTNAGAC 300 

ATATCANTAT TATTTGNCAC TTNGGCCCCG ANNTCTCCNC CNTNGGAGCC CATATNGATC 360 
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TTGGTGCATC TNGATGGANT ATCCTCGCTN ATCCATGCAA CTNAANTCAT TTGGTGCTCT 420 
CTTNCATNTA GGCNCCTGGT TA 442 

(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 



ACCCTGTTGT 


CCCCNGGGGT 


CAATAGACCC 


CGACATACTT 


TNNGGCTTCT 


GAAAATAAAA 


60 


TTTCAGGNNC 


GCACTTTAAA 


GATGGACGTC 


GATGTCGACT 


AATGTGATGA 


CNGCCTTTCA 


120 


TCTTTTTAAA 


AANTCATTAA 


TTTCCCTTCT 


TGTTTAANAC 


GTACATANAA 


GAAATACGCA 


180 


TACGGTACTA 


ATAAAATAGT 


TGTATATGTT 


GCGTGTGTTA 


ATAATAATAC 


ACCGATTAAT 


240 


TCAGGAATGA 


TGTTTAAGAA 


GTAATTTGGG 


TGTTTTGGTA 


ATTTTATATA 


ATCCAGATTT 


300 


AATAATAGGA 


TGGTTAGGTA 


AAATGAATAA 


TTTTAATGTC 


CAAATACCAC 


CTAAAGTTTT 


360 


AATAACCATA 


AATAACATGA 


TATAAGCAAA 


GATTAATATA ACTAAGCCAA 


TACCATTTGC 


420 


AAAGCTAAAT 


GTATCTTTAT 


TAATAAATGC 


CTCTACACCA 


GCCAATACAT 


AAATTAAAAC 


480 


GTGTGTTATT 


GCTAAAAACT 


TCGAATTTTT 


AACGCCATAT 


TCAACTGCAC 


CGTCTGCTTT 


540 


TAATTGTTTT 


GAGTGANTAA 


TACATATCTT 


TAAGCTGACA 


AGTCTGATAC 


AGAGAAAGAT 


600 


AAGTAATATA 


GATAGANTCA 


TGATGTCCTC 


CGTCATTATG 


TCATATGTAT 


AAGCGTTGAT 


660 


TTTGACAACA 


TAAAGTATTT 


TATAGATAAA 


ACTTGTCACA 


TACTATTAAC 


TATTTATTAA 


720 


TTTTAGTACA 


TAAATATGTT 


TCTAAGTATG 


TGTTTATGTT 


CAGTATTTTG 


GATAATTTAA 


780 


TAATTTTAAG 


GATATTAAGC 


GCTTACAG 








808 



(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 



GAATTCGTGT TCAAGGAGGT AAAAAATAAT GGGTGAAAAT AAAAACAATT TGTCGATTAA 

629 



60 



WO 97/30070 



PCT/US97/02318 



CGACGATCAT TCTAATGCAG CTATGACGCA TACCTCTGAC GCTATCGCAT CATCTGATTT 120 

TATTATTAGA GAATTAGACT TGAATCAGGA ACCTGAAATG CAACGAGAAA GCACANNCTC 180 

NCGGCAAGAT GCTTGGNCTC AGTTAAAACG AAANAAGTTA GCTGTNGTCG GTATGATAGG 240 

TGGTNGTTNT CATTGNCCNA TTTGNTTTTA TCGGTCCAGC CNATAAATTA AACATGATTA 300 

TGCCTGAACA AAATGTTGAA CATAGAAATC TTCCNCCAAN AAATACCTGT ATTAGACAAA 360 

GTTCCCACCC CTNGCCTCCG GTGGGNANNG TTNNAGATTG GNAAAGATTC TTATTAAAGC 420 

AGGNAANNNN TAGNGNGAAT ATTGGTNGGG GGTGGNCACG GGGTCCGTGT TTTTGGTAAN 480 

AAAAC 485 

(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 

CTACAATGAT AAATCTGATT GATCTTTGAC AAAAATAACA ACTACACTTA ATACAATACC 60 

AGACGCAACA ATATTACTGA GGCTAGGAAT TTTAAACTTT TCAGNTCCAG CATAAAACCA 120 

TGAAATATCG AGTGCTGCAC CTATAATATA GATTCCTTGT AGTAGGAAAA TAAGATAGTA 180 

ATCATCAATA AATATAGTAA TTACGACCAT ATACACCGCA AAAACTGTTA ACGCTAAAAA 240 

TAATTTACTG ACAAAGATAT CCCAAAACTG CTGAGACAAT TGCCGTTTGT CGTTAACAGA 300 

CTTCGCGATA ACTCTATTAA AATATAACTG AACGCCAACA CTTGCAATCA TCAAAAAGTA 360 

TTGCACGATA TTGAAAGAAA ATGAAACAAT ACCCACACCA CTGGGACCAA ATGCACGTGA 420 

AATAATGGGT ATTGTAATCA GTGGTGTCAT CGTTCTAATC AATTGGTATA GCCCTTGATA 480 

AATAATATTT TCTTTTAGTG AATCACTCTT CATTACGATC CCCNTTNCCA TCTTGGCATA 540 

TTTGAACAAT AAAGACGANN CTTCCTAANA ATNAAAGGNG AAATGTCTTG AGTTATGACT 600 

GATACTGGTA GCTTGTAAGT TAGCATCGGN CAACATNNAA CCANTTTGCN GGGANATTTT 660 

TCCCAGTAAT GT 672 



(2) INFORMATION FOR SEQ ID NO: 630: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 



Al\ii-U lux 1 1 


InulnAvrl 1 1 


X * ULUnU 


ACANAATCGG 


ANAAATTTTT 


TCCCAATAAT 


60 


AAUNLLL XAb 




Annl\.vLAl X 


TTTTATfT AT 


ATAAGATNCA 


TCGCATTTAA 


120 


TV nw^m » rpp » TV**" 

A 1\j 1 AITjA IVj 


Ai lAlwinAA 


ATTC2TTTA AT 


r^AAAGCCAT 


CAAA(f3TTGTG 


ATACAGCGCA 


180 


L 1 1 AA lo 1 1 


inlAAAJi Inn 


ivin inr Ar*TA 


TT A r*TT A A AT 
x x nL x x nnn a 


V, X VW A X Vr XXX 


ATTATTAAGC 


240 


Aui 1 lAluAl 


TfTVH AO ATT 
1 X lunALAl 1 


X nu/vinnnl r\ 


AfV*TTTTf*TT 


GTTCAATATC 


CAATTCATTA 


300 


Al l At, 1 l\j 1 1 


1 1AAA1 1111 


A A A AfWA AA 


ATATTTTTf^A 
AX AX X X X XX* A 


ATAGTAAPTC 

XIX XVV7 x nnl* X >— 


A\ X Vmp^^X^ X X X V» 


360 


CCCATv-AAAT 




TA A ATTTY^T* A 
xAAAx X luLA 


TT ATOTATT A 
X I rV J. X/l 


TTAAfYlAATr' 


("2 A ATTTA AfyP 
Unn XXX nnU X 


420 


CGCTTTATTT 


GTGATACTALy 


TGATTTAAAA 


A OTT* A A f^flTT 
AL iuAAbul 1 


wnw|V*l A A^P A A 
X 1 IVAAAlAA 


A C* A (Tf2T A A 1Y1 


now 


TTCAATCCTA 


ATTTCACATT 


r* A A r* A Tfl A TY2 


x IniuWlAl 1 


TTAATATPAT 
X InnlnlVnl 


TPfiATAAATT 




GTTCTAATTT 


CCATCAAGGA 


TAAATCAaAA 


ACAAAATGCA 


CTTCGTAATT 


ATTATGATTA 


600 


CTTAATTCGT 


TTGGAGATAT 


TTGTCTGAAA 


ACACTCATAA 


AAGCTTCTTC 


AACAAAATAA 


660 


GTGCTATTGA 


TGTAAAGCCG 


TACTATCAAC 


TTTGTAAAGC 


TAAAGGCTTA 


TTGTCAATTA 


720 


TCGACAATAC 


TTTTTTAACA 


CCTTATCTTT 


CAACACCATT 


AGCAGAAGGT 


GCTGATATAG 


780 


TCTTACATTC 


AAGCTACGAA 


ATATATTGGG 


TGGGACAATA 


ACGATGGTAC 


TTAGCAGGGT 


840 


N6TCGGNAAC 


CGTCAAAAGA 


TGGAATCACT 


CGGCGCAACA 


AGNNGGGTTG 


GATTTNNCAC 


900 


AACAATGACT 


TGGGGCAACA 


CTTCCAACCA 


ATAGATTGGN 


TATTTGGGTG 


GGTACNGTNG 


960 


GACTTTAAAA 


CTTT 










974 



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 : 



CAGAAAGTTT 


TTAACGAAGC 


TTTTCATGTG 


AATAATTACG 


ATATAAATGT 


AACTACACCA 


60 


GTTATTTCTG 


CACATACTGG 


TCAAGGTGCG 


ATTGGCCTCG 


TAGTCCTTAA 


GAAGTAAATT 


120 


TAATCTTTTC 


AGTGTTAATT 


NCTTCCATTT 


CANTCCTTTA 


TAGACTAAAT 


TTATAATTAG 


180 


NTAGATAGAG 


GAGGTAATTC 


ATATGACAAA 


AGAATATGCA 


ACATTAGCAG 


GAGGATGTNT 


240 


CTGGTGCATG 


GTTAAACCAT 


TTACATCATA 


TCCAGGCATC 


AAGTCAGTCG 


TATCTGGTTA 


300 


TAGTGGCGGN 


CATGTTGATA ACCCTACTTA 


TGAACAGGTA TGTACGAATA AAACCGGCCA 


360 


TGTCGAAGCA 


GNACAAATTA 


CGTGTGATCC 


AGAGGTTACT 


TCCTNTGAAA 


ATATATTAGA 


420 


CATATATTTC 


AAAACATGTG 


NACCCANCTG 


ATGATCAAGG 


GCAATTTNNC 


GATAGAGGCG 


480 


AAAGCTATCA 


NCCAGTCATT 


TTCTNTCATG 


ATGAGCATCA 


GAAAAAGGCT 


GCTGAGTCTA 


540 


ANAAGCAACA 


ATTAANTGAA 


CAAGGTATTT 


CAAAGAACCC 


AGGGATTACA 


CCTATNACCC 


600 
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ATATTAAAAA TTCTATCCAG GT _ 622 

(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 



ATTAACGTGC 


ACNCTATCGT 


TTTTATTATG 


AANGTTGGTA 


ACGCTCATGA 


GTTTCTCATC 


60 


TACTATNTAA 


TATTAATATT 


GATTAATCTA 


TGGAAAATGT 


TATCAATATC 


TNCGACAGGA 


120 


CAGNCAATTT 


TNACATCTTT 


TATCCAATGT 


ATAAAACCTA 


CACTCAACGA 


GTCATAAATA 


180 


TAAAATATGA 


GATTATCTGG 


GATGTCGCTG 


ACAATATCGA 


TACGGTCTCT 


ATTATTTGCG 


240 


ATATCGGTAT 


GCATAATTTT 


AATGGAGACA 


TTTNNTAATA 


CTTTATTAAA 


TACAGCGTCT 


300 


TCTTCTTGGA 


ATTCTGCGAT 


TTCTCTCAAA 


TCCTCTTTAT 


TATTAATCGT 


ATCACTCATC 


360 


GTGTGGAATG 


GATGATTAAG 


ACGGNCACTG 


ATATCTCTAG 


CAAAGTAGTC 


TTTAGTCAAT 


420 


TGATTGAACA 


AATACTCTAG 


AAGATCATAT 


TTATCATAAA 


AATGTTTATA 


AAATGTTGTG 


480 


CGGTGTACGA 


GTGCATTGTC 


GCAAATNTGA 


TTGACAGTAA 


TCGTNTGGAA 


TAATTGCTGN 


540 


TCTAACAATT 


GTAGCAAGCT 


ACTCGACAAT 


GCTTTTTGTG 


TCTTTATAAC 


TCGNAAATCA 


600 


GTCTCTNTCA 


TCTACATNTC 


TCCTNTGTGT 


GTAGGGNTTA 


ATATGACAAA 


AATTNNCCAA 


660 


TTGGATTCTT 


TTAAGAGGNN 


CGGNGGATTA 


GATACTTATT 


ANTCGAACAN 


ATAATCTTAC 


720 


ACTNGTATTT 


TTTAAACTTA 


TAACGCNGG 








749 



(2) INFORMATION FOR SEQ ID NO: 633 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 

CTGGTAATTT ATTTGGATGA AATATAATTC TTTAAACTCT ATTGATTAAT CAAATATGTA 60 
TCTATCAAAA TTCGTCTTAT TTATTCACTC TGACATATCC AAATACGCAA AAAGACTATT 120 
TCTACCTTGC ATCGATAGAA ATAGTCTTTT TATACTTAAT TGGTTTAATT TCACATAAAA 180 
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X X X X Xm- X (lv X X 


XXX X XaX^X^X^XV X 


GGGATGAAAT 


ATATTTTCCT GAAACATGTA ANTCATGTTC^ 


240 


GNTACAAATT 


GATAAAATAT 

xj*a ^ rifwi x **- x 


GATTAAGTGT 


TGGGTGCAAA 


TGACATTTTC 


TCAAAAAGCG 


300 


C TTC ACT ATC 


AATTGGGTAC 


TTGTGTGTGA 


ATTTCAGCAA 


AGTCGCTGCG 


ATAAATATTN 


360 


GNNPAACTTA 

umi\>/mw x xx» 


AATTGGGTAA 


NAATCTTTAT 


TACGGNGCGC 


CAGNTGGATA 


ATGCATCAAN 


420 


f2GTTTf , A AfY* 


Af^ ATTTTr* A A 


v_\j\_ x x luxiin 


TNGCTGAATT 


AACTTAATAG 


CCCGNTTTCT 


*mO\J 


CAfTPAATTT 

V*X\l«V.l~X&XlX 11. 


v_ a x x xvrv\_xw_ V— - 




TCTGCTGTAT 


CACCCATAAA 


CGCTTTAATA 




TCGATTAGTT 

X \~yJ<Z* X X XE\J A X 


GTTGTGGTTC 

VJ X X V7 X X X ^> 


AAGGGCATAT 


TCTTCGTTAA 


AACGATGTAA 


TGTATATCTA 


SDH 


TTATAAATGT 
a xxixxwix wx 


TA A A APfTTT 

X XWTlXVwX., XXX 


TTTAATTAGC 


CAGACTTCAA 


CATTGTCATT 


AATACATTGC 


DOU 


AfiTA A ATf*TT 


X VJ X X 


A AT A AT AT AG 
AnlnnlAlnU 


ACATCGNNAT 


CAGTTGAATA 


TTGTTGTGCT 


/zu 


A ATYSTArVTA 


TAAPATY^ATY* 
x AnvJi X \»xl. x \- 


fY^fTTT* AT AG 

^VjV_ X X V_X\ X XttJ 


TTTTTAACGC 


CAATATTTAC 


AAAGCCAAAT 


ion 




<|W|U|VMIH|tflt]l iX"» 

1 1 x\,x 1 XAAL 


A T A A TT* A A A T 
nlnAlwUlnl 


TGTYSGT A TO A 


X* 1 X X X \_ 1UU 






CGATTTTGGT 


TATAACCATC 


AAAACATATC 


ATTTCTAAAA 


GTTGATTGTC 


CCATATCCCA 


900 


ACATACAGGT 


ACATGTGTAG 


GGCGTATTTC 


ATGTATTGGC 


CGAAAAGATA 


TGACGCACAA 


960 


ATCCTNGTAT 


TCCATTTGTA 


GGGTACACCT 


TGGTGAATTG 


GTCATAAATT 


GGTNATGGAG 


1020 


GACTNGTGGC 


ATAGAAATGT 


CTTAATTATA 


GGGCCATACA 


ACTTCAAGTN 


ATATTNATTA 


1080 


GGGATTTAAA 


CGTCC 










1095 



(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 



GGCGATCGGT 


ATCCATATTT 


AATATTTGTA 


TCTCATCCAA 


CATCTTAATA 


TCTCTCCAAT 


60 


CTATGTATCT 


TTAATATCTT 


NACATTACCC 


TAAATTNTCC 


AACAAACTCA ATTAATGCGA 


120 


ATTATCGCTT 


TCAATANNAA 


ATATTCANTA 


AATCATTNAA 


GATATTGAGT 


TCCAATACTA 


180 


TTT/TCACTTT 


TCATTAATTT 


TAGTCAAAAA 


AATACCCAAC 


CAAAAATGAA 


TTAAATCATT 


240 


CTTAGTTGGT 


TTATATATTA 


ATATCTATTC 


TGATTTTTCA 


TCTTCATCAG 


ACTGTCCGAT 


300 


AGTAGGTCTC 


GCTTCATTAA 


ATTCATAGTT 


TAATATCACC 


CAAATAATTT 


GGTGGTTATC 


360 


GATTTCTGAA 


ACAACCCAGC 


GATCATAAGT 


TGTNTCCACG 


TAATCANCTT 


GNTGGTAAAT 


420 


TGGTATTACG 


AGATNGGTAA 


CCATCCACCC 


TATCGNATCA 


ACTATCCTCA 


AGAGTCAACA 


480 


AAATNCTTAT 


ACCGAACTCT 


TCAAGTTAAA 


TCATCCAATA 


GTACTCTGNC 


ATNNTACTTG 


540 


GAATGTCTNA 


TTATCAATTN 


NTAACGATAT 


CANTCACTTT 


CATCATCAAT 


CAAAATNCAA 


600 


TCAAGAATTT 


TCTTCCAACC 


GATTTCTTCC 


TTAAA 






635 



(2) INFORMATION FOR SEQ ID NO: 635: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 



CTATTTTATA 


ATAGGAGGGA 


ATNTCAAATG 


ATAGACNACT 


TTATGCAGGT 


CCTTAAATTA 


60 


ATTAAAGAGA 


AACGTACCAA 


TAATGTAGTT 


AAAAAATCTG 


ATTGGGATAA AGGTGATCTA 


120 


TATAAAACTT 


TAGNCCATGA 


TAAGTTACCC 


AAGCAGTTAA 


AAGTGCATAT 


AAAAGAAGAT 


180 


AAATATTCAG 


NTGTAGGGAA 


GGTTGCTACT 


GGGAACTATA 


GTAAAGNTCC 


TTGGATTNCA 


240 


ATATATCCCC 


CGCGATGAGA 


ATATAACAAA 


AGAAACAAAG 


GATGGATATT 


ATNTGGTATA 


300 


TCTTTCTCAT 


CCGGAAGGAG 


AAGGCATATA 


CTTATCTTTG 


ANTCAAGGAT 


GGTCAAAGAT 


360 


AAGTGATATG 


TNTCCGCGGG 


ATAAAAATGC 


TGCAAAACAA 


AGAGCATTAA 


CTTTATCTTC 


420 


CGAACTCAAT 


AAATATATTA 


CATCAAATGA 


ATTTAATACT 


GGAAGATTTT 


ATTACGCAGA 


480 


AAATAAAGAT 


TCATCTTATG 


ATTTAAAAAA 


TGATTATCCA 


TCAGGATATT 


CTCATGGATC 


540 


AATAAGATNC 


AAATATTATG 


ATTNGAATGA 


AGGATNCACA 


GAAGAAGATA 


TGCTAGAGGA 


600 


TTTAANGAAA 


TNNNTTNCAG 


AACTTATTTA 


NTGAATTAGC 


TNCAAAAGTT 


ACAAAAAACA 


660 


TCCTATTGAT 


AGCTTGGGTC 


AATAGCATAG 


ACGAANTACA 


AGGAAGACAG 


NGAAATTGGA 


720 


GAAACTTAGA 


NCAGCACAAN 


NAGATAAGAC 


ACCANGGCNC 






760 



(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 



CTTATTGTTT 


TTAAGTTATC 


ACTTAAAAAT 


CGGTTGGCAA 


ATTCGTTGTG 


ACGCTTGTCC 


60 


ATCTTCTAAT 


GAACAGAATT 


TGTGATAAAA 


TACCGCGCGT 


GCTTCAATAT 


ACTCATNTGN 


120 


AGTCTCATCG 


ATNTGTTTTA 


ATGCATCAAT 


GAGTGCTGTG 


TGATTNACAA 


CAATTGGCCC 


180 


TGGCAACTCT 


TTTTTATAAT 


CCATGTAAAA 


CCCTCTAAGC 


TCATCGCCAT 


ATTTATCTAA 


240 


GTCATATGCA 


TAGAAAATGT 


GCGGACGCTT 


TAATACACCG 


AAGTCGAACA 


TGACAGATGA 


300 


GTAGTCGGTA 


ACTAACGCAT 


CGCTGATTAN 


GTATAANTCC 


CGGNAAAAAA TGNGGATGNC 


360 


TACATAATCT 


GANNCGTCNT 


TCNCGNAANA 


TCATCATGNT 


CATCAAGACG 


TGTCACNAGC 


420 
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TAAATNATAC ATGCGTNATA NAATNACATA ATCATCATCC ANCGCTGGAC GCAAATGCTT 480 

CTATATCAAA GTTAACATTA AATTGATATG AACCTTCNCG ANTCGCTTCA TCGTCACGCC 540 

AAGTTGGCGC GTACATAATC ACTTTTTTAT CTAATGGAAT ANTTAATCTT GTCTTAATAC 600 

CATTAATATA NTCAGTATCA TTGCGTTTAT GTGATAATTT ATCATTTCTT GGATAACCTG 660 

TTTCCAAAAT CTTATCTCGA CTAACATGAA ATGCATGTTG AAAG 704 

(2) INFORMATION FOR SEQ ID NO: 637: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 7: 

AAGATAATTT GGTACCCAAT TAATGCCTGN CCAAAGATGT AAATACCTTC CTAATTCNNG 60 

GACCCCCATT TGTCCGGAAT AGTTTGCCCA CCCTACCCNA TTGATCCAAT AATACCTGCG 120 

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 180 

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 240 

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 300 

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 360 

TAGCTATCGT TATTTAAAGT CTCTGTACTG GCAATATCAT ATACATTTAC ATTATTTTTT 420 

CTGCC 425 

(2) INFORMATION FOR SEQ ID NO:638: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:638: 

CTTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTATT ATTAGTGGAA AAGTACAATT 60 

GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT TAGTTCAAAA 120 

AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT GATTCAATTT 180 

TTCGAAGAAT GATGCAAGAT ATGCAAGGTT CAAATCAAGT CGGTAACAAA AAGTACTATA 240 

635 



WO 97/30070 



PCT/US97/02318 



TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA GGTAGCAATC_ 300 

AATCTGCTGA ACAAAGTGCG CAAGCTTTTT CAACAAAGCA GCACAAAGAC AACAAGGGCA 360 

ACAAGGTGGC AACGGCAATT ATTTAGAACA AATTGGTCGT AACCTTACGC AAGAAGCACG 420 

TGACGGTTTA TTAGATCCAG TCATGGTCGT GATAAAGAAA TTCAAGAACT GCTGAAGTCT 480 

TAAGTAGACG ACTAAAACAA TCCTATATAG TGGAGAGCTG GTGTGGTAT 529 

(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 



CGAAAAGTTT 


AAAAAAGTAC 


GTACATCTTT 


ATTACCTTTG 


ACTTTCCATA 


AAATGGAGTC 


60 


TGTTGATAAT 


GTTTGACCGT 


TCCACNTTAT 


CACTAAACNC 


CACATTCCTA 


CCATATGCTT 


120 


TTCCAACATA 


GTCATGTGCA 


TCAGTTTGTA 


TCGCGACACG 


CCATTCCATA 


AATTTTATTG 


180 


TTGGCGCAAC 


AAATACTTTA 


ACAGGGTGAT 


GCTCTTGTTG 


AATTGCTTTT 


AAAGCTTGTT 


240 


GCTTAATTAC 


GGCTCTCATT 


TTAAAACGAA 


TCATGTAATA 


AATGATTAGT 


AAAGCGATTA 


300 


TAGGAAAGAA 


GACTGCAAAT 


GGATGCAAGC 


CTATAACCCA 


TAATACAATA 


CCAATACAAA 


360 


GAACTGTGAA 


AATAATAGGG 


TCAAATGTGG 


TTAATCACAC 


TTNAATTGAA 


TCCATTTGTT 


420 


TGTGATAGGT 


CTAAGCGCTT 


GTGTACCATA 


AGAATTGAAT 


ATATCTACAA 


AGACATGTAA 


480 


AAATACTGCG 


AGCTGAGCCC 


ACATCCATAC 


ATGAAAG 






517 



(2) INFORMATION FOR SEQ ID NO: 640: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640 : 

TTATCAGTNT CTTCTATTAT ATTGTCANCT TGTGTTGATG ATATGTANGT GTTGATACGC 60 
TCCTNACACA TAATGTNCCC TTTTGGTAAG CATTTGNATG TCTTTCTTCT ATTGTNTTNN 120 
TGGNTTTTNN AANTGNAGAT AAAATANAAG TGTNTTGTAA CAATATGTTT GTGAAAATGC 180 
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CAATTTCTTT ATCTCAAAAA 


l\.Anunn x a x 


TGTGATATAC 


TATTTTTCAT 
xnx xxx t\>m 


AGTCTTGAGT 


240 


TATAGATGAT 


ACTGCTAGCC 


1 AAA inl 


CAAPATPGGP 


ATGPAPATPT 


TAGANTTTAA 


300 


GCTAGTGGCA 


GCTTATAACT 




T ATT & TTTY^P 


AT AP A Afi ATG 


TATATAPATA 

x *\ x n x x ** 


360 


AATGGAAGGA 


TGATATTATG 


ACATCAATTA 


AAACAATTAC 


ACCCAAAGAT 


TTTATATTTA 


420 


GAGTTTTATC 


AGGTGTCGCC 


ATAGGTATTG 


TTGCGGGTTT 


AGNTCCTAAT 


GCGATACTTG 


480 


GAGAAATTTT 


TAAATACTTT 


ATGGATTATC 


ATCCGATTTT 


TAAAACGTTA 


TTAGGCGTTG 


540 


TCGGTGCAAT 


TCAATTTACA 


GTACCTGCTT 


TGATTGGTGC 


ATTGGGTTGC 


CATGAAATTT 


600 


GACCTATCGC 


CATTAGCAAT 


TGCTGTTGTG 


GCAAGTGCAG 


CAGGATAT 




648 



(2) INFORMATION FOR SEQ ID NO: 641: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucieic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 



TTTGGTCCAG 


AAATTGAAAC 


ATTTACACAA 


GTAGTAGAAG 


CAATTTATGA 


TGAAGAAGGT 


60 


AACAGCTTAG 


ACGCGGCGCG 


CCATCCATTA 


CAAATCGTTC 


AAATTAAAGT 


GGATCGCCCG 


120 


ATATATCCAA 


ACAACATGAT 


GAGAAAGGAA 


ATTGGCTAAT 


GAAAGCTACT 


ACAATCATTG 


180 


GCATAGCTGG 


TGGATCTGGC 


TCAGGAAAAA 


CAACTGTAAC 


TAACGAAATT 


ATGAAAAACT 


240 


TAGAAGGTCA 


TAGTGTCGCT 


TTACTTGCTC 


AAGATTACTA 


TTATAAAGNA 


TCAAAAAGCA 


300 


CTTGACTTTC 


GACCGAGCGC 


CTAGAAACCA 


ATTATTGANC 


CATCCATTTG 


CATTCCGATA 


360 


ATTGATTTTA 


TTAATTGGAA 


AATCTTTAAN 


GACTTGAAAA 


ATGGNTAAAG 


CAGGNNGGAN 


420 


GTACCCNCAT 


TTTGATTATG 


CTAGG 








445 



(2) INFORMATION FOR SEQ ID NO: 642 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 



GATATTCGTC TCCGNNACCG GNCCTTNATG NAGATATGNC CCTTNTTGGA TGCCGTATTA 
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AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT ACNAGCTTAT 120 

GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATAG ATGCGCAAGA TGCTATGGAT 180 

CAACATGATG CTTGGGACTA TAACGCTGAA ATTAAAACAA TCCTCTCAAA ACTAGGTATA 240 

CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG TGTTGTNCTT 300 

GCGTAANACA ATTAATAGGA ACAAACCAGA TTTATTGGTT ATTAGATGGA ACCNACGGAA 360 

CCANNTAGGA CTTCGAATCA ATCAGCTGGT GGGTCAANTT ATGTGAAGCA ATNTCCNCAA 420 

TNCAGG 426 



(2) INFORMATION FOR SEQ ID NO: 643: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 



CTACCGCGTG 


CGCTGTTAAC 


CCAATNCGTC 


ANCTTTTTCA 


GACTTAGGTG 


ANATGAAATG 


60 


TGGTCCCNCG 


ACGATCGTCA 


GTAACCAAAT 


AACAGGCAGT 


CATTGGATTA 


TGTTTANCGA 


120 


AGTAAGTAAA 


TAAATATTTT 


GAATTACCTA 


TATATTGATG 


ATGATCACCG 


ATAAAAACGA 


180 


TGTGATCTTT 


TAAAACGTTT 


GGTTGCTTCG 


TATACAGGCC 


GAAGTTTTGT 


AATTGTTTGT 


240 


TATCAGCACG 


TTGCTTTGCT 


ATTTTATATA 


ACCATTTAGG 


TAGCATAAAC 


TTGGTAGAAN 


300 


ACAACCAATT 


ATCTGNCGAA 


TTAAGTTGTT 


CTGAATCCTC 


GATAAAAAGG 


TTGCGAAAAG 


360 


GTATAGGATG 


ATGATCAATA 


GAAGCCATTT 


TATTCAAATA 


AAACTGAGAC 


ATCATAG 


417 



(2) INFORMATION FOR SEQ ID NO: 644 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 432 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 



CTACGTTGCC CCATCGTTTT CGCACGAATC GTCTTACCAA ATGCATCTTT AGTTATCTCT 60 

CCATCATANA AATCNAACAG ATGTTGAATT GTGTTATTAT GCGCCATTTT AATAGCTGCT 120 

TCAACATCTT NAATTGTAAT ATTATTACCT AAATCAATAN CCTNCAGCAA ATTGATTAAT 180 
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ACTGATTCCG CTTTCTCTAC GTTTTCTATT TGTGGNACCT TTAACGGCAA CTTCTTGTCC _ 240 

TCTTGNATGG ATGACAACAT CGAAACTCTC TNCAATTGCT TTTAAATGTT CATCATNATT 300 

TCCAATTAAA GCTTGAGATG GGNTCANATC GTCTATNTGT NTAATTCCAG GCATACACGC 360 

GCTCCNNNNC ATATATCATT TATATNCANN CTNTGATGCT NNAATNTCTA CTTANTNATA 420 

TCAAGCAATT GA 432 



(2) INFORMATION FOR SEQ ID NO: 645: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 



CAAGAAGAAA 


TGAAATATAA 


ACGTCCGTCA 


GGTGTAGACT 


TCGGTCCTGT 


AGATTNTGTA 


60 


AAATATGCAG 


AATCATTTGG 


CGCGAAAGGT 


TTACGAGTTA 


CTAATCAAGA 


AGAATTAGAA 


120 


GCGGCAATTA 


AAGAGGGCTA 


TGAAACAGAT 


GGTCCAGTAT 


TAATTGATAT 


ACCTGTAAAT 


180 


TACAAAGATA 


ATATCAAACT 


TTCAACAAAT 


ATGTTACCTG 


ACGTATTTAA 


CTAAAATAAA 


240 


GATAAATGTT 


AAAGAGGAGT 


GGGAGATTTT 


ATGACTAATG 


TTTTATNCCA 


ACATGGTACA 


300 


TTAGGCACAT 


TAATGGCAGG 


ATTATTAAAA 


GGAACTGCAT 


CAATAAATGA 


ATTATTGCAA 


360 


CATGGNGACT 


TAGGTATCGC 


TACACTAACA 


GGCTNCAAAC 


GGGTGAGGTA 


ATCTTTTTTA 


420 


GATGGAAAAG 


CTTACCATGC 


AAATGACATA 


AAGATTTGGT 


AGAATTAAAG 


GTGACGAGTT 


480 


ACACCATATG 


CAACTTGTAC 


TAATTTGGNG 


CAGGNTCAAA 


GCTTATGAAC 


GAAAGATAAT 


540 


CTTCCAGAGC 


AGGTT 










555 



(2) INFORMATION FOR SEQ ID NO: 646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646: 



CTAGTACATT CCAAGGAATG GAACTGATTT CTAAATTAGA TTCNAGAACC CAAGTGATGA 
CGAACGACAA GATTACACTA GCATTTGATA TGAATAAGTG TCACTTTTGG GATGAAAANA 
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CAGGAAATCG TNTCGTCTAA GGGGGAGTAT TCATGTCTAA AATTTTAAAA TGTATCACGT 
TAGCCGNGGT AATGTTATTA ATCGTAACTG CATGTGGCCC NAATCGCTCG NAAGAAGATA 
TTGNTANAGC ATTGANTANA GATAATNCTN ANGACAAGNC TAACCAACTT ACGATGTGGG 
NGGGATGGCG ACAAAGCAAA TGGCGTTTTT NTNAAAAAAT TACGGCATCA ATCATACTAA 
AAAAACTGGC ATCAANGTAA AGCTTGGTNA NTATTGNTCA AAATNATCCA NCTAGNANAC 
ATTTCNGCTA GACGCTCCCG CAGGAAAAGG GCCCAGNTAT CTTNTTTCTT TAGCAACATG 
ATAATACTGN NANGTCCNAT CT 

(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 

GAATTCCTAA AATTTCCTAA TTACAAGGAG GAAACACCAT GGCTAAAAAA ATTGTTTCTG 60 

ATTTAGATCT TAAAGGTAAA ACAGTCCTAG TACGTGCTGA TTTTAACGTA CCTTTAAAAG 120 

ACGGTGAAAT TACTAATGAC AACCGTATCG NTCAAGCTTT ACCTACAATT CAATACATCA 180 

TCGAACAAGG TGGTAAAATC GTACTATTTN CACATTTAGG TAAAGTGAAA GAAGAAAGTG 240 

ATAAAGCAAA ATTAACTTTA CGTCCAGTTG CTTGAAGACT TATCT 285 

(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 648: 

ATTGACCCAT TCGTTAATNA ATCGNNNATN AGNANATGTT CCTNATAANT NTCATGATAA 60 

TACCNAANCA CATCNAATAA TGTTAGTTTT CCNACCGCCT CTTATGCCNA ANCCNAATAA 120 

TAANATANAC NGGCGCNATG TACTGTAGTA ATGTTGCAAT CGCAGCATTA CCTGTATTAA 180 

TAGATGCCAT ATAAGCATAT TGTACAACTA ACATGCCCAG TATACTGAAG ATAAGTAANT 240 

TGCCTAACAT ACGTCGATCN TGGAATACGC TAAATANTGA ACGNNTGGGT TGCATCATTC 300 
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TGTACATAAT ACAGTOAGNA AACACCACTG ACTACAAGTC GACCAGTGAC GTNCCAATCG_ 360 

ACATTAATAT TCTTATATGT GNACAAGAAA GCAGANACAG TNCCACCTAA TCCCCATAAA 420 

CAAGCACCGA CAATGGCAAG TATAATTCCT TTTAAACGCA AAGCTGAATG ATTATTCGCC 480 

AAGTATCTCA CCCCTTTAAA ACTTAACACC AACAACATAA AATATGTCGG TGTATGATAA 540 

TGATTTAATT CATTAGGAAT TCCG 564 

(2) INFORMATION FOR SEQ ID NO: 649: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 



TATAGAACCT 


ACTATATCTT 


TGTCTAATGC 


CATGTTATTC 


TCCTNCGATT 


CTTGTATTCT 


60 


TTATTATTAT 


AACATGTGTG 


AGTCATTCTA 


TCATTACAAC 


TTATATGAGG 


TCGAAACTGT 


120 


GACGATAATA 


TGATATAATT 


CTTCCGCTAG 


AATATCGAAT 


GAAAAAAAGG 


AGAAAATGCA 


180 


TGTTACAAGT 


AACTGATGTG 


AGTTTACGTN 


GTGGAGATCG 


NAAACTATTT 


GAAGATGTAA 


240 


ATATTAAATT 


TACAGAAGGT 


AATTGTTATG 


GATTAATTGG 


TGCGAATGGT 


GCAGGTAAAT 


300 


CAACATTCTT 


ANNAATATTA 


TCTGGTGAAT 


TAGATTCTCA 


AACAGGACAA 


TGNTTCANTA 


360 


GGGTAAAAAT 


GAACGNCTAG 


CTGGNNTTTA 


AAACAGGACC 


ACTATGCTTA 


TGAAGATGAA 


420 


CGCGTGCTTG 


GATGTTGGTA ATTAAAGGGT 


CACGAACCGT 


CTTTA 




465 



(2) INFORMATION FOR SEQ ID NO: 650: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 



CAATAACTAG GGTGGTACCA CGACGATCTC GTCCCTTGTA TAGGGGCGGG ATTTTTTGTT 60 

TTTTTCAGAC ATAAATGTTT GTTGGTGTCA TATATTCCCT GTTTATTGTT AATAGGTTTA 120 

ATGTTAAAAC GATGATTGTT GTTCAATTTT TTAACGAGGT CAGATAAAAG TATTTATAAA 180 

GCAAATAGGA GGGTTTAACA TGGAACAAAT TAATATTCAA TTTCCAGATG GTAATAAAAA 240 
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GGCGTTTGAT AAAGGTACTA CTACTGAAGA TATAGCACAA TCAATTAGTC CTGGATTACG. 300 

TAAAAAAGCT GTTGNCGGCA AAATTTAACG GGCAACTTGT AGATTTAACT AAACCGCTTG 360 

AAACTGATGG ATCAAATTGA AATTGTGACA CCAGGTAGTG AAGAAGCGTT AGAGGTATTA 420 

CGTCATTCTA CTGNACATTT AATGGCACAC GCGATTAAAA GGGTATATGG TAATGTTAAN 480 

TTTGGGNGTA GGTCCCTGTA ATAGAAGTGG GNTTCTTCTA TGACTTCGNA TGACCAAA 538 

(2) INFORMATION FOR SEQ ID NO: 651: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 651: 






CCGTCTATTA 


TCAACTCGTC TAAAGGNCAA 


GTGTTTANTG 


AAGNTTGAGN 


GTATTACACA 


60 


ANCATCAANT 


CAGCCGATTT GAAANCACAA 


GATTAAAANG 


TCAATATGAT 


GGCATNANTN 


120 


NCCACTGTGN 


ANTTNANACA TCGAACNCCC 


CCTANTTANC 


GGGTAAAANC 


ATNNNTAACC 


180 


AAGATAGTGA 


TATCAGNTAT ATCACACAGN 


NTTTTAAATA 


CAAANTCNAC 


GTTCCAAGAG 


240 


ACGCATGATA 


ANTCATCANA CATGATTGAT 


AAGGATGATG 


TTCATGNTGA 


AACGAAAGAT 


300 


GTTGGATGGC 


AACACATCTT NTGGANNATG 


ANCANACNTC 


AATANNTTAA 


CNTCAATTTT 


360 


CGAGAAACAT 


TTAAGTGACG AAAAATCAGA 


ACAATTATTG 


CATCATNTGC 


GNTCGGGNNT 


420 


AACTTTGCTT 


GATTCAATGA GAATAGTTAG 


TGCCGGAGNN 


AAACAGTGAC 


AAACAN 


476 



(2) INFORMATION FOR SEQ ID NO: 652: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 



GACTTCCTNA AGGCAGTAAT GGCAGCAACA TTGGGGAGTT AATCACTAAA TGGAACGATA 60 

TGAACATGAT GTGTCTGATG ATATAAGTGT CATGTACAGA CACCTCATAT TGGTATTAAN 120 

GGAGAAATGA ATATGAACGA NTCAGGAGAT AACAAACTCA GTAAATCTTC TTTAATTGGN 180 

CTAGTTATAG GANCCATGAT TGGTGGCGGT GCGNACAATA TAATGTCTGA TATGGGCGGT 240 
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AAAGCCGGTG GATTAGCCAT TATTATTGGG TGGATTATTA CAGCTATAGG AATGATNNCA_ 300 

TTAGCGCTCG TATCACAAAA TTTANCCAAT GAACGGCCGG AGCTAGACGG GGGTATTTAT 360 

AGCTATGCTC AAGCAGGATT TGGCGATTNT GTAGGATTNA TCAAGCGCTT GGAGATATTG 420 

GNTCTCAAGC GTCTTTAGGC AATGTTGGCT ATGCAACACT ATTGATGTCA AGCAAGTAGG 480 

TAACTTTCTC CCGATTNTTA AAGGAGGCAA CACATTACCA GTATTATTGT CGCTCGTTAC 540 

TACTCT 546 



(2) INFORMATION FOR SEQ ID NO: 653: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 : 



TATCNGCANT 


AACAGTATCC 


GCCATCTAAG 


GCATNNCGAA 


TAAATTGTTT 


GTNGGGGGGA 


60 


ATAAATATAT 


CCTNCCATAT 


CCTCATCTTC 


AGGTCTTAAG 


AATCNCAAAN 


CCTTTTTTAT 


120 


CTTGACNTAA 


CGTTCCCTTT 


ATCAANTNTG 


ANTGACCTTN 


TGNACTATGC 


TTTNTTTGAT 


180' 


ATCTGNCTGN 


TTTNGTACGT 


TCAATTAATC 


CNGATNGTTC 


TAATTCTACG 


ANCACCTNAA 


240 


TTAAATCTCT 


AAATGAGTCG 


GCACTACTTA 


AACCTAATGC 


ATCTTGAAAA 


TCTGACACTG 


300 


ACATAGGTTC 


ATATTCAGGT 


TGATTAATAA 


TCTCTTCTAT 


TGATTGCTTT 


AAATTCATTA 


360 


NGCCCCTCCT 


TTCTAANGTT 


AAAATCTCTT 


TTTAG 






395 



(2) INFORMATION FOR SEQ ID NO: 654: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654: 

TGAGGTGAGC AGAGATGAAC TTTTTTGATA TCCATAANGA TNCCGANCAA AGGCATTCCA 60 

TTATCGGTAC AACGTAAATT ATGGCTTAGA AACTTCATNC AAGCTTCCTT TGTAGTGTTG 120 

CTTTGTTTAT ATGGCTATGT ATTTAATTCG AAACAACTTT AAGGCGGNCC ANCCGTTTTT 180 

ANAAGAGGAA ATTGGATTAT CTACATTAGA ACTTGGTTAT ATCGGATTAG CAGTTAGTAT 240 
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CACGTACGGT TTAGGGAAAA CATTACTTGG ATATTTNGTC GATGGACGTA ACACAAAACG 300 

TATTATCTCG TNCTTACTTA TCTTATCTGC GATTACAGTT TTAATTATGG GATTTGGTTT 360 

TAAGTTACTT TGGTTCTTGT AATGGGATTA TTAATTGGTA CTTTGGGGGA CTTAACGGGG 420 

TGTTCCAATC AAGNTGGGTG GGACCTGCAA AGTTATTCCA ACGANTTCAA GGATGGGCGC 480 

CAAGGACGAA ACGGTGGGCC GATACTTTAG GATTTTGGGG NATACANCAC ATAAATATCG 540 

GNNGGNNGCC AATTGAGGTG GCGGTNGGNA CTTTNGGGGG GGGTTAANGT TTTCCNTCCC 600 

CAGGAAATTG TTATAG 616 



(2) INFORMATION FOR SEQ ID NO: 655: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 



CAAAATTATT 


TACNGANANT 


GATTTGAANG 


ATGATGAAGT 


CATTTCAATA 


TTTGTTTTCG 


60 


ATAAGTCAAT 


AGAATAAGTG 


TTGCTGGTGT 


AAGGTACACG 


GTGCTGTNTG 


CTAACTTCGC 


120 


TTTGAACTTA 


ACAATAATTC 


AAGGGGGTGG 


TATGTCAAAC 


GGTGCCGTTT 


TTTGNGNCAT 


180 


ATTTTTAAAA 


CAAGCAACAT 


GCAACACGTA 


CTTTAAGGAA 


GTCAAAATTT 


ATCATTTAGG 


240 


AGAGATGGAT 


ATGAAAATCG 


NAGNATTATT 


TCCAGAAGNA 


GGAGAAGGTC 


AAGAAAATCA 


300 


ATTANTTAAT 


ACTNAAAAAG 


NAATAGGATT 


AAAAACATTT 


TNAGAGGAAA 


GAGGGACATG 


360 


ANTNCCACNA 


TTTTAGCAGG 


TTATGGGGGA 


AGACNTAGAT 


AAACATTTNC 


CACGATCTTG 


420 


GT 












422 



(2) INFORMATION FOR SEQ ID NO: 656: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656: 



TTTGATAGGG GTACAGCATT AAAAGAAATT GATTGCACCA TTTTTGGTAT TCATCCAAGG 
AAGANNCGTT AGTTATTGGA GATAGCGACA ATGATAAGAC CAATGTTTTG AATTTGTCAT 
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TAATALAAoT 


TGCTATGAAA AATGCACGCC 


v~ lunnnlLLn 


AfiPATTAAPT 
rtuv— «v i. a. mix* i 


TPAfiATGTAA 


180 


CGGCATACAC 


GAATGAAGAG 


GATGGCGCAC 


pa & A AT IITTT 


A f2p A A A fSP AT 


TTTTT A PJPTO 


240 


ATTAATAAAA 


TAGGTAGTTA 


TTTATTATTT 




Af2TTf2 A TO Af5 


T A ATT!T AP A A 


300 




AGTTATTTTC 


TATAAGAAAA 


Tf^TPTT A PPT 


/^T'Tf'pTTTfrT 


ATGCTAGTAA 


360 


TOT* Af2 IT APT 


AGTGTTACTT 


CTTCTTTTTT 




TCCATATATG 


GAATGAAGNA 


420 


AATCACCAGA 


ATAGGGCACA GAGTATAGCT 


ACAATTGAAC 


CGTATGTCAT 


ATTAGCATCA 




CCTTTTATAT 


GGTTTATGCA 


TTTAAGTATA 


CTATAACAAC 


TACCGTTATT 


TTAAGTAGGG 


540 


GTGTATTTGC 


GTTAGAGCCC 


CTAACAACTC 


CAAAATCGCA 


TTTGTGATTT 


TGGGATAATC 


600 


AAACATTTCA 


TTCCATAAGT 


AAGATATGTT 


TCACTTTATA 


TGAGTTTTTT 


ACATTCACAT 


660 


TCTGTAGTGG 


TTGTAAGAAA 


ATTTTTTCTT 


TCTCTTAGGC 


AGTAATAGGC 


AATTTACCG 


719 



(2) INFORMATION FOR SEQ ID NO: 657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657: 



CAATTCACGT 


GAGACTTCCA 


AATGATTTGC 


CTCAAAATTT 


TTCAAAGTGT 


CGTCGTAAAA 


60 


ACTGTCTAGT 


AAATACTAAT 


AGTATGTCGT 


AGACCTATGA 


CAAATCTGAA 


TTATGACGAA 


120 


GATCAATCAA 


GAAAAACAGC 


ACCAAGATCA 


TTTCAATNGG 


CACGAGTACC 


TTACTGCTGT 


180 


TCTTTATTTA 


TTACATTTCA 


ATCTTAATAA 


TGACCGTAAT 


ATCACGTGAT 


TGTCATTTCC 


240 


TTGTNATCAT 


ACCTAAAACA 


ACAATTAATC 


CTGTGCGTGT 


CGCTGACTTT 


CTTTAGTATG 


300 


AATTAAATCA 


GTAATAAAGC 


GATTGACTGG 


CGCTTCAATA 


TGACGTTGTT 


TACCCAATGT 


360 


TGCAACTGCG 


CCATTAATAT 


AATCAATTTC 


AGTTTTTCTA 


TTATTAACAA 


TTAAATCTTG 


420 


ATACATGGAT 


GGATAATGCG 


CACCAACTTT 


TTCATTTAAA 


TCAACTAAAT 


ATTCAAATAC 


480 


TTCATCAACA 


TTTAAATGAA 


CATTATCAAT 


CGGCGGTTAC 


ATGCACTATT 


TCTTGCGGTT 


540 


AATTTATTAA 


TCAAAACACT 


TCGGATAACT 


ACTTTCAATC 


AGGGGATGCC 


ATATTACACT 


600 


CX^ACACTGG 


TGCTTTAATG 


CATTTGGCCG 


GACCAATAAC 


ACAAATCTTT 




650 



(2) INFORMATION FOR SEQ ID NO: 658: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658: 



GAATTCCATT 


GAGATTCATC 


TTCATTAGCA 


CGTTTTAACA 


TAGTATTNTG 


GAAAATTTCG 


60 


TATACCTCTT 


CTGAAACATT 


AAAACGTTTT 


TCAGGATCTA AACCGTAATT 


TTCGAATGTT 


120 


AATTTTCTTT 


CAACTTCACC 


TAAAGGTGCC 


CCATGAACAC 


CATTAGTTCC 


TGCTTTATTC 


180 


GGTGAACCAA 


ATCCGATTGT 


TGTTTTAACT 


TCAATAATCG 


NTGGNCCTTC 


TTGAGATTTA 


240 


GCTGTAGTAA 


TCGCTTTATC 


AATTTCTTCT 


AAATCATTAC 


CATCTTTTAA 


CTAGTAAGTA 


300 


ATTCCAACCA 


TATGCTTCAA 


AACGAGCTTN 


TGTGTTTTCA 


GAAAAAGCTT 


TGTTTAATTC 


360 


GCCATCTAAT 


GAAATATCAT 


NTGAATCGTA 


TAAAACAACT 


AATTTACTTA 


ATTTATTATG 


420 


TCCAGCAAAT 


GAAGNTGGTT 


CATGCGATAT 


ACCTTCCATT 


AAAATCAACC 


GNCAGAAGCT 


480 


TAATACATAT 


GTGTAATGGA 


TCTTACAAAA 


ATTATATCCC 


CCTTTAATTA 


AATTTCCCCG 


540 


GTAGGTGGAT 


CTTCCTGNTA 


AAGGTTAATC 


CCACTG 






576 



(2) INFORMATION FOR SEQ ID NO: 659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659: 



TTAATCCTGG 


TNGGCNTTGT 


CCTNTCNATG 


NCTCTAGTTG 


TTTAATTTGA 


TAAACCCGCT 


60 


TCGCTGTCNC 


TCCATATGCT 


ACTTGNAAAT 


GAAGNCAGTA 


CATGAAACAT 


TTGATACAAT 


120 


NCCATGACAA 


TAGTTGCCCT 


TGCAGCAGGC 


GTATTTTCCA 


ATGAATCTNC 


TAATTTTAGT 


180 


GCTGCTTCAT 


ACGTNTCAAA 


TGATTTATTT 


CTATTATCTT 


CGAACCATTG 


CTCAATTACA 


240 


TGTTTCACTG 


AAGCAATTGT 


TTNCATGATC 


ATACCAATCT 


GTTTGAAAAG 


CGTTACATTC 


300 


CCCGAAANAT 


GCATTTCCTT 


TGTCATCAAT 


CAATTCGATA 


AACANACAAT 


CNCGATGCGT 


360 


TAAAGTGACT 


TTCGGTGTTA 


CAATTTGTGA 


CTTNAATGGC 


TCACTATATT 


TATNNAAATG 


420 


CAAAGCTGTC 


AACTTCATCA 


AATCATCCTC 


TATACAACTT 


ATTTNCTCTG 


TAATTTACCC 


480 


GTTGATGTAG 


AAGGTAAAGT 


ATCAACCTTT 


TCAAAGTGTT 


TCGGTACTTT 


ATATTTCGCT 


540 


AAATGTTTTG 


ATAAATATGC 


AATCAATTGT 


GCCTTTGAAA 


TGTCACTTTC 


ACTGACAAAA 


600 


TATAATTTAG 


G 










611 



(2) INFORMATION FOR SEQ ID NO: 660: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 356 base pairs 

646 



WO 97/30070 



PCI7US97/02318 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 

GAATTCCTTT CTGCCTCAAT GCGTGTTGTA GGGATACTGT TAGTAAGTGC TTTAATCACG 60 

TTACCAATTG CAATAAGTAT GCGAATAACA AAAAGTTTTA AGCAATTGAT ATTACTAAGT 120 

GTATTTTTAG GTGAATTATC AGTAATATTA GGCTTAGTCC TAGCTTTCTA TATGGACATT 180 

TCTCCGGGTG GNGTAATTGT AGNTCTACTA GTTATTTTAC TTATGATTAC AATGGNTTAT 240 

CAGAAAATGC GAATGAAGTT TAAAAAGGGA GCTAATATCA ATGAATACAA ATGATGCTAT 300 

TAAAATTTTA AAAGAGAACG GGTTAAAATA TNCAGATAAA CGTAAAGATA TGTTAG 356 

(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 661: 






GAATTCCTGT 


ATTAACAATA ATCGCATCAT 


TATTAGAAGT 


TTATGTAACC 


ATAAATTAAG 


60 


GAAGTGTATC 


ATGCTAGTTT TAAATAATCT 


TCAAAAAAAA 


TTTACTAATT 


CAAAAATGGT 


120 


TTTGAATAAT 


GTGAATGCTA AGTTTAGAAG 


TGGGAGCATC 


AATTGCATAG 


TGGGAGTAAA 


180 


TGGGGCAGGT 


AAAACGACTT TGTTAAATAT 


TATTAGTAGT 


ATATTGATGC 


CAACAAAAGG 


240 


TGATGTATAT 


TTAAATAGTG AAAGTATTTT 


TGAAAATAGC 


ACACTTAAAA 


AAAGAGATTT 


300 


TTCACACACC 


AGNAAATCCC TTTTTNTATG 


AAAATTTAAG 


TGCTAAAGAT 


AATTTGTACT 


360 


TAATATGTAG 


CCTATATAAT CGGAAAATAG 


ATCAAATACT 


ATTGAGAAA 




409 



(2) INFORMATION FOR SEQ ID NO: 662: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662: 

GAATTCCTTC TTTAAGTAAA TGGTTAATTA CAGCCCCACC GATACAGTTT GATTGATTTA 60 

CATTTGACGC ATTGTAAATG TCATACTTTC AATTACATTC GTTAAGGCAT TAGCTGTATC 120 

TAAAGATGTC AAACATGGAA TACCATTTTC AACTGTAGTA CGTCTAATTT GGAAGCCATC 180 

CCTTTCTACT TCTTTACCTT TAGTCATTGT ATTTATAACG ATTTGAACAT CACCATTTTG 240 

ANTACGTGTT AATAAATCAT TTTCGCCACC GATTTTG 277 

(2) INFORMATION FOR SEQ ID NO: 663: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 440 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663: 



GCAACAAATA 


AGGTTTATTC 


CGAGTTTTTC 


TGCAAGCCAG 


GCTTGACGCT 


ATCCGCTGCC 


60 


GGGATTTATT 


CATATACTCC 


TGGCGACTTG 


TATTCAGCTA 


AGACACTGCA 


CTGGATTAAG 


120 


ATGAAAACGA 


TTGAAGTTGA 


TGATGAACTC 


TACAGCTATA 


TTGCCAGCCA 


CACTAAGCAT 


180 


ATCGGCGAGA 


GCGCATCCGA 


CATTTTACGG 


CGTATGTTGA 


AATTTTCCGC 


CGCATCACAG 


240 


CCTGCTGCTC 


CGGTGACGAA 


AGAGGTTCGC 


GTTGCGTCAC 


CTGCTATCGT 


CGAAGCGAAG 


300 


CCGGTCAAAA 


CGATTAAAGA 


CAAGGTTCGC 


GCAATGCGTG 


AACTTCTGCT 


TTCGGATGAA 


360 


TACGCAGAGC 


AAAAGCGAGC 


GGGCAATCGC 


TTTATGCTGC 


TGTTGTCTTC 


ACTATATTTC 


420 


TCTTGACGNC 


CAGGCGTTTG 










440 



(2) INFORMATION FOR SEQ ID NO: 664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664: 
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GAATTCCACA TTTTCAGGAT TAAAAAACGG TTTGGAGTTT ATATTTAAAT TCAAATAAGT. 60 

TTAGACCATC AATCGCAACC CTATCTTGGT AAACGTTCGN TGCTGATATA TCTGTAAAAT 120 

TGTGTGGACC GACACCTGCA ATAAACTTAA ACTCTGCTTC ATCTACCAAA TAATCATACG 180 

CTTGTGTATG TCTATCCTGT GCGCCATGTG GAAATACAAA CATATCTTGT TTTACCTACA 240 

ATTGGNTCAA CTTCATCTTT CCATCTTTTA GTATCACGTN TAATACCNCC TAAAGATGTT 300 

(2) INFORMATION FOR SEQ ID NO: 665: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665: 



GAATTCCTGC TTTAGTGTTA AAAATCTATA TCTGCACTAA TTAAACGGTG TGATTCACCC 60 

GCTTCATCTC TAACAATTAG ATAGCCATCG TAATCTAAAT CAATTGCTTG TCCTTTAAAC 120 

TGTTTATCAT TTTCTGTAAA TAGCAACGTT CTATTCCAAA TATTAGAAGC TGCAATATAT 180 

TCTTCACGAA TTTCAGAAAA AGGTAACGTT AAAAATTGAT TATATCTTTT NTCAATTTCT 240 

TGAAGTAATC TCTCTAAAAA TTGATATCTA TCTAATTTAT TTNTATCATG TAATTGTATA 300 

CTTGTTGCTC TATGTCTAAT ACTTACCATC AAAGTTTTCT AGTTG 345 



(2) INFORMATION FOR SEQ ID NO: 666: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 : 



GAATTCCCAA TATTACTAGG AACATCAACT ATACATGCAA TCGAGAGAGG TATGGCACGA 60 

GGTTTAGGTA CTATTTTAGG TGTGTTAGTA CTTTCAGTCA TATTGTTGTT TTCAATACCA 120 

ACACCTGTTG CAGTAATTTT AATGGGCATT GCAGCATTGT TTACTGAAGC ATTGGTGGGA 180 

GCAAATTATG CGATTGCAGT AGTTTTTATT ACAATACAAG TTATTTTAAT GAATGGATTA 240 

GCATCACAGA ATTTAACAAT TAACATTGCG TNTCCAAGAG TTATTGACGT TGCAATAGGT 300 

ATTGTGATTG CAATCATAGG TTTATTTGGC CCNTGGACAA CGTACCGCAT CCCGTATTTG 360 
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GTTCCCTAAT TTTAATTGGC TTGAGG _ 386 

(2) INFORMATION FOR SEQ ID NO: 667: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 : 



GAATTCCTTA 


GATTCTGAAA 


TTTCAGTGTG 


TACTTCAGCT 


ATAGGTGTCA 


TTGCCTCATT 


60 


ACCTGGAATG 


ACAAACTCTT 


GTTTAACTAG 


ATTCGATTGA 


TTAAGTTTAT 


ATGCAAGTGC 


120 


ATGTTCTCGT 


CCACCAGCAC 


CAATTACTAA 


TACATTCATT 


TATGTTACGC 


CCCTTTAAAA 


180 


TTAGTGTTTA 


AAATGTCGAG 


TGCCTGTGAC 


TACCATTGCA 


ATACCATGTT 


TATTAGCCAT 


240 


ATCAATTGAA 


TCTTGATCTT 


TAATCGAACC 


ACCCGGTTGG 


ATAATTGCCT 


TTATACCATG 


300 


TTGTGCTGCA 


AGTTCAACTG 


TATCTCCCAT 


AGGGAAAAAT 


CCATCAGATA 


CTAACGCTAC 


360 


ATGGATCATT 


GATTTCAATA 


GCTCTCTCTT 


AACGCAATTT 


TAGCAGCAAC 


CGACACGATT 


420 


CATTTGGTCC 


AGCANCTATA 


CCTACAGGNT 


GGNTTATTAT 


TACTTAAAAT 


AATTGGATTA 


480 


CTTTNTTACT 


GATGGGTACA 


ACTT 








504 



(2) INFORMATION FOR SEQ ID NO: 668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668: 



GAATTCCCGA 


CGAAAAAAGT 


CAAGAAATNT 


TAAGTGATCT 


AGAAAAGGTT 


TGTATCGNAA 


60 


ACGTATGTTA 


AGTTATGGNG 


GCTTACTTAA 


AGAAAAGCAT 


AAANTTTTAA 


ACTTAGACGA 


120 


TGCCGAAGAT 


GGTAATTTGA 


TTAATACAAN 


TGATGAAGAT 


AAAACGACAG 


ACGAAGAAGA 


180 


AAAAGCACAT 


TCGANNACAG 


NANTNTGGNA 


TNTCGAAANG 


CAAAATTATT 


ATTTACAAAA 


240 


TTTAAGGAGT 


TANAAATATG 


AACACTAAAA 


TTTTCACTGG 


AATGACTGGA 


AGTTCTCTAN 


300 


GAACGAAAAA 


TCAATANTTT 


TATTAATGAT 


CCCNAATCGA 


AGTCATGGAT 


ATTAAGTTGN 


360 


CAAAGTTCTN 


TNTTCCNATT 


GAGG 








384 
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(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:669: 

GAATTCCGTC TAAATAATAT TTAAGTGCNG TCAGTAAGAA ACTAAAAATT GCGATACTAA 60 

TCGCGATTAT GTCCATACTT ATCACCTCCT TAGGTTGATA ACAACATTAT ACACGAAAGG 120 

AGCATAAACA ATATGCAAGC ATTAAAAACA AAATCGAACA TCGGCGAAAT GTTCAACATA 180 

CAAGAAAAAG AGAATGGAGA NATCGCAATC AGCGGTCGAG AACTTCATCA AGCATTAGAA 240 

GTTNCAACTA GGTACGATAA GTGGTCCGAA AGAATGACGG AATATGGTTT GGAGANCGGG 300 

ATAGGATTTT ATTTCGCAAG TTGGANAANG TACACGGGGC AAANNAAGGG CGCGGTNCTT 360 

ATTGAACAAA GTTAATCAAT ATTGCTCAAC TCTCGG 396 

(2) INFORMATION FOR SEQ ID NO: 670: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670: 

TTCAATTGCA ATATGTTCAG CATGCGCCGT TGGTTGTTGT NGTGTTTCTC TTAAATTATG 60 

TGCTCTAGCG ATAACTTCAT CATCTTTAGT GATGATAGCA CCTATAGGTA CTTCGCCTAG 120 

TTGAGCTGCT TTTTTAGCTT CTTCAATCGC TAATGTCATA ANNTNTNTGT CATNTGTCAT 180 

TTATGTCCAG ATACCTCACT TATGGTACAA TACTCAATGT CTATTTTAAC ATTTGGAGAT 240 

GTGAATATGA ACAAACCTTT TATAGCAATT GAAGGTCCTA TTGGCGTAGG TAAATCTTCA 300 

CTTGCACACA AATTAAGTCA AACTTTAGAT TTTTATGAAG AAAAAGAAAT CATCACAGAA 360 

AATCCATTTT TATCAGACTT TTTATGAAGA TATCTCTAAA TGGAGTTTTT CAAACTTGAA 420 

ATGTTCTTTT TATGCAATAG ATAT 444 



(2) INFORMATION FOR SEQ ID NO: 671: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671: 



GAATTCCTAT 


CGATAGCGTT 


TACCGACACA 


ATCGTAACGA 


CAATGAAAAT 


CGTCATCGAC 


60 


TGTAACTACA 


TTATTGACAT 


AAATATCATC 


TGGTAACGTT 


CGGTTCATTG 


CATATTGCCA 


120 


TTGTGACATA 


GGTATATTCA 


GCTCTGTGTC 


GAAATGAAAG 


TACTGCTGTA 


TCGCATGTAC 


180 


ACCTCTATCA 


GTCCTACTTG 


AAGGATGGAT 


TCTCACATGT 


CTTTTATGCA 


TGCGTTGTAA 


240 


TAGCTTTTCA 


AATTGTTGCT 


GTACCGTACG 


TCCATTCTGT 


TGAATTNGAA 


AGCCTAGAAA 


300 


ATTATTTCCT 


NGATACGCAA 


TTTCTACTAA 


TATACGCATG 


AATTTACACT 


CCTGAATATT 


360 


TCAATACGAA 


TTAAGAATAA 


TTGCAATTGG 


GATAATCATG 


GGCTAAAGAT 


ATCCGTATCT 


420 


CTCAANTTGC 


CATTTAGCTG 


TCTTGTAGTC 


CGCCCTCTTA 


ACATTGGAAT 


CAATAACCCC 


480 


TAACTTCCAA 


TTGNGACCGC 


TAATTCTTCC 


GGTGCGTTGG 


AAAGCTGAGA 


TGATTAGGGG 


540 


GACTANGCAA 


TGGAT 










555 



(2) INFORMATION FOR SEQ ID NO: 672: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672: 

GAATTCAGTA ATTAAATAAA GAAAAATACA AACGTTTGAA GGAGTTAAAA ATGAGTCGTA 60 

AAGAATCCCG AGTGCAAGCT TTTCAAACTT TATTTCAATT AGAAATGAAG GACAGTGATT 120 

TAACGATAAA TGAAGCGATA AGTTTTATTA AAGACGATAA TCCAGATTTA GACTTTGAAT 180 

TTATTCATTG GCTAGTTTCT GGCGTTAAAG ATCACGAACC TGTATTAGAC GAGACAATTA 240 

GTCCTTATTT AAAAGATTGG ACTATTGGAC GTTTATTAAA AACGGGTCGT ATATTTTAAG 300 

AATTGGAACA ATATTGAAAT TATTACACAG GGGGATACAC CTGCTTAAGG CCGGGNAATG 360 

AATTGAAGGC AGGTGGAATT TAACAAAANC AATTCAAGTG ATGATGATCC ATTATTAAAT 420 

TTTA 424 
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(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 



ACACATAAAG 


ACACCAAACT 


CTCATCAATA 


GTTCCGTAAA 


TTTTTATTGA 


CAGAACTTAT 


60 


TGACGGCAGT 


GGCAGGTGTC 


ATAAAAAAAA 


CCATGAGGGT 


AATAAATAAT 


GATGAAGCGC 


120 


AATATTCTGG 


CAGTGATCGT 


CCCTGCTCTG 


TTAGTAGCAG 


GTACTGCAAA 


CGCTGCAGAA 


180 


ATCTATAACA 


AAGATGGCAA 


CAAAGTAGAT 


CTGTACGGTA 


AAGCTGTTGG 


TCTGCATNAT 


240 


TTTTCCAAGG 


GTAACGGNGA 


AAACAGTTAC 


GGTGGCAATG 


GCGACATGAC 


CTATGCCCGT 


300 


CTTGGTTTTA AAGGGGAAAC 


TCAANTCAAT 


TCCGANCTTG 


ACCGGTTATG 


GTCAGTGGGA 


360 


ATATANCTTC 


CAGGGTAACA 


CTCTGANGGC 


GCTGACNCTC 


AACCTGGGT 




409 



(2) INFORMATION FOR SEQ ID NO: 674: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674: 



GAATTCCCGC 


GTGGTGATAT 


TTNTGTACGT 


GGTGCAATCA 


AAGAACATAT 


GGCGAGTTTA 


60 


GTCTATAAAC 


TAGAAGAAAT 


GGGCGTTGAA 


TTGGACTATC 


ACGAAGATGG 


TATTCGTGTA 


120 


CGTGCTGAAG 


GGGAATGACA 


ACCTGTAGAC 


ATCAAAACAC 


TACCACATCC 


TGGATTCCCG 


180 


ACTGATATGC 


AATCACAAAT 


GATGGCATAG 


TTATTAACGG 


CAAATGGTCA 


TAAAGTCGNA 


240 


ACCGAAACTG 


TNTTTGAAAA 


CCGTTTTATG 


CATGTTGCAG 


AGTTCAAACG 


TATGAATGCT 


300 


AATATCAATG 


TAGAAGGTCG 


TAGTGCTAAA 


CTTGAAGGTA 


AAAGTCAATT 


GCAAGGTGCA 


360 


CAAGTTNAAA 


GCGACTGATT 


TAAGAGCAGC 


AGCAGCCTTT 


AATTTTAGCT 


GGATNAGCTG 


420 


GTGATGGNAA 


AACAAGCCGT 


TACTTGAATT 


AACGCACCTA 


GATAGAGGCT 


ATTGTTGACT 


480 


T 












481 



(2) INFORMATION FOR SEQ ID NO: 675: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 



GAATTCCTAT 


GTATTTGTAT 


TGCCTATTGG 


GCTTAACAAT 


CAATTAGATT 


AGATCAATTT 


60 


TTTAAAAAAG 


GATACGCCAC 


TCAATTACAA 


GTGTGTATGA 


TATGTGTTGC 


TATTTTATTA 


120 


GGCACTGCAG 


TAAGCAATTT 


TATTGTAGAT 


GTGTTACAAT 


ACTCGACGCA 


GGTAAAATAT 


180 


TTAATAAAAT 


AAGTCTAACT 


CTATGATGTG 


TAATCAAAAC 


TAGATATAAT 


TAAATAATGA 


240 


CTTAAAATAA 


TTTTAAAATA 


GGGAAATGTA 


AAGTAATAGG 


AGNTCTAAGT 


GGAGGATTTA 


300 


CGATGGATAA 


AATAGTACTC 


AAAGGTGGAA 


ATAAATTAAC 


GGGTGAAGTT 


AAAGTAGAAG 


360 


GTGCTAAAAA 


TGCAAGTATT 


ACCAATATTG 


ACAAGCATCT 


TTTATTAGCT 


TCTGATAACC 


420 


CGAGTAAATT 


AGGTTAATGC 


TCCAGCTTTA 


AGTGATGTAG 


AACAATAAAT 


AAT 


473 



(2) INFORMATION FOR SEQ ID NO: 676: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676: 

GAATTCCTAG TCTATATCTT TTTTCGTTTG ATTGAGTCTC TTNTGTAAAT TCAATAAATC 60 

ATCCGCTTTA CCATNCAATG CCGATTTAAC TAAATTAAAA ATTTTATTGT GATCTGTTTC 120 

TATTTTAGTA ATTAAATCTG CCAGTAATTG TGCCNTGTGT CTTTCTATAC GTGTTGCTAA 180 

AATCGTNTCA ATAGCTTGCT TNTTTATCTT TGGCATTATN CAAAATTGCT TNTAATATAT 240 

CATCTGAAGA CGTGTCGCCA G 261 

(2) INFORMATION FOR SEQ ID NO: 677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 677: 



TTTTCGAACT 


AACATTCAAT 


TAATGGATTA 


CCTGCGATAA 


AAATGCCCTC 


GTACGCTCTG 


60 


ATTTAGGATG 


CGCAAAAAAT 


TCATCAGGTG 


CAGCTTGCTC 


CACTATTTCC 


CCACGATCCA 


120 


TAAAAATTAC 


CCGGTCAGCG 


ACGGTTCGTG 


CAAACCCCAT 


CTCATGTGTT 


ACACACAACA 


180 


TTGTCATACC 


CGACTGCGCC 


AGCCCAATCA 


TCGTATCCAG 


CACCTCTTTC 


ACCCATCTCA 


240 


GGNATCGGAG 


CGCCGACGGT 


TGGGCTCAAT 


CAAAACAAAC 


AATAATTTNC 


NGGCTTCCAT 


300 


ACACA 












305 



(2) INFORMATION FOR SEQ ID NO: 678: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678: 



GAATTCAGAT 


ATACATCGAT 


CTCGTCTATT 


AAATTCAAAA 


TACGATGAGA 


TGGGTGGAGA 


60 


TGTAATGAGA 


GATTATTATN 


CACTAATTTT 


CTTAGAAAAA 


TNNCAATGAG 


GGGTATTTAT 


120 


GATTANTGAN 


GCTTCACTAG 


CGATATTAGA 


TGATATTGAT 


GANCTAGCTG 


ATATGATAGT 


180 


TGCATCAGAT 


ATNTATGCAT 


CATTTGAGCA 


AGCTAAACAG 


GCGCTCGANA 


ACAATGANGA 


240 


AGCACATCTA 


CTTTATCAAT 


CANCTCNACA 


ATCAAAAGAG 


AAATNTGATG 


AGGTAGTGCG 


300 


TTTCGGGAAA 


TATCATCCNG 


GATTATGAGA 


AAGGTCATGT 


TAGGAGACAN 


AGACAACGCC 


360 


ATTNGAGCAT 


CTGAA 










375 



(2) INFORMATION FOR SEQ ID NO: 679: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679: 

GAATTCATTA TATATTTATA TACGTAAGAC TTTAGTGAAT ATATCTATAA TTATTTACTT 60 

GGTAAGCTGG TACCGATCTG TAAGTTAAAA TACCTGGTGC AGNTGAATAG NTCANTTCTG 120 

AAACTAAAAT NCTACCATCG NTATTTACAC GTTCTACAAA CATAACATGA CCATAATAAC 180 

CTACATCTGT TTGAGCGATA GAACCTACAG NAGGTCTATT GTCGATAGTG TCAACCATCT 240 

(2) INFORMATION FOR SEQ ID NO: 680: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680: 



GAATTCCTAT 


GATCCAGAGT 


GTTTCATCGA 


TTTAATCGGT 


TGAGGATTCC 


ATTCGTATCC 


60 


TTTATCTTTA 


CCAATTGTTA 


GACCACCAGT 


AGAGTGATCT 


GCTGTTGCAA 


CGACAAGTGT 


120 


ATCTTTATGT 


TTCTTAGCAT 


ATTGAATGGC 


GTCATCGAAT 


GCTTTTTCAA 


AACCTTCCAT 


180 


TTCAGACATA 


ACGCCTGTAA 


TATCGTTGGA 


ATGCGCTGAT 


TTATCTATAG 


AAGCACCTTC 


240 


GACCATTAAA 


AAGAAACTTT 


TTTATTACGC 


TCAAGCTTAC 


TAAGTGCACT 


TTGTTGCATA 


300 


TCAGCTAATG 


ATGGTTCCGT 


CTTTTAGAAG 


CATCTTTTGC 


AAGTGGCATA 


TTTTTTATCT 


360 


TGCAAACAAA 


CCAAGACTTT 


ATCTTTTATC 


AAGATTTTGA 


TGAATCCTTA 


TTGTCCNGGG 


420 


GAAGGTCNGG 


NACCATCTTN 


NNNGATG 








447 



(2) INFORMATION FOR SEQ ID NO:681: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681: 



GAATTCCCTG CAACACAGGA CGTTTCTCAG CGTATAATTT ATATTTATTT ATCTGTAAAC 
GTATGTTTAT TTACATCTAC TATTTTCAAG GTTGNTATTT NGTCTGGATA TTCAGGAAAA 
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TACCATTTAA AATATTCTGT GCCAGTAACA AATGTCATTG TTCCATCACC TTCATTATCT _ 180 

GAGTATAGTA TGTTTATTTT ATTAACTAAC ATAGAATCTA ATTTATTCAT TATTTTCAAA 240 

ACATCTTCAC TAGATAATAG TGCTCTTTGT ATATCTTTAC TCATCTTTCT ATTATTTATT 300 

TNNTCTTNAN CATAATGTTT CATTAATTGA CTTCAATAAA CTCATTTAAA AAGTAAATTT 360 

CATTAAAAGT ACCACCTACT AATGCAATTC ATTTNGTCAG ATGATTGTAA TACAATNGNG 420 

TGGGTTCAAG GA 432 

(2) INFORMATION FOR SEQ ID NO: 682: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 453 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682: 



GAATTCCGCT 


TTTAAAAACA 


AAATCCCTTC 


AACTGAAATC 


GTAGCGGAAA 


CAATTTATCA 


60 


AATTGTTAAA 


GAAAATTTGG 


CATCGCTCGA 


ACACCAACCA 


AAATGTATTC 


AAGTATTTGT 


120 


AAGAGAAACA 


CCAACAAGTT 


ATGTTGTATT 


TAGACCAAAG 


GAACAGGTGT 


AACTAATGGC 


180 


TAAAATTCCT 


GTACTCGAAA 


TATTCGGTCC 


AACCATTCAG 


GGTGAAGGTC 


GAGTAATCGG 


240 


TAGAAAAACG 


ATGTTTGTGA 


GAACAGCTGG 


ATGTGATTAT 


CGCTGTAGCT 


GGTGTGATTC 


300 


TGCATTTACT 


TGGGATGGTA 


GTGCTAAAGG 


CGATATTAAA 


ACTCATGACC 


GCTGAAGAAA 


360 


TTTATTGATG 


AATTAAAACG 


AATTGGGTGG 


CGATTTTATT 


TAATCACGTT 


ACAATTCTGG 


420 


TGGGGTANCC 


CAGCATTAAT 


TAAGGGTNTC 


CAG 






453 



(2) INFORMATION FOR SEQ ID NO: 683: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 : 

GAATTCCCAG AAACTAGATT TTNTCATAAT CGACCTATCT TTTGTCCAGA TACAGTGAGA 60 
CCCTTGTCAT TTAAATGATT TTTAATTCGT CTTGTACTAA AGACTTTTCT ATTAGAATTA 120 
AAAATATTTA TCGCGGCACG TTCTACGTGT GAATCATCTT TAGTGATTTT ATTATCTTNT 180 

657 



WO 97/30070 



PCT/US97/02318 



CTTTTTATAG AATCATAATA GGTACTTCTT GGTATTTTTA GGACTTTACA CATTGCTGGT 240 
ACTGAATATT GATGTGCATT CTTGTGAATG ACTTCTTATT TGNGCCCATA ATCAGCTCT 299 

(2) INFORMATION FOR SEQ ID NO: 684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684: 

GAATTCCTNA CCATCCATAG CTGTGNACAC AACATGATCT TTATCAATCA TCATAATGTT 60 

GCAAAGGGAT TTCTTTNATC AAACTCAGGT GCATATGCTC GNACTTCTTC CATAGTTANG 120 

GGCAATTTCG TTGGGGTTAA AGGGTGTGTT GGATGCCCCT CTGGNNCCAA GCTCTCAGAA 180 

TAAGTTAACT CATCTGTCTT ATCACTATCT TTTACATGTT GTANCCATGA AAAAAAGTTA 240 

ATATCTTGTG GTAATCTNGA AAATTTAAAA CTGTGACTTA TTAATGTGTG TCTGTGAGAC 300 

ATGTGNTTAT ATGTCACATC TAAACTATCT CGACTGTGTA TTANCTCCTT CGNNGTAGTC 360 

GCTCTGANNA NTAGGGATAT CAANGGTCAT GACTCAACAC ATCTAACAAC NCCTTCTTAA 420 

NGGTTATTA 429 

(2) INFORMATION FOR SEQ ID NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685: 

GAATTCCTAC AAGCGTCGTT GTAAATGGCG AATTGTANTA TCTCTATGTG CAATGACAAT 60 

CATCGTTGTA TCTTTAAAAT GTTGAAACAG AACGTTCTGA ATCATGTGTT CTGTTTGAAT 120 

ATCTAAACCA GTTGCAGGCT CATCAAATAT AACGAGATCT GGCTTCATAA CTAAAAGACG 180 

GCAAAGTTCT ATGCGTCTCA TCTGTCCACC AGATAACATT TCGCCTCCTT CACCTATTAT 240 

TGTGTTAATG CCTTTTGTGA AAGATTGCAC TTTGTCTAAT AAACCTACTT CGNCTAGCAC 300 

TTCTTCAATA GNATTATTTT CTATATCTTN AAACATCGNN ATATTCTCTT TNATAGAAGC 360 

ACTGAAAATA TATGGCTGGN GACTAAGTAT TCCCATATTT NAANTCACGC TGG 413 
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(2) INFORMATION FOR SEQ ID NO: 686: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686: 



GAATTCCAAG 


AAACGAATTT 


GCATAAAAGA 


CTAGCTCTTT 


TAGTGTTATC 


AATTGTTGGT 


60 


AATAAAACTA GAAATATAGT 


TATTGGAGCA 


ATTATCGTTT 


CAATTGTACT 


TGCATTTNTC 


120 


GNTCCTTCTG 


CAACAGCTAG 


AGCAGGGGCA 


GNTGTNCCAA 


TCTTGCTGGG 


TATGATTGCG 


180 


GCATTTAAAG 


TTTCCAAAGA 


TAGCAAGTTA 


GCGTCTTTAT 


TAATAATTAC 


TTCAGNACAA 


240 


GCTGTGNCAA 


TTTGGAATAT NGGTATCAAA 


ACGGCGGCAG 


CACAAAATAT 


CGNAGCGAAT 


300 


TAANTTGATT 


AACCCATCAA 


NNAGGATTNG 


NTGGTTCCAA 


NGGGGCGAAT 


TGGTTCTTAT 


360 


NT 












362 



{2) INFORMATION FOR SEQ ID NO: 687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 : 



GAATTCCTGG TAATACTAGT TGAAAAGGAG TGTAGTCATC ATGTCAGAAA CAATTTTCGG 60 

CAAAATTTTA ACTGGAGAAA TTCCTAGCTT TAAAGTATAT GAAGACGATT ATGTCTATGC 120 

CTTTTTAGAT ATATCACAAG TTACTAAAGG ACATACGTTA TTAATTCCTA AAAAAGCTTC 180 

TGCTAATATC TTTGAAACTG ATGAAGAAAC AATGAAACAT ATCGGTGCAG CATTACCTAA 240 

AGTAGCAAAT GCTATTAAGC GTGCATTTAN TCCTGANGGT TTAAACATTA TTCAAAATAA 300 

TGGTGAGTTT GGAGATCAAT CTTGTATTTC ATATTCCAAN NCCACTTAAT TCCNCGATAC 360 

GAAAATGATA TTTGATNGGT TTGGGTTATT AGNCGGGAAA CCACACCGAN NGCCATT 417 

(2) INFORMATION FOR SEQ ID NO: 688 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688: 



GAATTCAAAA 


AAGGTTAAGT 


ACAATTACTG 


ATTTCGTACT 


TAACCTTTTT 


TAAACTCTAA 


60 


TCATATGTTA 


GTTATTTCAT 


TCTTCGTAAT 


AATATTAAGA 


AGTATGGTGC 


ACCGATAATT 


120 


GCAATGATAA 


CCCCTACAGG 


AATATCCAGT 


GGCGGATGAA 


TGCCACGGGC 


TAAACCATCT 


180 


CCAAATGTTA ACAATATAGC ACCAATTAAC 


CCCGACATGA 


TAATAACGTG 


TAATGTTTTA 


240 


TTTCCTATTA 


ATTGGCTCGC 


AATATGAGGT 


GCAATTAATC 


CTAAAAAGCT 


AATACCACCG 


300 


ACAACTGAAA 


TTGCGGATCC 


CGCTAATATT 


ACTGCTAAAA 


TTAGCATAGC 


ATTTTAATTA 


360 


G 












361 



(2) INFORMATION FOR SEQ ID NO: 689 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 



TAGAATAGAA 


GCGTGTGAAA 


ATATAGAATT 


AAATATAATT 


TGAGGTGGAA 


AAATGATACT 


60 


AGTAATGTTA 


TCTCCATTAT 


TAATCATATT 


CTTTATAGTG 


TTGTCTATTT 


TAGAGGAGCG 


120 


TAAACGTACG 


AAGAAAAAGC 


AACTCGAGAA 


AGAAAAAGCA 


AATACACTAA 


ATCAAAATAC 


180 


AAATGACACG 


GAAAGTTCAA ATCAAGAGCC 


GTCATTGCAG 


CAGACTAAAG 


AACAAAAAGA 


240 


TAACAAAGGA 


TAATNCAATT 


GAAGGAAGAA 


GATTATAGAT 


GAAAATATTA 


ATTGTTGAAG 


300 


ATGATTTTGG 


TTATAGCAGA 


AAGTTTAGCA 


TCTTGAACTT AAAAAATGGA 


ATTACGGNGT 


360 


TATTGTCGGT 


GGAACAANTT 


GGATGATATT 


ACTTGTCTTA 


TCTTTTAACC 


CAACATCAAC 


420 


CCCAAGCTTG 


GTNTTGGCTA 


GATATTTAAT 


TTGGGCAACG 


NTAAATGGGT 


TTCCACTGGG 


480 


GGTCAAGGAA 


ATCCCGAAAA 


CATCTTAT 








508 



(2) INFORMATION FOR SEQ ID NO: 690: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690: 



ATCCCTAGGA 


TATACGAATG 


TNCCTGTTAA 


ATTAGATAAA 


GAAGTTGAAG 


GTACAATTCG 


60 


CGTACACACA 


GTTGANCAAT 


AAAGTTGGAT 


TGAAATAAGA 


GGTGTAACCA TTCATGGATA 


120 


GAATGTATGA 


GCAAAATCAA 


NTGCCACATA 


ACAATGAAGC 


TGAACAGTCT 


GTCTTAGGTT 


180 


CANTTATTAT 


AGATCCAGAA 


TTGATTAATA 


CTACTCAGGA 


AGTTTNGCTT 


CCTGAGTCGT 


240 


TTTATAGGGG 


TGCCCATCAA 


CATATTTTCC 


GTGCAATGAT 


GCACTTAAAT 


GAAGATAATA 


300 


AAGAAATTGA 


TGTTGTAACA 


TTGATGGATC 


AATTATCGAC 


GGAAGGTACG 


TTGAATGAAG 


360 


CGGGTGGCCC 


GCAATATCTT 


GCAGAGTTAT 


CTACAAATGT 


NCCAACGACG 


CGAAATGTCC 


420 


AGTATTATAC 


TGATATCGNC 


TCTTANGCAT 


GCATTAAANA 


CGTAGGATTG 


ATTCAAACTG 


480 


CAGATAGTAT 


TGCCAATGAT 


GGATATANTG 


ATGAACTNGG 


ACTAGATGCG 


ATTTAAGNGA 


540 


TGCAGAACGT 


CGNATTTTAG 


AG 








562 



(2) INFORMATION FOR SEQ ID NO: 691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 : 



CATAAGATAT 


TCTCACAAGT 


ATTCTTTATT 


GTNAAATTTG 


GTATATCAAT 


GAGACTCATC 


60 


ATATCCACCT 


CGTNGTTTTA 


GCAATCTATC 


TCTTAGCGCA 


TCTCCGGTTA 


AATTAAAAAT 


120 


TAAAATAGTT 


ATAGCAATGA 


CTGAAGCAGG 


TGCAATCAAC 


ATAATTGGAT 


GAGACGAAAT 


180 


AAAATCACGA 


CCTTGTTGNA 


ACATAGCGCC 


CCACTCTGGT 


GTTGGCGGTT 


GTGCACCTAA 


240 


CCCAATAAAT 


GATAGTGAAC 


TTATATATAA 


AATGATTTTA 


CCGAAATCAA 


CGACCATCAA 


300 


AACGATAATA 


GCCGGTATAA 


TTTTAGGTGT 


TAAATGACGT 


ATTAATATTG 


TNCTTGTAGG 


360 


TACATGAAAT 


AATTGTGCCA 


TTTTTATATA 


AGGCTTATTC 


ATTTCGCTAT 


TAACTATACT 


420 


TCTTAGTCAA 


CCTTGTGTAA 


TTCATCCCAT 


TTTATTTACT 


TGTAATTGGA 


GATAACTAAA 


480 


TTNCCACAAA 


AGATGGTTGA 


AAAAAGCTTG 


CTTAAAAGCA ATCAATGATG 


ATANTTCTTG 


540 


GAATACTTAG 


GACCNACATC 


AATAANCCTA 


ACAATTATCG 


GTCAATCCAC 


CCTTTT 


596 
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(2) INFORMATION FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692: 



GAATTCCCGG 


TAAAGAACTT 


TGTATAAGAC 


AGAATGATAT 


TATTAAGAAT 


AAGTGTATTT 


60 


ATAATGAAAA 


TGTGGTAACT 


TAATATAAGT 


AGAGGTGANA 


TAAATGGGTT 


TGAAANTAAA 


120 


AGAGTATATA 


AATGATAGTA 


ATAGGAAATT 


GGATAAAGAA 


ACAGTTAGAA 


AGCACTTAGT 


180 


AGATATACTA 


AATTACACTG 


TNGAAGAAAA 


TAGTTTNCTA 


GATCAAATGA 


CAGNTGGCAA 


240 


AAGACTATAT 


TCAAATCAAT 


ATAAAAACTC 


GCCCTCTNTA 


TCATTAACTG 


NAAANCATAA 


300 


AATCAAAATA 


AATGAAGGCG 


ACGAGNGATT 


TTAGCTATAA 


ATTTTCTGAT 


AATGAAACTC 


360 


AAACTTTTAA 


CCTNTCAAAN 


GAGTTAGGAG 


TAGCNGGCGN 


NATGGNTAGN 


ATNQGGNCCC 


420 


AAAACT 












426 



(2) INFORMATION FOR SEQ ID NO: 693 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693: 



AATAAGGAAT 


TTTAAATTGT 


GACTTTTNCG 


GAATATTAAA 


TTTTAAAAAT 


ATGAGGTTTG 


60 


TGAAGCGGAT 


TTCTCACAAA 


ATTTTAAAAA 


ATTTAAGCCT 


GAAAATGATA 


AAGCGGTAGG 


120 


GAACGTTTCT 


CTGAAAGTTA 


GTGATACAAT 


AGTTTTAAGT 


TGAAATACAG 


GAGGATGAAT 


180 


AACATGAATC 


AGTCAGTCAA 


NTTACTTAAA 


CATTTAACAG 


ATGTAAACGG 


CATTGCTGGT 


240 


TATGAAATGC 


AAGTTAANGA 


AGCAATGCGT 


NACTATATAG 


AGCTGTCAGG 


ANCAAATATT 


300 


GGAGATAACT 


TGGGTGGGAA 


TTTGGGGNAA 


GAAAAATGCT 


GAGAATGG 




348 



(2) INFORMATION FOR SEQ ID NO: 694 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 467 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694: 



GAATTCCGAC 


GTTAGATCAA 


TTGAAATCCA 


AAGATCGAAA 


CATTGGGACA 


TATTTATTAA 


60 


CGACAATGGC 


TTCAACAGGA 


TTTGGTTCAT 


TCGGAAAATA 


TTTATTGGGC 


ATCATTGTGG 


120 


CGCTGGCATG 


TCTAACTACA 


GCATGCGGGC 


TTATTGTTGC 


AGTTTCTGAA 


TATTTCCATA 


180 


GAATCGTACC 


TAAAGTATCA 


TACAAAGCAT 


TTGTATTAGT 


TTTCATTTTA 


ATGAGTTTTA 


240 


TTATTGCTAA 


CCAAGGTTTA 


AATGCTGTTA 


TCTCAATGTC 


AATTCCGGTA 


TTAAGCATTG 


300 


TATACCCAGT 


AGCAATAACT 


GTTGTATTAT 


TAATTTNAAT 


TGNCAAATTC 


ATACCGACAA 


360 


AACGCATTTC 


ACAAACAAAT 


TCCAGTTATT 


ATCGTATTTT 


ATATTGGTCG 


GATTTTCAAG 


420 


TGGTTATTAG 


GTAAGTTAGG 


GTTGGGCTTG 


AAAAATTAAC 


TTTTATT 




467 



(2) INFORMATION FOR SEQ ID NO: 695: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 



GGGGGGGCTG 


TTATTAAGCC 


TANGNQGGNA 


CCAGCAATGC 


CANGGCGGNN 


GCATTANCAA 


60 


ANACGANCTT 


ACCGCAAAAN 


CCGNNATGGG 


NGGGCGTTCC 


AAAATGTCGC 


TAAATTAATG 


120 


ACAAGCTGCC 


GGTAGTAGTA 


TTTTTGAAAA 


ANTTACCTAT 


GATTTTCGCA 


TTAGGTGTCN 


180 


CAATCGGATT 


AGCTGGCGGT 


GATGGCGTAG 


CAGCTATCGC 


AGCATTCGTC 


GGTTACATAA 


240 


TCATGAACAA 


AACAATGGGC 


GACTTTTTAC 


AAGTTACACC 


TAAAAATGTT 


ANTGAACCAA 


300 


CGAGTGGTNA 


CCNCTAGCAA 


TTNAGGTAAA 


ACCAACATNA 


CAANNAGGTG 


TGTTCGGCGG 


360 


TATTATTATC 


GGGGCCCTGG 


CAGCTTGTGT 


TATAACAAGT 


TCNATAACAT 


TAACTTACCA 


420 


TCTTATTTAG 


GTTTCTTCGC 


TGGTAAGCGT 


TTCGTACCTA 


TTATGATGGC 


TACAACATCA 


480 


TTTATTTTAG 


CATTTCCAAT 


GGCATTAATT 


TGGCCAACGA 


TTCAATCAGG 


ATTAAATGCA 


540 


TTCAGTACAG 


GATTATTAGA 


TTCAAATACT 


GGTGTTGCCG 


TATTCTTATT 


TGGTTTCATC 


600 


AAGCGTTTAT 


TAATTCCATT 


CGGTCTACAT 


CACATTTTCC 


ACGCACCGTT 


CTGGTTCGAG 


660 


TTTGGTTCAT 


GGAAAAATGC 


AGCTGGTGAA 


ATTATTCACG 


GTGACCAACG 


TATCTTTATC 


720 


GAACAAATTC 


GTGAAGGCGC 


ACATTTGACA 


GCTGGTAAAT 


TCATGCAAGG 


T 


771 
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(2) INFORMATION FOR SEQ ID NO: 696: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 : 



CTTGCGCTTG 


TCGACGTTTA 


CTTTCATGAC 


TAAGCGAATG 


AATTTGCCTG 


AAGTTGACCT 


60 


TGAAAAAGTA 


ATAGCACCAT 


GGAAAGAAAT 


ACTAGCGACT 


AATAGAGAGA 


TACCAAATTT 


120 


AGATAATCAA 


ATGTGTATTG 


GTGGTTTAGA 


CTTTGCAAAC 


ATTCGAGATT 


TTGNAAGTGT 


180 


AGGGCTATTA 


TTCCGAAAAA ACGATGATTA 


CATTTGGTTA 


GGACATTCGN 


TTGTAAGACA 


240 


AGGGTTTTTG 


GATGATGTCA 


AATTAGAACC 


TCCTATTAAA 


GAATGGGGAA 


AAATGGGGTT 


300 


ATTGACCATN 


GTCGATGATG 


ATGTCATTGA 


AATTGAATAT 


ATAGTTGGAN 


TGGGTTTTTN 


360 


ANACCAAAAG 


GCTAGAGAAA 


AATATGGGAT 


TGAAAAAGTC 


ATAGGCTGAT 


AATTATAGAA 


420 


CTGATATTGT 


AAGACGTGCG 


TTTAGGGATG 


CTGGCATAAA 


ACTTGGAGTA 


CTTAGAAATC 


480 


CCA 












483 



(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 : 



CAGTTAAAGT 


GTCTGTATCA ACTAACTACA ATACAATTGT 


TGAAGCTATG 


AAGTCTAAAA 


60 


AAGTTGATGT 


TQGTTTCTTA 


CCACCAACGG 


CATACACATT 


AGCACATGAT 


CAAAAAGCAG 


120 


CTGATTTATT 


ATTACAAGCA 


CAACGTTTTG 


GTGTAAAAGA 


AGATGGTTCA 


GCAAGTAAAG 


180 


AACTTGTAGA 


TAGTTATAAA 


TCAGAAATTC 


TTGTTAAAAA 


AGACTCAAAA 


ATTAAAAGCT 


240 


TGAAAGATTT 


AAAAGGTAAG 


AAAATTGCCT 


TACAAGATGT 


AACATCAACT 


GCTGGATATA 


300 


CATTCCCACT 


TGCGATGTTA 


AAAAACGAAG 


CAGGTATTAA 


TGCAACTAAA 


GATATGAAAA 


360 


TTGTGAATTG 


TTAAAGGTCA 


TGACCAAGCA GGTATCTCAT 


TATTAAATGG 


AGATGTAGAT 


420 


GCTGCGGCTT 


GTATTTAACG 


ATGCACGTAA 


TACTGTGAAA 


AAAGACCAAC 


CAAATGTATT 


480 
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TA A Af?Ar*Ar*A 


Pfl A A TTTTA A 
uunnl 1 X inn 


ATTAACACAA 

ill innwu«<m 


GCTATCCCGA ATGACACAAT TTCTGTAAGA_ 


540 


Of A/TA T 2k Tfifi 
V- V_AVjA 1 A 1 vjVj 


ATA A AfiATTT 
AlAnnuni J. 1 


PPAAftAAAAA 
\- i^aavaaaaa 


TTGAAAAAAG 


CTTTTATAGA 


CATTGCTAAA 


600 




f2TP a p a a a a t 


TATTAfWIAA 
1A1 XAO^uAA 


GTTTATTCAC 


ATGAAGGATA 


CACAGAAACG 


660 


AAAfSATTPAA 
AAAuAl 1\,AA 


ATTTPflAPAT 


T/7TAAGAGAf5 

X \J x nAUAunu 


TACGAAAAAT 


TAGTTAAAGA 


TATGAAATAA 


720 


TPATTATPfiA 
Iwil lnlA.UA 


ATTTf2f2TATT 
ax x iuuini x 


AAAAC^TTTC 


GTCAATAGAT 


ATATTCTAGA 


TAAATATTGG 


780 


A A A WfiPT AP^5 


\*vjv* iAnwVf x u 


AAAP AGAT A T 


AGAAAGGT6T 


CGCTGTACAT 


TTGAAACCAT 


840 


1 l\j 1 Al-AJL. nu 


A A A PPP A A TT2 
AnnLLLAn 1VJ 


TPTA Td AT AT 
X X AXUAl AX 


TTCAGTTTAC 


CTTGGCTTTT 


CTTTATTAAA 


900 


/2» 2k AOf^TVSTP 


nnAV^luiAul 


PAAATPfiAAT 


TTAAAAACGT 


CAGTAAAGTC 


TATCCTAACG 




GNCATfcTl ALHj 


LI IvxAAAAAI 


Ai lAiVL. X X AA 


ATA1 ivAAAA 


AGGTGAATTT 


GCAGTTATTG 




TCGGACTATC 


TGGTGCTGGG 


AAATCCACGT 


TATTAAGATC 


TTGTAAATCG 


TNTGCATGAT 


1080 


ATCACGTCAG 


GTGAAATTTT 


CATCCAAGGC 


AAATCCATCA 


CTAAAGCCCA 


TGGTAAAGCA 


1140 


TTATTAGAAA 


TGCGCCGAAA 


TATAGGTATN 


GATTTTCCAA 


CANNTNAATT 


TAGTTAAACG 


1200 


GNCAAGTGTA 


TTTACGAAAT 


GTNCTAAGTG 


GACGTGTAGG 


TTTATCANCC 


CACCTGGGAA 


1260 


AATGGTATTA 


GGGTTTANTC 


CC 








1282 



(2) INFORMATION FOR SEQ ID NO: 698: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698: 



CGTCTTGCCC 


GATCGAAACC 


ATGCTTCCAA 


AACTTGATTT 


CCCATTTGTC 


ATCCTAAAGG 


60 


TGCATATATG 


TATGGTTGGA AACCCCCGCC 


TGGTTATGAC 


AGCGACCAAT 


TTGACCAATT 


120 


TTTAGTGCAA 


GAGAAGTCAA 


TACTTGTTGC 


TCCAGGGAAA 


CCATTTGGAG 


AGAATGGTAA 


180 


TCGTTATGTG 


AGAATTTCAT 


TGGCGTTAGA 


TGACCAGAAA 


TTAGACGAAG 


CTGCAATAAG 


240 


ATTAACAGAA 


CTAGCATATT 


TATATGAATA 


ATAAAGTTAA 


GGGTGATGTT 


AAAATGACAA 


300 


AAATTAAAAT 


CATGAGTGTA 


CGTGATGAAG 


ATATGCCTTA 


TATCAAGGCG 


TGGGCAGAGA 


360 


AACATCATGT 


TGAGGTAGAT 


ATTACTAAAG 


AAGCATTAAC 


TGATGACAAT 


GTTGAAGGTG 


420 


TCGCTGGTTA 


TGATGGCTTA 


TCGCTATCAC 


AACAAATACC 


ATTATCAGAG 


CATGTTTATA 


480 


AACGATTAAA 


TGAACTAGGT 


ATTAAACAAA 


TCGCACAACG 


TAGTGCTGGA 


TTTGATACAT 


540 


ATGATTTGGA 


GTTAGCAAAC 


AAGTATAACT 


TGATTGTGTC 


AAATGTGCCG 


TCATACTCTC 


600 


CGANNCC 












607 



(2) INFORMATION FOR SEQ ID NO: 699: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 969 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 



GGCACGAGCG 


CTCACGCACC 


TCATCTTCAT 


TTTGAGCACC 


TATATCTAAT 


GTTAAATCTT 


60 


TAATTTCCGG 


TGCACCTTCA 


CTACCAGTAC 


GAAAATGTTT 


AGGTATATTA 


GAAACAACAC 


120 


CGATAATTTT 


ATCGCCATTT 


CTATTTTTAA 


TTACTAAGCG 


TTGTCCTTGC 


CAAATATCAT 


180 


TTGCAACACC 


ACCTAAATTT 


GTGAATTGAA 


TCATTCCATT 


TTTAGTGATA 


TTTGTAATCA 


240 


TAAATCCGAT 


TTCATCCATA 


TGTGCTGCAA 


TCATTACACG 


TTTTGCATTT 


GGATTTTTAG 


300 


ATTTTTTCAC 


ACCAAAAAAT 


CCACCCATAC 


GATTTTCAAT 


AAATTCATCT 


ACATACGGGG 


360 


GCAATTGGTG 


AGTCATATTA 


ATTTNNTACT 


TCTTCTTCCA AAACCTGGGT 


GNCCCATGAA 


420 


GGTCAAGTTA 


AAGTTTGGAT 


TCCGTTGTAA 


TGTTACTTTT 


TTATTTATGT 


TCATAAAAGT 


480 


TCTCACTCCT 


TATAAGTTAT 


TATATCATCT 


TCAATATGGT 


AAACTAATGA 


TATGAATTTG 


540 


AAATCAAGGG 


AGCATAAATG 


ATGACTAAAC 


TGAAATATAT 


AATTCCAACA 


ATAATTGCAG 


600 


v TAGCCATTGT 


AATTATTTCT 


ACCATTTCAA 


TCATCCAATT 


TATTAATCGT 


AAACGCTATA 


660 


ATCCCGTTAA 


AGTACTTAAT 


GAAGTAAAAT 


CATATTTTAT 


GAATGTCAAA 


GGCTCATATA 


720 


TCGTTTATGA 


ACCATTCGTT 


CATCCTGAAA 


CTGATAAATA 


CCGNTTAGTT 


TATCAAGGTG 


780 


GGAATTACAA 


CTATTAAAAA 


TGGGCAAAAA 


TATTCAATTA 


TGATTTNNAT 


GCAGATGCAT 


840 


ATTCTGGGGG 


AAGGTCATTA 


CATNGCNGGA 


GCGGTTAATA 


CGAATTTNNA 


TCCCATATTN 


900 


GGACTGGGGC 


CCGGNAAANC 


TTNAAAATGG 


AGNCACACCC 


CAATTAAGNA 


AGATGGGGTG 


960 


GNGGNTCAA 












969 



(2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:700: 



AATGACAAAA TATTAATCAA TTGGCGTATT ATACAAGAGT AATAGACATA TAAAAGTCGA 60 

ATGTAACTAC GTGAGTACTG ATTTTATTCT TTGTAAATTA CAAGCATTTC ATATCATAAA 120 

GTTTGAAAAG AGGTATATTG AAGTGGAGAA AAACGAATAT ACAGCTAAAT ATAATGAATA 180 

TAGTCAATTA TTAGACGCTA CTTACTCGCA AGCTGTAGCA TACCTTTTAA ATAAATATGG 240 

CGCTGTAACC GATGATTATT ATAAGGAAAA ATCATACACG CGATTTTTAA ATGGAGAAAT 300 
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CAAAAGTATT TCAAAAGGAA AATACAL. 1 AL» 




C!f2 A T"P AT & IT* 
wnl 1A1A1 1 


(TPTATTAPAT 


J O VJ 


AAGCGAAGAC AAATTCCAAA ACTTATt. 1 i\* 










CATACAACGT CAAAAGAAAG AAAACTTAGT 


tj X Av- 1 o 1 AiAl 




* ■fl' 11111% X|>1H|»IU| • 

All Irvul 1111 


HOU 


ACATGCAATT ATTACAAAAG AATCCAATGG 


CAATTGOGGT 


GTGGGCTGGA 


1 1 A 1\j IvAAA 




TGATCAAACC AACAGTCATT GATTGGGACA 


1 lAbLuAAln 




rr* a fsr* a tck a 


wUU 


TGCAAGCCAC CAAAtJCACvjl UCL. 1A1\j 


f"P Af2AATATT 


AfTPAnAfJAAA 


C* T* A fTY"* A TT* A 

V— 1 .TLV* 1 1 X A 


660 


NNATTGACGA TATGTTAGAA GGAATAGAAA 


TATATGATNT 


CCNTGAGTCT 


AGATAAATGA 


720 


TTAANGGAGA TATATTTCGT GNTCGNTGAG 


ATTGTGNTAT 


AGATAGNNTA 


ACCCNNTAAT 


780 


TAAATATTAT TCTATNAAAT AGACCACGCA 


TNCCTATCTA 


TAAACGGACA 


ACGTTTATAA 


840 


ATGAGTTTGC ATGGTCTTGA ATTGTATTAA 


ATNTCTTNTT 


GGGTTTTAAT 


AGNTCACTTA 


900 


GNGGGTATGT CAACATTCTT TNGACAGCTT 


CACGTGTATT 


TTT 




943 


(2) INFORMATION FOR SEQ 


ID NO:701: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701: 



CANTTGTGAA 


NTTGGCGTCC 


TTTTTGGANA 


CCCGATATAG 


CTTTATCTTG 


CCACCGNAGT 


60 


TTCATCTTGA 


CGTCCCAACC 


NACCTCCGGA 


CCAACATGGT 


AAATATGCTA 


TTTCATTATC 


120 


AGCCACCGGA 


TTTCCGAAAT 


ATTCAAGTTA 


NCGGATTGAC 


ATGATTGTTT 


CACCATATGA 


180 


AATAAGGATT 


GATAATTTCC 


TTGTAAAGGA 


TTTATTAATT 


CATCACGGAT 


CCGTTAAGTT 


240 


ATATTTAATC 


CTATAAAAAG 


CAGGCGTTAA 


ATGTAACAAG 


AGTTGCTGTG 


ATAATTTCTC 


300 


CTTATCTTCA 


ATGTTAATAA 


AAGTGATTTG 


TTCACAATGG 


TGAATCATTT 


GAGCGATGGC 


360 


CATCGTTAAA 


TTCGATATGT 


TATCTGATTC 


CTGCAAATCA 


GTCCATTGCA 


CACTTGTTGA 


420 


AAGTAAGTGT 


AATGTCAAAT 


ATAACTTTTC 


CGCTTCTGGC 


AAATCAGGCT 


CATGTTGCGT 


480 


CATAATCTCC 


GTTGCTTGAT 


ATTCTTTCGT 


ATCCCTCAAA 


TACTGATAAT 


TAATATTTAA 


540 


TGGATTCATC 


ACATGACCAC 


TTTGAAGTC 








569 



(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702: 



TCTTTAAAAA 


GGGNATANTT 


GTNCCCAANC 


TGGGACAAAA 


NTCANACNNA 


TATNGGTCTA 


60 


TNCTTTTGAT 


TTAATAAAGG 


GANTTTCCCG 


ATTTCNGCCC 


TATTGGTTAN 


NAATTGANCT 


120 


GNTTTAACCA 


CNAATTTAAA 


CGCGAATTNT 


TAACCAAAAT 


ATTAACGGCN 


TACCAAATTT 


180 


GCCAATTCGC 


CATTCAGNCC 


TGCNCAACTG 


TTGNGAAGGG 


CGATCGGTNC 


GGNCCTCTTT 


240 


CGCTATTACG 


CCAGCTGGCG 


AAAGGGGGAT 


GTGCTGCCAA 


GACGAGTAAA 


GTTGGGTAAC 


300 


CGCCAGGGTT 


TTTCCCAGTC 


ACCGACGTTG 


TAAAACGACG 


NCCAGTGAAT 


TGTAATACGA 


360 


CNCNCTATAG 


GGCGAATTGG 


GTACCGGACC 


CCCCCTCGAG 


GTCGACGGTA 


TCGATAAGCT 


420 


TGCCACCTAT 


AAANTCATAT 


CTATCNTNCA 


NATCTTGAAG 


TTCTTCTTCC 


GATGGACGTT 


480 


TACCATGTCT 


AATATCTGTA 


TAATATGGCT 


CTATGTCACT 


TTCTTTATAA GGTGTGCCAT 


540 


AAGCCATAAC 


TAATAACCCC 


ATTTTTTTAG 


TCATTGATAA 


TACCTTCCTT 


TAAATGAATT 


600 


ATCTTTCATG 


TGCTTCAATG 


TAATACTATG 


ATTATCTTTG 


TGTATATGTG 


TGTACGAAGT 


660 


C 
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(2) INFORMATION FOR SEQ ID NO: 703: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703: 



CNTNTGTGGG 


TCACGCGGNG 


GNGGGCGGCT 


CGTCTAGAAT 


AGTGGATCCC 


CCGGGCTGCA 


60 


GGAATTCAGA 


ATCGTACCAA 


CAACTGTTGC 


ATTGACAATG 


ACATTGGACA 


AATTGGATTT 


120 


ACCAATTGTC 


GGCAAACCCA 


CGTCATATAA 


GACATTGCCT 


AATCGTTATA 


AAGATGTACC 


180 


GGAAATTGGT 


CAACCAATGG 


AGCCGAATGT 


TGAAGCTGTT 


AAAAAGTTAA 


AACCAACACA 


240 


TGTTTTGAGT 


GTGTCAACGA 


TTAAAGATGA 


AATGCAACCA 


TTTTACAAAC 


AATTAAATAT 


300 


GAAAGGCTAC 


TTTTATGATT 


TTGATAGTTT 


AAAAGGGATG 


CAAAAGTCGA 


TTACACAATT 


360 


AGGTGATCAA 


TTTAATCGTA 


AAGCACAAGC 


AAAAGAATTA 


ANTGACCATT 


TAANTTCTGT 


420 


AAAGCAAAAA 


ATTGANAATA 


AAGCAGGTAA 


ACAAAAGAAA 


CATCCCAANG 


TATTAATATT 


480 


AATGGGTGTA 


CCGGGTAGGT 


ATTTAGTAGC 


AACTGATAAA 


TCATATATTG 


GGTGATTTAG 


540 


NTAAAATAGC 


AGGTGGAGAA 


AATGTTATTA 


AAGNGAAAGA 


TCGCCAATNT 


ATTTCGNCTA 


600 


ATACTGAAAA 


TTTGTNGAAT 


NTCANTCCAG 


ATATTATT 






638 



(2) INFORMATION FOR SEQ ID NO: 704: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704: 



GACAACTTGC 


TAAAGCACGT 


GATGAAAAAG 


TAAGTGAATA 


TGGAATTGAA 


CAAGCTGATG 


60 


GTACATTAAT 


TCAATATGAT 


AGTGAAGCCA 


AGATATATGA 


ACATTTTAAT 


GTGAATTTTA 


120 


TACCACCTGC 


TATGCGAGAA 


GATGGTAGCG 


AATTTGATAA 


AGATCTAAGT 


AATATCATTA 


180 


CATTAGATGA 


TATTAATGGT 


GATATTCATA 


TGCATACAAC 


GTATAGTGAT 


GGTGCGTTTT 


240 


CTATTCGAGA 


CATGGTAGAA 


GCAAATATCG 


CAAAAGGTTA 


TAAATTCATG 


GTAATTACTG 


300 


ATCATTCACA 


AAGTTTACGT 


GTTGCTAATG 


GCTTACAAGT 


GGAAAGACTT 


TTTANGACAA 


360 


AAACGAAGGA 


AATTAAGGCT 


TTAGATAAAG 


AATATAGTGA 


AATTGGATAT 


TTATTCAGGT 


420 


ACAAGAAATG 


GATATATTAA 


CCTGATGGCT 


CGCTGGATTA 


TGATGATGAA 


ATTTNAGCAC 


480 


AACTTGGATA 


TGTNATTGGA 


GCTATTCAAC 


AAAGCTTNAN 


CCAATCAGAA 


GAACAAATNA 


540 


TGGAACGGAT 


TAGCTAATGC 


ATGTCGCAAT 


CCATACGTGC 


GACATATAGC 


GCATCCAACA 


600 


GGGCGTATTA 


TAGGTAGAAG 


AGATGGTTAT 


AAACCGAATA 


TTGAACAATT 


AATGGCATTA 


660 


GCTGAAGAAA 


CGAATACAGT 


ATTAGAAATT 


AATGCCAATC 


CACATCGACT 


GGATCTTGAA 


720 


CGCTGAAATC 


GNTCGNNAAT 


ATCCAAATGT 


GAAATTAACT 


NTTAACACTG 


ATGGGCATCA 


780 


TNCAAATCAA 


TTNGATTTTN 


TGGAATTATG 


G 






811 



(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(V) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 



AGTAGGAAAT 


GCTTNCGCCA 


NTTANTTGNC 


CNCGGTTTCT 


ACCAAANTTN 


CCTGCATCCT 


60 


CCNAATGGAN 


ATAANACGNC 


AACCTCCNAC 


TAAACTTTGA 


AAATTCCACC 


CTAAACTAAT 


120 


CCCCCATAGT 


AAAATCAGGG 


ANCCCAAATT 


TTTAGCGCTG 


CTAATAATGC 


CGGACCAATG 


180 


CGTCAATAAA 


GCTGGTTGCG 


TGTTTTCAGT 


TTCACCCAAT 


TTACCTTCTT 


CATC AGT AAA 


240 


CATTGTCTCT 


AAAATATCAA AGTCTAACGT 


CTTTGCTGCT 


GAAGTTAAAA 


TTTCAGTTGC 


300 


TTGATCATTG 


TTATTAAACA 


AATCTTGTGC 


CATACCAACT 


TTTTGGGCAC 


CTTGTCCCGG 


360 
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AAAAATAATT GCTGTTTTAC TCATTTGATT CACCTACAGT CTCTTTCATT GTTTGTACAA 420 

TATTTTGTTC TCCTGCGATT TTCGCTTGTC TAATTGCAGA ATAAAAAGCT TTAGCATTTG 480 

AACTACCGTG TGCTTTAACT ACAGTACCTT CCAATCCTAA TAATACTGAA CCACCGTATT 540 

CTGAGTAATC CATCTTTTTA GCGAAGTC 568 

(2) INFORMATION FOR SEQ ID NO: 706: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706: 



CCANCGTTTT 


CTTAATTGCC 


CAAGCCTTCA 


AGNNATAAGA 


AGAATCCCGA 


ACCNAAGCAT 


60 


ATCGTCTCTC 


AATTTGGATC 


CGATTTGNAC 


ATANNNAAGA 


TGCTTTAGCC 


ATTAAGNNNA 


120 


AGTGNNNAAT 


NAAGTTGATA 


AATCTATTAA 


AACCAAGAAG 


TGAAGCGATA 


AGCTCAATAT 


180 


CAAGTTTAAC 


CGGAATATTA 


TTATTTGTAA 


CATCATTTTT 


AGGTATTACA 


TTCTTGATTG 


240 


CTGTATGTTG 


CATTATATAC 


ATTAAGCAAA 


TAGATGAAAC 


CGAAGATGAG 


TTAGAGAATT 


300 


ATAGTATATT 


GAGAAAGCTT 


GGATTTACAC 


AAAAAGATAT 


GGCAAGGGGA 


CTAAAGTTTA 


360 


AAATTATGTT 


TAATTTTGGG 


TTACCTTTAG 


TTATTGCACT 


ATCACATGCA 


TATTTTACAT 


420 


CATTAGCATA 


TATGAAATTA 


ATGGGTACAN 


CGAATCAAAT 


ACCGGTTTTC 


ATAGTAATGG 


480 


GATTATACAT 


TTGTATGTAT 


GCTGTTTTTG 


CAGTGACGGC 


TTATAATCAT 


TCCAAGCGAA 


540 


CAATTAGACA 


TTCCATATAA 


AATATACAGA 


TGGCTTTCAG 


TAGAGTAGTG 


GATTC 


595 



(2) INFORMATION FOR SEQ ID NO: 707: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707: 

ACCCTGATTT NCCCATCCAA TATATCCAAT TGAAACTTTT TAATTTACCT TCGCTTGTAA 60 
AGCCAANTTC ACATGATGTC CATGGTTCTA AATCTTCCTA CNNTGNGAAA TGCGTNTCNN 120 
GAAACTTCTT AANCCAATCT ATATAAATCC GTCCACAAAT CATAAAATAC TTTTTCAGAG 180 
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ACATAATACT 


CTCTAGGGAT 


ATATGTGTAA 


TAATTCAATT 


CATCACTACC 


TGGTTTAGTA 


240 


TAATTAAAAA 


TGACTTCCCC 


AGCTTGATCA 


GTTACATAAG 


CTATTGTAAA 


TACTTGCTCC 


300 


CATTCTACTG 


GTATCATCCC 


ACTAATCTCA 


TTCGCAATCT 


CGTTGTACAT 


TTGACTTAAT 


360 


TTTTCTTCAA 


AATTCATGTT 


ATCGCCCCTA 


TAGTTCAGCT 


TCATCTTGCT 


CTTTAACATA 


420 


ATCTTTTATT 


TTTTTACCCA 


ATTTATGGCA 


TATTCTTTTT 


CAGGCCAAAT 


TCCAAATTTT 


480 


TTATACATGT 


AATAATGTTC 


TCTTCCCATT 


TGTCCAAACT 


CTGTATTGAC 


CCAATCAATA 


540 


TAATCAAATG 


AAACTAATTC 


AATTTGCCGT 


CTCTTGTAAA 


GTCAAATTCA 


CATGATGTCC 


600 


ATGGTTCGAG 


TCCTTCTTCT 


TTAAATAAAT 


CTCTTAGTTT 


ATCAAATNGC 


TTATATAATT 


660 


CATATACTGA 


GTCCATAAAT 


TCTGATCTTG 


ATATATTATA 


TTTATTTAAC 


ACGCTCGTAT 


720 


AGTAAAATAA 


TTCATCACTT 


CTTGGCTCAG 


TATAATTGTA 


GAACACTTCT 


CCACTNCTTT 


780 


CATTTACATA 


AGCCATTGCA 


TATACCTTTN 


CCCATNCTAC 


CGGTATCATA 


CTGNTAATTT 


840 


TATTTGGAAT 


TTCATTGGTA 


CATTTGACTT 


AGTTTTTCTT 


CCGAAAGTCA 


TATTATCCGC 


900 


CCCTATTGGT 


TCAAGCTTCC 


ATCTTNGCTC 


TTTTAATATT 


TTTGCTCGGA 


TTTCTTTTAA 


960 


TTTCTTCCCA 


ATTTCATATT 


CCCATTTCTT 


GG 






992 



(2) INFORMATION FOR SEQ ID NO: 708: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) JSTRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 708: 



GAATCCTTAG 


TTTTAATTAA 


ATCCCATATC 


AATTGATCTT 


CAACATGAAT 


ATATCGATGT 


60 


ATCGGATCTT 


TAAAAACTTT 


TTCTTCTACT 


AATTTTAAAT 


CTACATATGC 


GTTAGTCATT 


120 


ATTCCCCTCC 


TTTTCGTTTA 


ATATAATATT 


TAATTTACTT 


AAAATGCTTT 


GTACATAAGT 


180 


GCTAAGTCTA 


ACTTTTCGCC 


ATACATTTCT 


GGCTGATAAG 


AGCGTAAGAT 


TGTAAAACCT 


240 


TGCTCTTTAT 


AGTAAGCTAC 


TGCTTCTNCA 


TTTNTATTAT 


CTACTTCTAA 


GTAAACACCT 


300 


ACAAATTTAT 


CTTCAAAACG 


TGATAATCCT 


TCATTTAACA 


AATGCTGTAC 


CATAACCTGT 


360 


ATGGTTGGAT 


TCTGGTTTAC 


ATAATGAGCT 


GATAAATATA 


ATTCTTCACA 


CCGNCAAATA 


420 


AAGTTAGCAA 


AGCCACGATG 


TCATTACCTT 


CCTCCAACGA 


CTAAGAATAA 


TGGTCTTGGA 


480 


GGGCTTNTCT 


TTTAAATGAT 


GGTCCAATAT 


ATGAAGCTCC 


TTACAAAGTG 


GATAACTGGC 


540 


GAGGG 












545 



(2) INFORMATION FOR SEQ ID NO: 709: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2247 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709: 



AGCAGAATCT 


TTTTTAGCAT 


GATCTGTCAT 


AATGATCATA 


CGCTCTGGAT 


TTAAATCAGC 


60 


TAAATGTTCA 


GTGTCTAATT 


GTAAGTAAGG 


TCCTTTCAAA 


TATTTACTTA 


AACCTTGTGT 


120 


TACATCGTCA 


CTTAATGCAT 


TTTTAAATCC 


TAGNTCGTTT 


AAAAATTGTC 


CAACATATGA 


180 


ATAGTGTGGA 


TGTGCTAATA 


AACCAGCTTT 


AGCAACTACT 


GCTGGAAGCA 


CTTTGTGATT 


240 


TCTATCAAAT 


TTAATTTCAT 


CTTTATACTT 


ATTGATTAAT 


TTATCATGCT 


CAGCAAGACG 


300 


TTTNNCGCCT 


TCTTTNTCTT 


TATTTAAAGC 


TTTAGCAATT 


GTTGTTGAAC 


GAATTAATAT 


360 


TGTGGGTGTA 


GTCTCCATCA 


AAACTCTTTA 


ATGATAATGT 


GGTGCAATGT 


GGGCTAATTC 


420 


TTTATTAATA 


CCCTTATGTC 


TACTGCTATC 


AGNGATAATT 


AATCCCGGNT 


TTAATTTACT 


480 


AATNTCTCTT 


AAGTTOGCTT 


GTTACGTGTA 


CCTACAGAAG 


TATTACCCCC 


AATTTTTCTC 


540 


TTACTGGGTT 


ATGATACGTT 


TTTTCTTACC 


ATCATCAGCA 


ATACCAACTT 


GGTNTAACGG 


600 


CTATATGCTG 


NTAATGCAAC 


CTTGCAAATG 


AGTACTCTAA 


TACAACGATA 


CGTTGTGCAT 


660 


CTTTAGGTAC 


TTTTACTGTA 


CCATTTTCAT 


CTTTTACCCG 


AAATAGTATC 


TTTAGTTGAT 


720 


GATTCTTCTT 


TTACTTGAAT 


TATCCGTATT 


ACCACAAGCT 


GCAACTAAAA 


GTAAGGCAAC 


780 


TATTAATCCC 


AATATACTAA 


AAGTTTTTAG 


ACCTCTCATC 


NGTCCCACTC 


CTTAATATGT 


840 


ATANCTTCAT 


TTATTATTTT 


ATTGATAACA 


ATTATCATTG 


TCAAGTAGCG 


TTCAATCTTT 


900 


TTTATATTTC 


TAAAATGTAT 


GACTATATAT 


TTCCTCTAAT 


AATTATGACT 


ACAATTAGCA 


960 


CATTTCCTTA 


GACAAAATAC 


TGATAATGTA 


TCATTGCTAT 


ATCATCTTTG 


CATTAATACA 


1020 


ATTGACACCA 


CTTAGCATGA 


CCGNTATCCC 


TGTAATTCAG 


CTGATATTAT 


CTGTTGCAAT 


1080 


TTTATGTGAC 


GAACTGTTGC 


ACTTAATTTG 


ATAANTCAAC 


AANTACAANA 


NATCTAAGTT 


1140 


GAACAATTAT 


GATACAACCG 


TGCAAACGAT 


ATGTAGTATA 


ACTTGTCAAC 


TTAGAATTAT 


1200 


TGATAAATAT 


ATTAATATTG 


GTTTACCATA 


GCAGGAGATT 


TCACATCAAA 


ATTTTGAAGT 


1260 


AGCGTATCAA 


TCTTTGAATC 


ATCAATATAT 


ACCTTATGTA 


AATTTTTCAT 


ATACATCGAA 


1320 


TGAGAAAGTG 


CTTCATAATT 


TAATGAAAAA 


GATATATGAT 


CTCCAACTTG 


ATAGTGTCCT 


1380 


TGACCATTTA 


AATCAAGCAT 


TAAATGATCA 


CTCGAAGCGC 


CTAAAATATT 


GATATGCTGA 


1440 


TCCATAGGTG 


AAATATTATC 


GACTTGTGTA 


TCTNAAATAA 


CCAATATCTA 


CAATAGCTTG 


1500 


TAAGAATGAT 


TCATGCGTGT 


GTGTATTAAC 


TCGAGGTTTA 


ATTTCTAAAA 


TCTCAGCCTC 


1560 


CAATGTAATC 


GCATCTTGAT 


ATAACATAGC 


GAATCGCTTG 


ATTTGCGTTG 


TTTCAACAAC 


1620 


TCTAAACAAC 


GTNTCANCTA 


TTCGGAANTC 


AATTTATTTT 


TACCCAAATC 


AATATATAAA 


1680 


AGGTGGGGGG 


NAACATGCTC 


CGAATTACCA 


CCCGGAAATA 


ATTTNCANTC 


GATATCCTAT 


1740 


TTCTCTTNCA 


ACAGCTGAGA 


CGAATCGATT 


AATCATAAAG 


ATATCANCAC 


CACTTGGCGC 


1800 


ATCAGATTTA 


AAACACATAA 


AATTGAATGC 


TAAACCTACA 


AAATGGATAT 


TTTNCAAGTG 


1860 


AATAATCTCT 


TTANTATAAT 


CTAAAACATC 


ATAAGTCAGA 


ACACCTTCAC 


GGACATCTTT 


1920 


CCAATCTACC 


ATTAATAAAA 


TCTTATGTTT 


TTTTCCTAAA 


ACTTCTGCTA 


CTTCATTTAT 


1980 


NTGATGTATG 


GTAGATAATT 


CTGTGTGGAT 


ACTCATATCA 


ACTTTCCTCT 


ATCATATCTG 


2040 


AAATCTCTTT 


TGNGGGAGGC 


GTACGCAATA 


ACGTATATGT 


TAAATCCTGA 


TCTGCAATAC 


2100 


TAATTATGTT 


ATCCAATCTG 


GATTCTGCAA 


CATGATTGAT 


ACCTAACGCT 


TTTAAGCTTN 


2160 
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CTACAATGGT ACGGGCANCA GCTATACACT TAATTACTGG TGTGANTNGN ATATTTTT^C 2220 
TTTGAAAACT NNGTGGAGGT ACTTGGG 2247 

(2) INFORMATION FOR SEQ ID NO:710: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710: 



TGATGCAAGT 


TGTATNATTT 


TCAAACCCCC 


NNCCCAAGAT 


CAAGTAAGAC 


AATGTATAAT 


60 


GTGGCACGAA 


CCGCGACAGG 


TGCAAAAAGT 


CATTTAAATT 


GAATTGGATC 


AAGATGATCA 


120 


ATTAATTAAA 


AATCTTGAAA 


ATAGACAAAT 


TGTACAAGCA 


CATCCAATGG 


GAGGCATACA 


180 


AATCATACCG 


GAAACCAATC 


AAGTTATTAG 


TCCAAGGTTT 


GGAACATTAA 


CAAATATGAT 


240 


TGCGATAGGT 


CAAATGACAA 


ACGGTGTTAA 


TAAACTACGA 


AATGGTGTGA 


AAATGATAGT 


300 


AGAACAAGTT 


GCGCATACAG 


TATCTCAATT 


ATATGATGCT 


TTAGAATCGA 


ATGAGCAACA 


360 


ACAGCGCAGT 


TACAATCAAT 


AATTTGTAAC 


TAGANGATAA 


TAANGNGAAC 


GCTCTAGAGA 


420 


GACGAATTGA 


AGGTTTGGAT 


TTTNCTGTCT 


TGTTAGTAAG 


CATCATATCA 


ATGAGATGCC 


480 


TATNGTCCTC 


AGATTATATT 


A 








501 



(2) INFORMATION FOR SEQ ID NO: 711: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:711: 

AAGCTGNCCG ACAATTCCAC AATTGAAATC CCACATCATT ACCNAAGATA AAATTAAGAN 60 

NNCCAGTAAT GGTTGGNACA ATACTGCTGT TGCTACAAGT TCATGATATT TCAAGATACT 120 

ATTCANCGGA TTAACTGGCA TTCCAGTTTC TCANATGGAT GATAACGATA TTGNACGTTT 180 

AAAAAATATT TCTAATCGTT TAAGAAGTAA AATCATAGTC AAAGATCAAC CTGAAGNNAT 240 

GGTTTCAACG TTGCAATTCN CCGTAATCGC GCCTGGGGTT TGATGACGGC CAACCCGTCC 300 

AATTGGTAGT TTCTTAATTT TGTCGGCCCT ACTTGTTGTT GGTAAAACAG AGCTTGCTAA 360 
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ACAATTACCA ATTGATCTAT 


TTGGTAATAA AGATGCACTT 


ATTCGACTTG 


ATATGAGTG& 


420 


ATATAGTGAC 


ACAACAGCTG 


TTTCAAAAAT 


GATTGGTACA 


ACTGCTGGTT 


ATGTCGGTTA 


480 


TGATGACAAT 


TCAAATACGT 


TAACTGAAAA 


AGTACGCCGT 


AATCCATACT 


CAGTCATTCT 


540 


ATTTGATGAA 


ATCGAAAAAG 


CAAATCCACA 


AATTTTAACA 


TTGTTATTAC 


AAGTAATGGA 


600 


TGATGGTAAT 


TTGACTGATG 


GTCAAGGTAA 


TGTCATCAAC 


TTTAAAAATA 


CAATTATTAT 


660 


TTGTACATCA 


AATGCTGGCT 


TTGGCAATGG 


CAATGACGCT 


GAAGAAAAAG 


ATATTATGCA 


720 


CGAAATGAAA 


AAATTCTTCC 


GCCCTGAATT 


CCTTAACCGC 


TTCAACGGCA 


TCGTT 


775 



(2) INFORMATION FOR SEQ ID NO: 712: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712: 



ACTTGTGCAC 


TGCCAATAAT 


AATTAATCCT 


TTCATATTTG 


ACCCCTCTTT 


TTTTTATTAA 


60 


ACTTGTTGTT 


CACGTAATAA 


TTTCATAATA 


GCTTTTCCAA 


CACCACTATT 


TTCATTCGTA 


120 


TCTGTGACAT 


ATTTCGCTAT 


TTTTTTAACT 


TCTTCTGCAC 


CATTTTCCAT 


CGCAACTGGA 


180 


TAGCCAACTT 


TCTCTAACAT 


TGATAAGTCA 


TTTAAATTGT 


CACCTATTGC 


CATGACATCT 


240 


TTCATTTCAA 


TCCCTAATCT 


TTCCGGCAAT 


TGTTTCTTAG 


CGCAATACCT 


TTNTGGTGCA 


300 


TCTTGAATGC 


GTTATTTCTT 


ATTTTTTCCT 


CTCGAAGATG 


ATGATATTAG 


CTTAAATTCC 


360 


GGAGATTCCA 


AGTTTAAAAT 


TTTTACTTAG 


CTTTGGTCAA 


ATTTTCTTTA 


AATTTTCCAT 


420 


CAAAATGCTT 


ATATTTTCCA 


ATAATTAATC 


AACCAGGTNT 


TTTGTTTTTC 


CA 


472 



(2) INFORMATION FOR SEQ ID NO: 713: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713: 

TTTTTATTAA 60 
TTCATTCGTA 120 



ACTTGTGCAC TGCCAATAAT AATTAATCCT TTCATATTTG ACCCCTCTTT 
CACGTAATAA TTTCATAATA GCTTTTCCAA CACCACTATT 
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TCTGTGACAT 


ATTTCGCTAT 


ttTTTTAACT 


TCTTCTGCAC 


CATTTTCCAT 


CGCAACTGG& 


180 


TAGCCAACTT 


TCTCTAACAT 


TGATAAGTCA 


TTTAAATTGT 


CACCTATTGG 


CATGACATCT 


240 


TTCATTTCAA 


TCCCTAATCT 


TTCGGCAATT 


GTTrCTAGCG 


CAATACCTTT 


NGGTGCATCT 


300 


GAATGCGTTA 


TTTCTATATT 


TCCTCTCGAA 


GATGATGATA 


TAGCTAAATT 


CGGAGATTCA 


360 


GCTAAAATTT 


TACTAGCTTN 


GGCAATTTTT 


TCTTAAATTT 


CCATCAAATG 


CTAATATTTT 


420 


CATAATTAAT 


TCAACAGGTA 


TGTTTTCAAT 


AGGATCATAA 


TTATCAACAA 


CTTTCAACGG 


480 


TNCCAATATC 


TATGCGTCTT 


T6NATACCAT 


CTTNAATACG 


CTCAACGTNT 


GGATGTTGAC 


540 


CTGCACGCTC 


AGCAATATCT 


ATGTAAATGT 


CTAAATCTCT 


TGGGGGATCT 


TCCAGTATAA 


600 


ATCGCACGAC 


TCGGNTATNC 


TGGATAATAA 


GTACCTGNAT 


CTTTTAAAAC 


ATNTGTAATG 


660 


NGGTGTACTA 


ACGATTTATT 


ANGGNGTGAA 


GTGCTCATTA 


CATTGAAAGT 


TTCATCACGT 


720 


TCTTCAGNAC 


CATTCAGACA 


AATATATGGT 


ACTGNTAAAT 


CTGTGTCAGC 


AANTGGTGCT 


780 


TGTGCTTCAA 


AAAATGCTCG 


ACCTGGCACG 


AGCGATAACA 


ACCGTTATCC 


CTTGTTCTTG 


840 


AGCGTATTNA 


NTCGCATCAA 


TATTAGGTTG 


AGAAATTTCA 


TNTGCTGCAT 


TAAGGTAGCG 


900 


NGCCATCCAT 


ATCAGNGGNT 


ATTAGGTTTN 


TCANTATGTT 


NCCNCGCCCN 


CGNCTTTCGT 


960 


AAATTTAAAA 


TCTTGTTCTT 


AAATAAGAAT 


ATATACTCAG 


CGCACATACT 


TTTCTATTAA 


1020 


CATTTATATT 


GTCATTAATT 


TATCATATAA 


TGTAATTCTA 


ACAAATTTTA 


ATTAGTATGT 


1080 


ACTATCGTCT 


TAATTGGTGG 


ATTTCTTATT 


GGCTCTTAAG 


TTTTTAAAAA ATGTTGGTTA 


1140 


ATAATGGTGC 


TACATGCTTC 


TTTTAAGGTA 


CAACCTTTAT 


CAACAAATTG 


GACGGATGAT 


1200 


TAAAAATTAG 


GTTNGGTTGG 


AAATAAATTC 


AATTAAACTT 


GCTACTACAA 


ACCACC 


1256 



(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714: 



GAATTCTGCT 


GTAGGCGTCC 


AATTTCCAAA 


TGTACCACCA 


CCCATCAAGA 


GTAATAGATT 


60 


CAATATGCTC 


AGCGATTCTT 


GGATCACGAA 


TCAATGNCGT 


TGCTACATTC 


GNAAGAGGAC 


120 


CTGTCGCTAC 


AGAAGTAACA 


GGTGTATCAN 


TCTTCTACAC 


NCGGTTCATT 


ATCNNNACTC 


180 


ATGCTGGCAT 


TGCAACTGCT 


TGACGTGATG 


GTGTCGACGG 


TAGTTTCGGA 


CCATCTAATC 


240 


CAGATTCCCC 


ATGTATTTCA 


GAAGCAAAGG 


CAGCTGGTTT 


AATTAACGGC 


CTATCCGCAC 


300 


CTTTCGCTAC 


TGCTATATCT 


TGGCGTCCCA 


TAATATCCAA 


TACGTCCAAG 


GCGTTTGTCG 


360 


TATNCTTGTC 


AACTGATTGA 


TTACCTGCGA 


CTGTTGGTTA 


CAGCTTAATA 


TCTCTAGTGG 


420 


ACTTGTCACT 


NGNCCCCGCT 


TAAAATTAAT 


GGTATTGCAT 


CAACGGNGTC 


CTGGATCACA 


480 


ATCCATAATA 


ATCTTCTTA 










499 



(2) INFORMATION FOR SEQ ID NO: 715: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715: 



TCTTATACAG 


GAGAAAATGG 


CATAGATGGT 


ACATCTGGAG 


CAGCTAAACG 


CATGATTGGT 


60 


GCATCTAAAT 


CGAACAAGCA 


ATGCTCTGCA 


ATAATCGCTG 


ACACTTCTGA 


CATAATACTA 


120 


CCTTCTAAAT 


TATCTTCAGT 


TACAAGTAAA 


ACTTTACCTG 


TATTTTTAGC 


ACGATCAATA 


180 


ATTGTTTCTT 


TTATCTAATG 


GATAAACAAG 


NNCGGTAAAT 


CAACGACTTC 


AACGTNNGAT 


240 


ACCCGTCTTG 


CAGCTAAAAT 


ATCCCGCTGC 


TTGGGTAAAC 


AAATAATTGA 


CCCATTAATC 


300 


CCATAACAAA 


AATACTNGTT 


AAATCTTCCA 


NCCTCANCGG 


TTNTACATCT 


TGCTTTTCCT 


360 


AAAGGTACAA GGGGTAATAT 


TCCTCCTNGA 


ACTTCTTCCC 


CCNTTAAAGA 


ACGATAGCTT 


420 


TTTTATGCTC 


AAAGTACAAT 


ACTGGGATCC 


AATTGGGGTC 


CGGTAGATGA 


TAATAAAGGC 


480 



<2) INFORMATION FOR SEQ ID NO:716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716: 



GAATTCNAAT 


ACTTCTTCAT 


AGAANANGTT 


TGTAACGTTA 


GATTTAGTAG 


TTAAAGTAAG 


60 


GTCATAAATT 


TACAATATGC 


CTTGAATTAA 


ATTATGTTTA 


GACTGAGCAA 


AACTCTCATT 


120 


TTTAACATCC 


GCTGTAATTT 


CTNTATCAAC 


TAATTTTAAA 


TTAAATTGAT 


TTAAAATATT 


180 


CTGTATAAGT 


TTAGCTCTCG 




TATATCAATA 


CCCAACATTT 


CTAATNCATT 


240 


AATCGACAAT 


CCGACATCTG 


ATAGACAAAT 


NNCATTATTA 


GGNAGTGTAT 


CAGCATATAT 


300 


TCTAATAAAA 


TCACACAAAT 


GGCTCTTAAN 


GGGAGTAGTA 


ATTTCTTGTA 


GAACTATCTT 


360 


AANTCTGNGG 


AAGATATTAA 


TTGNGGCTTT 


ACNCAGGTGG 


ANATATTCCG 


NACACTCTTT 


420 


CTTCCGATAG 


TCTTCACTAA 


GTATCCCCCC 


ANNTCTTNNA 


CAAGAAATCT 


TGGNTCTTNT 


480 


ATNACAANAT 


CGGGTNTNTN 


TTAGGNTTAC 


TCCAAAAAGG 


AGGCTTAATG 


ATCCTATNAN 


540 


NCCACCTG 












548 



676 
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(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717: 



TTGTCATTAT 


AAGCCCTCCA 


TTTCATGATT 


TGATTTGCCA 


CTCGGAACCA 


TTGGGGTTAC 


60 


AGCTTCAGTA 


GGGGAAATAC 


GAACCTCAAT 


TAAAACTGGT 


CCTTGATAAG 


CAAACGCTGC 


120 


ATCTAATTGT 


TCTTCCAGTT 


GTTCTGGCTT 


ATCGATTAAG 


AAACCTTTGA 


CGCCATATGC 


180 


TTCTGCCATT 


TTCATAAAAT 


CAGGTTGACC 


ATTAAATACT 


GAGTGTGAGA 


AGCGTTGATT 


240 


AAAGAACTTA 


TCTTGCCATT 


GTTTAACCAT 


ACCTAATGTT 


CCATTATTGA 


TTAGTACGAT 


300 


TTTGACATCT 


AAACCATATT 


CGGGTAAAAG 


TGCCATTTCT 


TGGTTTGGCA 


TTTGGAAACC 


360 


ACCGTCACCG 


ACGAAACATA 


CGACTGTTTT 


ATCAGGATTA 


GCTAATTTGG 


CACCAATTGA 


420 


CGAAGGAATA 


CCGAATCCCA 


TGGTTCCTAA 


ACCACCGNTT 


GTAACCCATT 


GTCCGTGATT 


480 


TTTAAATGGA 


TAAAATTGAG 


CTGCCCCACA 


TTTGGATGTT 


GTCCCACGTC 


TGTAGTAACA 


540 


ATTGCTTCAN 


CCATTTGTAA TTTTGGCCGA 


TATATTCCGA 


TTGTTTGTTG 


GTGGCTT 


597 



(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718: 



CCAGGAGCCA 


TATTTANCGA 


TGATGNACCC 


ACACCGCTAT 


TTACTCNAAG 


AANACATGGG 


60 


ATGAAAAGAA 


ACCCGCTTGT 


ACAGTGATAN 


CGATGTACCC 


TCATTTAAGA 


TGGTGTATTA 


120 


TCACTCGATC 


TCACAACTGT 


TCTTATCCTC 


AATCAATTAG 


CTAATACTTG 


AACCCAAATG 


180 


GTGCTGTATA 


TCTTGTAAAT 


CTATTTTCAA 


ATATTAGAAC 


ACCCGAAAAC 


CTCANACATA 


240 


TCACAAATCC 


ATACGATGAG 


CACACTGATA 


TTCATTTGAT 


GAAAGCGATT 


AGTGAAAGTG 


300 


ACACAGTGAT 


TCTTGCTTAT GGTGCCTATG 


CGAAGCGACC 


AGTTGTTATC 


GACCGTGTCG 


360 


AACAAGTGAT 


GGAAATGTTA 


AAACCTCATA 


AAAAGAAAGT 


AAAAAAGCTC 


ATCAATCCAG 


420 


TAACAAATGA 


AATTATGCAT 


CCACTCAACC 


CTAAGGCACG 


TCAAAAATGG 


ATTTTGAAAT 


480 
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CATAGCAAAT ACATGGAGGT CAATATGGAA ACAAAACAAA TCAACATCGT TTTACTGTAA 540 
ATGATATTGT TGAAACAATT CCTGACAATG AATTCGAC 578 

(2) INFORMATION FOR SEQ ID NO: 719: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719: 



AAATATTGAT 


AATGCTAGTA 


NCCCCTGTAC 


CTTANTCTTT 


TCAAAACAAT ATCTCCTCTA 


€0 


TAAGTCGGAA 


AATCATTTGA 


ATAAGCTTGG 


AAATACAATA 


TCTNNCCATG 


TGTATCATTT 


120 


CATGGGCTTG 


TGTACCCAAA 


AGACGGCAAC 


ATTAATATGA 


TNTGANTTAT 


CACNAATACC 


180 


CATAACTACA 


ATCTAAGGTT 


CTTTTTATTA 


TACCCTAATT 


TTTGTTCATT 


ATTATTTAAT 


240 


TTTTGTGAAT 


TTTATGTTTT 


CTATAAATTT 


AATTATTTTA 


CTTTAACAAT 


TCATTACGCA 


300 


TTTAGCATTT 


CAAGGCATAC 


ACAATATTTA 


TTACTATGAT 


TTCATTTTAT 


CTGCTGCAAA 


360 


AACAATCATT 


ATAACTCTTT 


TTCCATAATT 


AAATCTGTAT 


CCGTTACATC 


ACCTGTGTGA 


420 


AAATGATGTT 


CACCACCCAC 


TTTAAATCCA 


TGACGTTTAT 


AAAATGCTTG 


AGCACGAGGA 


480 


TTATGCTCCC 


AAACTCCTAG 


CCAAATTTTA 


TGTTTATTAT 


GTTCTTGGAG 


CAATTTTTTC 


540 


GGCCAATTCT 


ATCAATTGTG 


AACCTCTTCC 


GCCACCTTGA 


AAGTCTTTCA 


AAAAATATAT 


600 


GCGCTGCACT 


TCTAAATAGG 


TCTCCCCCAT 


TTCTTCAGTT 


TGAGCACTAT 


TAATATTCAT 


660 


CTTTATATAA 


CCAACATTCG 


CACCATCTTC 


TTGATAAAAA 


TAATGAAATG 


AATCTACATG 


720 


GTTAATCTCT 


TGTGTAAATT 


TNTCTACAGN 


ATAATTGTCT 


TNAAAAAATT 


GATCAAAATC 


780 


TTTGTCATCA 


TAGTAAGAAC 


CAACGTGTCA 


TAAAATGTTC 


TTGTTGNTAA 


TCAACTAATC 


840 


ACTAGCATTT 


GGTCTGAATT 


CTTNGATATC 


CCAGCCATTT 


AATCCTCCAA 


TAACAGGGAT 


900 


GGATCAAATA 


TTACTATGTT 


ATTTNCAGCC 


AC 






932 



(2) INFORMATION FOR SEQ ID NO: 720: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : s ing 1 e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720: 
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TTGAAACAAT 


CTAAAGAAAT 


TCCATTTATC 


ATAATCATCC 


AATTTGCCTG 


CATATTGATT 


60 


AATACTGGCC 


TTTGGCGTGT 


TGCATTACCA 


TTGTTTATTT 


CCAATATATT 


AAAAGAGGGC 


120 


GTTGGTGTAT 


ATGGACTAGC 


TACATCATGT 


TTAGGTATAG 


CATCTTTATT 


AATGTCATTG 


180 


ATTATGGGTT 


TACTATCAGA 


AAAACGATTA 


ATCTTTAAAT 


TTAGTATTGG 


TGTTTTAGTT 


240 


TGGGGGATTG 


GCTTATCGAT 


AATTAATGTG 


TTTCCAAGTG 


TGGCGATACT 


ATATATAGGT 


300 


GCAACATTAG 


TGGGACTTGG 


ACAATCTATA 


GAGGGACTAA 


CAAGATCAGT 


TGNCATNCAA 


360 


ATTAAAAATG 


CCAAANCATT TGGATGGGGG 


AAAGGTATTT 


AGGNATATCT 


CANCTTACAA 


420 


TCGGAATCTA 


CGCAGGNCGG 


ATNCAATTAT 


CTTTAGGGAC 


TNGATAAGGT 


TTTAATTATA 


480 


CCCTTTACTT 


AAGCTTAAAG 


TAATATT 








507 



(2) INFORMATION FOR SEQ ID NO:721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721: 



GAATTNCCAT 


ATNGNACTAA 


AATATCTCCC 


AATCTGNGCA 


TGAGTTTATC 


TNGATTGTAT 


60 


CTATAGTTGA 


CTGTACGCGG 


TTTTTTATAT 


GACCAACGAC 


TGTGNGATAT 


TAANTCTACA 


120 


ATACTTCACA 


AGTTACTCTG 


TAATTGCATN 


ACAAATACTG 


ACATAACTCC 


CCCTACTTAC 


180 


TTATTGAATA 


TTGTTGNGAT 


ATATTGTGCC 


CAATGATACA 


GNCAATTGTT 


AGTTATCGGT 


240 


GGACATTTCA 


CACTGATGTG 


ACACATTATT 


TTTAATGTTA 


ATGTGTCACA 


ANTCTATTAA 


300 


CATTATTAGC 


AATGTACTTA 


CTCAAACATN 


CACCTGAGAC 


TAAAGCAGNA 


CCAATCAAAG 


360 


GTATGAAAGC 


AGAAGCTAAA 


AAGTGTGACG 


TTATTGGTTT 


AGACATTTNA 


GTAGGGACGA 


420 


TGTTAAGTTT 


AAATGTAATC 


ATCACACAGA 


CGTCTCATGT 


GGGGTTAGTT 


TCACAACGTT 


480 


AATTCTAGGT 


TTAATTGTTG 


TGTTTATCTG 


TTCATTAGTT 


GGTTTTGTAT 


ATTACGAAAA 


540 


TAAAATAAAG 


CATCCACTTG 


TTGATTTTTC 


AATTTTTAAA 


AATAGAGGAT 


ACAGTGGTGC 


600 


AACTATTTCA AACTTCTTAT 


TAAATGGTGT 


AGCAGGTGGT 


GCACTTATCG 


TTATTAACG 


659 



(2) INFORMATION FOR SEQ ID NO: 722 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii> MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722: 



TGNAATAAAC 


NTCAAAGCAG 


TAAGAGAGGG 


TCATAATACA 


AGCCTCATAG 


ATAAAGGCAC 


60 


GTATATGGAT 


ATGATGTATA 


TGGATGATGC 


AATTGAAGCA 


ATTATTAAAC 


TTATGGAAGC 


120 


AGNCGACGCT 


AAATTAAGAA 


ACTAGAANCG 


GTTATAATTT 


GAGCCCAATG 


AGTTTTGATC 


180 


CAGAGATGGT 


AAAAGAAGCA 


ATTCAAGAAT 


ACTATCCAAA 


TTTTACATTA 


GATTATGATG 


240 


TTGATCCTAT 


TAGACAAGGT 


ATCGCTAATA 


GTTGGCCGGA 


TTCTATTGAT 


ACAAGCTGTT 


300 


CACGTGGCGA 


ATGGGGATTT 


GATCCTAAAT 


ATGATTTAGC 


GAGCATGACT 


AAATTAATGT 


360 


TAGAAGCTAT 


TGAACAAAAA 


GATACTGTTA 


AAAATAATAA 


CTAATCATTT 


CCATTCACTT 


420 


TAATACACGG 


AATGATATTT 


TAAATTACTC 


TTTATTTTAA 


TAAACTAGTG 


CATGAATTCT 


480 


AATATTATTC 


ATTATACATA 


TT 








502 



(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: 



TTGGCCCCAC 


CAACCTGGCG 


GTTATTAAAT 


ACGGCGATGA 


TTGTCCTGAT 


AATGTCAGTG 


60 


ANCCACAATA 


TACACATTGT 


GTAAGAGCAT 


CAGTCATAAA 


CACTTAAAGC 


CCNAATTAAA 


120 


CCAACGCACA 


CGTTAATATA 


ACNCGGCCAT 


TTGGTGATTG 


CNCTGGAAGA 


TGGCCGCCAT 


180 


CGACCGTCAA 


TCCACCCTGA 


AAAGNTCATT 


GGTAACAACT 


TCAACATCAT 


CAAGCTGTCG 


240 


GAGAGNCATT 


TGGTGTATTT 


ATTTTGAGAC 


ACAAAGGTCA 


AATTGTCCAT 


AAGGNTAAAA 


300 


CAATTGGACT 


TCAATTAGAA 


GACCNTGAAT 


CNNNAGACTT 


TATTTTTGCA 


GTTGCAGGAG 


360 


GCAAATCGCN 


AGGTGAAGCA 


ATTAAAGCAT 


ACTTGNCGAT 


TGCACCCAAG 


AATACAGTGT 


420 


TAATCACTGA 


TGAAGCCGCA 


NCANAGATAA 


TACTTGAATA 


AGAGATAAAA 


AGTTTAATAC 


480 


TTTTTAAATA 


TCATTTTAAA 


GGAGNCCATT 


ATAATGGCAG 


TAAAAGTAGC 


AATTAATGGT 


540 


TTTGGTAGAA 


TTGGTCGTTT 


AGCATTCAGA 


AGAATTCAAG 


AAGTAGAAGG 


TCTTGAAGTT 


600 


GTAGCAGTAA 


ACGACTTAAC 


AGATGACGAC 


ATGTTAGCGC 


ATTTATTAAA 


ATATGACACT 


660 


ATGCAAGGTC 


GTTTCACAGG 


TGAAGTAGAG 


GTAGTTGATG 


GTGGTTTCCG 


CGTAAATGGT 


720 


AAAGAAGTTA 


AATCATTCAG 


TGAACCAGAT 


GCAAGCAAAT 


TACCTTGGAA 


AGACTTAAAT 


780 


ATCGATGTAG 


TGTTAGAATG 


TACTGGTTTC 


TACACTGATA 


AAGATAAAGC 


ACAAGCTCAT 


840 


ATTGAAGCAG 


GCGCTAAAAA 


AGTATTAATC 


TCAGCANCAG 


CTACTGGTGA 


CTTAAAAACA 


900 


ATCCGTATTC 


AACACTACAC 


CACCAAGAGT 


TAGACGGCTC 


TGAAACAGGT 


GGGNTCAGGG 


960 


TGCTTCCATG 


TTCTACAAAC 


TCAATAGCAC 


CCAGGTGGNT 


AAAAGGTTTT 


ANACCGATGA 


1020 
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CTTGGGTTTA GGTGGAGGG . 1039 

(2) INFORMATION FOR SEQ ID NO: 724: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724: 



ATTTAAAGTN 


GAAATTGCCC 


AANATGGGTT 


CCGGCCTTGT 


AAAAAANCTN 


TTGTTGTTTA 


60 


CCCATCGGAT 


TTCCAGGACN 


TTGCTAATTT 


GTTNTAAAAA 


TGGACCCTGT 


TTGAGAAGCA 


120 


TGTGCNTCCA 


AATGCCTTAA 


TTTTAAGTTC 


TTTAAAATCT 


GTAATATCAT 


TTTGAATATC 


180 


AGGTTCTCCA 


AGAGCTTCGG 


TTGCATCATT 


ACTGAACGCA 


ACTAAAGTTA 


AACGAGGGCG 


240 


TTCTTCTTTA 


GGCATGCGTT 


CAACCGTTCG 


AATTACAGCG 


TCTCCTGTTG 


CTTCGTGATC 


300 


AGGATGTACT 


GCATATCCAG 


GATAAAATGA 


AATAATCAAT 


GATGGATTNG 


TATCATCGAN 


360 


TAAAGATTTA 


ATCATACCAT 


CTAGATGTTC 


GTAGGGCTCA 


AATTCGACAG 


TTTTGTCACG 


420 


TAAACCCATT 


TTTCTTAAAT 


CAGTAATACC 


AATAACTTTA 


CAAGCTTCTT 


CTAGTTCACG 


480 


CTCACGAATA 


CTTGGGAACG 


ATTCGCGTGT 


TGCAAATGGG 


GGATTACCTA 


AATTTCTGCC 


540 


CATTTGTCCT 


AGGGTTAAAC 


ATGCATATGT 


TACAGGTATG 


CCTTTGTGGA 


TAGAACTTGC 


600 


TAATGTGCCT 


GCAGATGAGA 


AGGTTTCATC 


ATCAGGATGT 


GGAAATAGTA 


CTGAAGAACA 


660 


TGTCCTTTCG 


NCCAGTCAGT 


GTTGATGCCT 


CCTCTATAAA 


NNNNATGGTC 


GCTCCACTAA 


720 


TNTGAAGTGC 


TGCAGCGAGN 


TGACCNTCGT 


AANTAAAACC 


TGCAATTAAA 


AATTCATCAT 


780 


GTTCATTGAC 


CTCCCNATGC 


GTTAGACCTT 


GTACATAAAC 


CCANCCACCA 


TTTGATAGTT 


840 


TAAGACCAAT 


GCGATAAGGT 


TCTTTATTAC 


CACCTTTTAG 


TTGTGCATGC 


GTATATGTTA 


900 


CTTGTATGTT 


TCNTAAAAAT 


TGTACCAGCA 


TTAAAAACAC 


GTTGATCGAA 


ATGGTTCGCA 


960 


TAGGCCCCAT 


TTGTCGTTTC 


AACATGCAGA 


TACACAGGTT 


TATGTTCCAA 


AAGAAGCAAG 


1020 


TAAATCTATA 


AcriCTTCrrr 


CTTTAATTGG 


TTCCAACACG 


TTCACTCCTT 


ACACTATCAA 


1080 


TGTGTTTATC 


TTTCTATTTT 


ACTAAAAACT 


ATTCGATAAT 


TGTATACGAT 


TGCTCAATTA 


1140 


TTTATAAATT 


AATTTTCATG 


AAGGGTAATT 


ACTCAGGATT 


ACGTAATCAT 


ACAGCATTAG 


1200 


TTTTTTACTT 


TTAAAAATCA 


AAAAmiriT 


GGAATTTGAA 


AAGTGTTAAA 


CATTAAAAAT 


1260 


GATGCTATAT 


TAATGGTGTA 


TGAAT 








1285 



(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725: 



AAGTTGCTTT 


CTTGCTCTAG 


AAATAGGATT 


AATATTGCAT 


GGTATCGCAT 


CATAAGTGAT 


60 


TAGAGTGTTT 


TCTTTTTTGC 


TAGTTTTAGG 


ATTGTAAGTT 


GCTACTTGTT 


CTAATTGAAA 


120 


AATAACTCTA 


TCTTCATATC 


TCAAAAGAAC 


ACAGCCCTTC 


CTTTTTTAGT 


TCTCGTTCTA 


180 


GCATTAAAGT 


AATTATCAAT 


AATAGCTTCA 


TACTCCTTGA 


AATCGTTCAA 


TTCATACGCA 


240 


TTGCTACGTC 


CGTCAACCGC 


TTCTGATGTC 


ATACCTTCAG 


CACCAATCCT 


GTTGTAGCGT 


300 


TTAACTGCAA 


CTTCTTTAAT 


CATGTAACTA 


AACCTTTCCG 


GTATTTGTTC 


AACTTCAATA 


360 


GGGTAACATT 


GATAACAACT 


GGCTTTTCAC 


AACTTTNNAT 


GGATTTCTTC 


TAATTGGTCA 


420 


NCTTTGCTTT 


TNCATCTTTA 


AAGACCAATA 


CGGTTTNNTT 


ACATCAAGCT 


AGCGTAGGTC 


480 


ATATTAACCA 


CCCTACTCTT 


AGGTGACTCA 


AAAAGCATTG 


GATAATTTCA GCTTTGGGTG 


540 


GGGTTTCAAC 


AACTTTG 










557 



(2) INFORMATION FOR SEQ ID NO: 726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 



GGTAATAGTT 


TAAANATATA 


TTTTGCTCCC 


NNGGGNTGGA 


TGATATATGC 


GTGTANCAAT 


60 


AATAGNCATG 


GGNCCTGCTG 


GTGTAAGTGT 


ATTGAGAGAG 


TTAGTTAAGC. NCCCGAAGTT 


120 


TAATCAATTA 


GATATAGATT 


TATACGATGA 


TAAAGTAAAN 


ATGGGACAGG 


GCGTACCNTT 


180 


TCAAAATGAT 


AGTTCTGAGC 


TGCTAATTAA 


TATGCCTTCG 


ANANAGATGA 


GTTTAAATTT 


240 


AGATGATGAG 


ACTGAATTTT 


GGAAGTGGTA 


TAANCAACAA 


NCTGATTTTA 


ATTTTGATGA 


300 


ACCTGCATAT 


TTACCGAGAT 


TTGTATTCGG 


ACATTATATG 


AAATCTTATT 


TATCTATGTT 


360 


CACGAAAAAA 


TATCCAAATA 


TATCAACTAA 


TTATAATAAA 


GTTCAAGAGA 


TTTATACAAA 


420 


TTCTAATATC 


GATGAANCGA 


ATTTAACTTA 


TTACATATGT 


ACTACAAATT 


CAGAACAATC 


480 


ATGGCAAGCG 


TACGATTATG 


TATTTTTAAC 


ATGTGGTACA 


TTTGCCTACC 


ATGATCCATA 


540 


TAACTTAAAG 


GGTAAAAAAG 


GATATATCGC 


TACGCCTTAT 


CCAACATATA 


ATACGTTAGA 


600 


CGAAGTAAAT 


GAACTGGACG 


ATATTGCGAT 


TATTGGTACA 


GGCCTTGCCA 


GTTTAGATGT 


660 


TGTCCGTTAT 


GTTGCTGCTC 


ATCATCCGAA 


ATTACCAATT 


ACTATGACGA 


GTCGNTCTGC 


720 


TCATTTACCA 


AGTGTAAGAG 


GTACAATGAT 


AGACGTGACA 


TTCAAGTACT 


TAACTAAAGA 


780 


CAAATTAAAT 


GACATTAAAA 


AACATCATTT 


CGGCAATGCA 


CCGCTTGATA 


CTATAGNTTC 


840 
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TTTTATTTTT AAAAGAATGT GCTTGAATAT GACATTGATT TTTAAAAAAT TAGTACATAG 900 

ACGTACAAGG AAATCATATT GCAGATTTGA AGTATTGATT TAGCGCCGTC CCAACAGAAA 960 

TGGGCATTTT CCAAAGTATG ATTGAACATT TGANAGAGAA TTTAAATTGG ATATGGAATA 1020 

GCTTAAGTAT TGAAGATCAA CATCAATTTA ATCCAAAAAT ACNCAAAGAT GATTCAATTA 1080 

AATCCAAATC CAATNCCTCC AAGANCTGCT GAACTAATTA TTGAATTAAT AGAAAAGAAG 1140 

TCTTTAATTT TGAAAAAAGA TTTAGAAGAT GTGAAACATG ATGGCAAGTT GTATTATTTT 1200 

TCATACACAA ACCAAGAATC AGTAGACATG TATAATGTTG TCATTAACGC TCGTGCC 1257 

(2) INFORMATION FOR SEQ ID NO:727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727: 



CTACTAGCAA 


ATAATATCTT 


TTAATTTTAA 


AATTAAACTT 


ACAAGTTCTT 


CATAGGTATG 


60 


TACATACATT 


TCTTTTGTTC 


CACCGTATGG 


ATCTATAACT 


TCTCCTGCTT 


CTTTTACATA 


120 


TTCATGCAAT 


GTGAAAACAT 


GATTTTGCAA 


ACCAAAGTGT 


GCCTCTATTA 


ATTCTTTGTG 


180 


CGAATACGAC 


ATCGNCAAAA 


TAATATCTGC 


TTTCAAATCT 


GCTTCAGTAA 


ATTGTTGCGA 


240 


TAAGGTCGGN 


TTCAGCTAAA 


TGATGTTCTT 


CAACTAAGTC 


TTCAACATAA 


TTCGAAACAC 


300 


CTTGATTGTT 


CACAGCGAAT 


ATACCTCTTG 


ATTCAAATTG 


ATGATTTGGC 


ATAACCNCTT 


360 


TTGGAATACT 


TCCCGCTAAT 


GGGNTACGAC 


ATGTGTTACC 


TGTACAAAAC 


GAATAAAATC 


420 


TTCATAGTTC 


ACATCCTTTA 


ATAATGTGAT 


TACCNGCAGN 


TTTTAACATG 


GGATCCATAA 


. 480 


CNGGTTCCTG 


NNTTATCAAT 


CAAGTCAAGG 


CCGNTTA 






517 



(2) INFORMATION FOR SEQ ID NO: 72 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728: 
AATTAGTATT TGTGGATCGT TTAATTAATA ATTGCCGGGA GTATTATAAG ATGGATCTGA 60 
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TATTATTAAG 


TTGATGTTTT 


GTGATGTTTT 


GATTGTTTGG 


ATTATTTTGT 


ATGCTATTAJ 


120 


TAGGTGGATT 


TTATCGTCNG 


CITlTTTCMi 


ATTTTCTTTG 


TTGCGNTTTT 


GGTGNNTGCN 


180 


ANCNTTGTGT 


TGGTGTGTAT 


GAGACNTTTT 


ATTTTCTTCT 


TTTTNTGGGT 


TTTCTTTTTC 


240 


ACCACAAGCT 


GTTAAAGCTA 


AAGTACTTAC 


TAGTAATAAT 


CCAATTATTT 


TCCTCATATA 


300 


CATTTCTCCT 


TTTTTATATC 


TCTTTGTATC 


GATAAACTCT 


CACCGGNTCA 


AATGTAATAG 


360 


AATACTCGCC 


ATAGTGAGTT 


CCAATACCAT 


ATATCTTTTT 


ATATTGTTCT 


ATTGCTTCTA 


420 


ATATGTATTC 


TTCGCTTAAT 


TGTAGATACT 


CAGACAACTC 


ATACAAGTTA 


CGTACGCCAT 


480 


AATTATAAGC 


TTCTACAATT 


TCGCGTAGCG 


GTACAGCTGA 


GATAAAGCCG 


TGTCGTCTTG 


540 


CGTAATTTTC 


GAACTTGCGA 


TTGTTGAATT 


TCGAGTAATC 


GGCTATATCA 


CCGTATGTAA 


600 


GTTTATTATG 


TGCTAATTCT 


TCGAAGAGAA 


TACCTCTAAT 


TTGTTCCTTT 


CGGATAGGGA 


660 


AGGTCTAATA 


AAAATTTCTC 


CTTCTTGATA 


CCAACCATCG 


AATCCTCGAG 


GTCTCGTGCC 


720 



(2) INFORMATION FOR SEQ ID NO:729: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729: 



TCTGTATTAC 


CTNCTTAATA 


ATCCTTCANA 


CATTTTCAAG 


GCCAANCTAA 


GCCATCGGAA 


60 


TTAAGATAAC 


CTTTGTAATA 


GCTGAAAATA 


TCGTTTCATT 


AAGTTTCCAA 


GCGTATTANT 


120 


GATTACACCC 


CCAANAAGAT 


TTATTTGTTC 


CAGGGGCTAN 


AGTCATANAG 


NCTTAGAAGC 


180 


TGACCTGATC 


CAANCCAANT 


GNCNATCGAA 


TTTTTTCGGT 


ATGCGTGTGA 


NNATATTCTC 


240 


NGNTGCGATA 


ACATTCACCT 


CCTTAACGCA 


TTANATCTTT 


NATACTAATA 


AATNCNATGN 


300 


TTCCTTCACC 


CTTGTCAGAA 


ACNNCTTTAN 


TCATAATATC 


TGTATATGTG 


TTTGAAAATG 


360 


CTGCAAAACC 


ATCTATTTNA 


CGATATCTGC 


TTTGTTTAGA 


TGGCAGCCAG 


TTCCCATTTC 


420 


TGTCTAGTTT 


CAGCTGAACA 


TTATTGATAT 


ACCATTTCAT 


TAAAGGATTA 


TTATTAAATA 


480 


TTATTTTCCC 


ATCTAAAAAC 


ATTTCTTTTA 


GATCCTTCAA 


TGCAGGGCTC 


AAGGTCAAAG 


540 


CCCCTTGTCT 


TGTTTCTTCT 


GTTTCAAAGC 


CATAATTCTT 


TAACTCTTGA 


TTTAATTTAA 


600 


AAGCATTCGC 


CCTATCATAA 


GTGATTTTTT 


CGACAACATA 


ATGCTCATTC 


ATCTTTATTA 


660 


TCCAANTTAA 


AACATCTTGG 


TAGTCTATAT 


AAGGATTATC 


TTGTATTGTT 


AGTAATCCGT 


720 


CTTCTTCCCA 


TTCTATATAG 


GGTATCTTTT 


CGGGTAGAAT 


ATTCAACTTT 


ATGCTTAGGA 


780 


ATCCAAGAAT 


TGTGTTAAGA 


CAGCAACTTT 


GCCATTATCT 


TNATGNAAAA 


GTGGCACATG 


840 


CGGCTGTAAA 


GGCCCCNGNN 


NCTTGATNAA 


ATCATAANCC 


ATAGGGACAA 


NGGTCTTNCC 


900 


NTCCAACTCA 


ACTAAGGAAA 


TAATGTCATT 


ATTTTTTTGA 


AGTGTTGGAT 


AATCAATAAA 


960 


ACTCATCTCA 


TCATTATTAG 


CAAAGATGTT 


AAACCTTTTG 


GTTATAAAAT 


CCCCCACGTT 


1020 


CATCTGGTGT 


TGCTCTTAGC 


CTTTTCCCAC 


TCTTCTTTCA 


TTTCATCTAA 


ATCGATAGAA 


1080 


ACACCTAGGT 


TAGGATTTGC 


TTTAATCCAA 


TTCGNCGAAT 


CATTTATATC 


ATCGTCATCA 


1140 
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TCGAGAGAAG CTAAATAGTA AAAAGTTCTT TCATCTTCGA TGATTCGATC TAAGGTGTCT 1200 

CTTCCCGCTT CTACCATATT AACAAGTGGT CCATCTAGTT GGTACCCTNC TGTCGTAATG 1260 

TAGATAAGAA GGGGTTGTAA CCTTGCCGCT CTTGAGTTTT TTATAACTGA AATCAATTTA 1320 

TG 1322 

(2) INFORMATION FOR SEQ ID NO: 730: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730: 



GATGTCAGAG 


TTATTCTCTA 


ACCGAAGATG 


GTTTGAGTCT 


AATGGTATAT 


GCGATGNATC 


60 


TTGATATGTC 


TNNATCATCA 


TTTCTATTTA 


ATATGTGCAT 


GATGGCGNGT 


ATAGTTAATA 


120 


ACAGTATTGC 


CACGATATGC 


ATTGCATAAA 


TCATCAAGAA 


AAATATCAAT 


TTGACTATCA 


180 


TCGAAAATGG 


ACACATGAAA 


ATCTAATAGT 


ATATATGCAG 


CATCAGCGAA 


CTGAAACAAT 


240 


TTAACTTTGA 


ATAAAGGTGA 


ATCATTAAAA 


TGGTAAGTAC 


TTAATTCTTG 


CTTAAAAAAA 


300 


GCTTCTAAAT 


CATAGTTTGC 


GGAAGAAGAT 


GGAACTTGTT 


TTATCTCAAT 


AAAAGGCAGA 


360 


AATTCATGAA 


GTATCATTGT 


TAAATTGTCA 


TCGGTAGTAA 


CATCAAAAAA 


ATGTCTTATA 


420 


GATGCATGTT 


GTTGCACAAT 


TGTCGATAAT 


GCATACATCA 


TTTTAGTAGC 


TTCAACATTT 


480 


TTAGCGAGTT 


TAACCCAATA 


CGCATTACGG 


TGTGTCGTTG 


ATTCTGTATT 


ATTTTTGTAT 


540 


ATACGAAAAT 


ATTCCTGTTG 


AAATCTCAAA 


TTACCCATAA 


TCATAAAAAG 


TCCTTCTTTA 


600 


CATATCATAA 


TACTCATTAC 


TTACTAAAAT 


TGCATGATGA 


TATTGATAAC 


CGACGAAATG 


660 


TTAATTAACT 


CCGGTATGTA 


ATGATTAATA 


TAAAACAACA 


TCCGCAACAT ATTGAGCGAT 


720 


ATATTCTACC 


CTAAAATACA 


TCTTGTGNTT 


CACCCGNNAC 


AAATGGGGAT 


TATTTTCCAA 


780 


TGTAAATTAC 


ATCAACTTTC 


CGTAAATTGG 


CACACTACAA 


CTCCG 




825 



(2) INFORMATION FOR SEQ ID NO: 731: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731: 
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ATAAAAGTTA 


TTTCCTATAT 


ATACTTTGTG 


TTATAAATCG 


ATTTTGATAG 


GTGTATCTAA 


60 


TGATTTGTCA 


TTCATCAATT 


TAACAGATTG 


TGGTGTCACC 


TTTAAAACAC 


AAAGTTCTGG 


120 


GTCTTCTTNT 


GAACTGAAAA 


AGCTTTTATC 


TTGAGTCTCC 


CATAACCAAT 


CGATTACTTT 


180 


GTGATCAGTA 


ACTACTTCTA 


TCGTGGCTTC 


CATTTCAACG 


AAACTACGAT 


TAGTTGTGTC 


240 


GNTATAGCCT 


AACAAAACAT 


ATGCTGCTGG 


ATTATCTTTA 


ATTTCTTTGA 


CCTTAGCAGA 


300 


ATGGATATTC 


GCGTTTTAGT 


ATATAAAGTA 


AGACCCATCA 


TTATAAAAGA 


CCATATATCT 


360 


ACTATTAGGT 


TNGGTTATTA 


TATGCAGTTG 


ATAATACACC 


AACTTTTGAC 


GTTGGTAACA 


420 


CGTTTTCAAT 


TGCTTGAATT 


GCTTGTGAAG 


TTACTCATTA 


CATATCATCT 


CCTTATGATA 


480 


TAAATAGCCT 


TTACGTAAGA 


ACTAAACACA 


AGAGGACTTG 


TTAAGTATTT 


AGCTTGGGTT 


540 


TAAAGCAAC 












549 



(2) INFORMATION FOR SEQ ID NO: 732: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 732: 



CGATTTAAAT 


GATTTTGGTC 


CTGCGTGGAT 


TGAAAGATTT 


GTTCGTGTTC 


AAATCGAAGA 


60 


AGCATTTGCA 


GTGGCGCTTG 


AAACTGCGTT 


CTTAAAAGGT 


ACTGGTAAAG 


ACCANCCAAT 


120 


CGGCTTAAAC 


CGNCAAGTAC 


AAAAAGGTGT 


ATCGGTAACT 


GAGGGTGCTT 


ATCCAGAGAA 


180 


AGAAGAACAA 


GGTACGCTTA 


CATTTGCTAA 


TCCGCGCGCT 


ACGGGTTAAT 


GAATTGACNC 


240 


AAGTGTTAAA 


ATACCCACTC 


ANCTAACGAG 


AAAGGTAAAT 


CAAGTAGCGG 


GTAAAAGGTA 


300 


ATGTAACAAT 


GGTTGTTAAC 


CCGTCCGATG 


CTTTTTGAGG 


NTCAAGCACA 


GTAATACACA 


360 


TTTAAATGCA 


AATGGCGTAN 


ATGTTACTGC 


TTTACCATTT 


AATTTGAATG 


TTATCGAGTC 


420 


TACAGTCCAA 


GAAGCAGGTA 


AGGTTTTAAC 


GTACGTTAAA 


GGTTTATATG 


ATGGTTATTT 


480 


AGCTGGTGGT 


ATTAATGTTC 


AGAAATTTAA 


AGAAACACTT 


GCGTTAGATG 


ATATGGATTT 


540 


ATACACTGCA 


AAACAATTTG 


CTTACGGCAA 


AGCGAAAGAT 


AATAAAGTTG 


CTGCTGTGTG 


600 


GAAATTAGAT 


TTAAAAGGAC 


ATAAGCCAGC 


TTTAGAAGGT 


CCCGAAGAAA 


CACTATAAAA 


660 


TTTTATGAGG 


TGATAAAATG 


GTGAAATTTA 


AAGTTGNTAG 


AGCTTTTTAN 


AGACATAGAG 


720 


CACANTCAAC 


ACAAGTACAA 


NGTAGGGGAG 


GTNGGTNTCC 


AGCTGAAGGG 


TATNACAACC 


780 


CTCGNGTGGA 


ATTGGTGGAC 


AAATCAAATC 


AANAATNNGT 


ACCGACAAAA 


GNTTTATATC 


840 


NGCCCCCTTN 


AGATAAGCTG 


ACAAAACAAG 


AATTATTAGA 


ACTATGCGAN 


TCATTACAAA 


900 


AAAAA 












905 



(2) INFORMATION FOR SEQ ID NO: 733: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733: 



ACGAATCTTT 


CACGTCATAT 


TCATTGAACA 


TCAATATCTG 


TCAGTTTATA 


ATTTGATTAT 


60 


AACAATGATG 


AACTAAAATC 


AACTAATAAA 


AATGTGGGAA 


ATTGAATAAC 


AATTAATCTT 


120 


TATTAATTAT 


TATTGTATAC 


AAAGTTAAAG 


AAGTCAAAAA 


GGATTGGTGA 


AAAAATGTCT 


180 


TTTTATGTTG 


TACTTATCAT 


TATCATTGTA 


GCATTAATTG 


GTATATTAGT 


TTTAAATCAA 


240 


AGGTATAGTA 


ATAGTAAAAT 


AGATACCGAA 


GTTTATGCAA 


GAAAGCAACT 


TATTAAAAAG 


300 


AATAAAGCAT 


TAAGTGCAGA 


AAATGCTGAA 


TTAAGAAGCC 


AAATGCTGAG 


CNCTAATAAT 


360 


GATGTTGGCA 


CACCCATGCA 


ATACAAAAAA 


TGCTAACAAG 


AGCATGACGC 


ANGATCTTAG 


420 


ATAGCTATCT 


CGAAAATGGN 


AAATTAAACT 


ATTACGATAT 


TATCCGCTAC 


AAGCAACTCA 


480 


GCTACTAAAN 


CATCCCGNNT 


TTCGGGTTAT 


GCACGCCCCT 


TGGGTCTTTT 


CATGGTTCAG 


540 


ATTCGGGCTT 


CTAGCCTGG 










559 



(2) INFORMATION FOR SEQ ID NO: 734: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 863 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:734: 



CCTTAAGAAG 


CACCTTTACC 


AATTCCGCCA 


AAATCTGTAA 


ANTCAAATGT 


AGTAGGTAAT 


60 


cmrmAG 


GTTGAACCAT 


TTCTTCTAAT 


TTAAGTAGTC 


TTGCATCTGN 


CACTTCNACA 


120 


ATACCCACAT 


TTGGGTCNAT 


AGTTGCGANC 


GGATAGTTTN 


CTCCCAAGGC 


ACCCGCCTTT 


180 


GTAATTGCAT 


TAAATAATGT 


TGATTTACCA 


ACGTTTGGTA 


AGCCAACGAT 


ACCTGCTGTT 


240 


AAAGCCATGA 


ATCATTCTCC 


TATCTTNGTG 


TATCATCCTG 


AGATACGATG 


ATTTTTTTTA 


300 


GTTTTTTATC 


AAACGTTTGA 


CGTGGAATCA 


TAATGCTTCG 


TTGACAATTT 


TCACACTTAA 


360 


TTCTAATGTC 


AGCACCCATT 


CTAATAATTT 


TAAAACGATT 


GGTTCCACAC 


GCATGTTGTT 


420 


TTTTCATTTC 


TACTATATCA 


TTTATTCCAT 


ATTGNGACGC 


CATTAAATAT 


GACCTCCATG 


480 


TATTATAAAC 


TACTAAACAC 


CGNTTTCACT 


ATGATTAAAA 


GGTGTCATTA 


TTGGNTGTGG 


540 


TGTCTTGATA 


CCTTCTTGTA AAAACATTTT 


GTGGACTTCT 


TTACGAATAA 


TACGCGCACC 


600 
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AGAGAAACCN TCACCTGGTA TTGTTTCTGC TGAAACCCNT AATATTACTC TAGTATCTJC 660 

AAAGGCATCA ATACCAATCA CAACAGGATC ACTTACAAAT AAATNNATAT TTACTACGCA 720 

GAGGATGGTA AATAATGTGT TTAATTTCTT CTCAACATTA TCAATATTTT CATCTACAGA 780 

TACTGGAATT TTAACAATTG CTGGTTCCAC TCGGNAATTG AGTAANTCGG TATTCACCCC 840 

NTNCTACTAT TTGGGAAGAT TGG 863 

(2) INFORMATION FOR SEQ ID NO: 735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735: 



CCATAATCAA 


TCCCTAATAA 


GTTATCATCA 


GGGGAATTTG 


ATACCTGCAT 


CGACAATGAC 


60 


AATTTCGTCT 


TTATACTCAA 


CTGCATAAGT 


ATTTTTACCG 


ATTTCACCTA 


GACCTCCAAG 


120 


TGCAAAATAC 


ACCTACTTCA 


TTTGGATGTA 


ATTGTTTCAT 


TATTTAGCAA 


TCTCCACATT 


180 


AAAGTGTTCT 


GAGTTTTCTT 


TTTCGTAATC 


TAAATGTGCG 


CCCTCTAATT 


TAGTGATAAA 


240 


TTCGATATTA 


AAATTACGAT 


CTTTCAAGTA 


ACGACGTACT 


TGTTCTTCTG 


TTTGAGCTTC 


300 


AACATAAAGT 


GATTGTGTAT 


TTTCACGCAC 


AATTACCTCG 


TCTCTGTTAT 


GTTGATAAAA 


360 


AACTTTAAAT 


ACTGCCATGT 


TAAAAATTCC 


TCCTAAGAAT 


GTTTGTTTAA 


TTTATTGTTA 


420 


ACCCTAGTAG 


AATCGTATTG 


GAGTATATAT 


CGATAAATTC 


ATTCCAATCA 


TCTCTATAAT 


480 


TTAACTTAAT 


AACGATTTGG 


NTAATAACTA 


GGTTCATCAT 


GTCGTTCATT 


TTAAAAAATT 


540 


AGTGAAATAA 


CACTAAAATT 


TCAGTTAAAT 


CTTATTTTAC 


ATGATGAATG 


ATAATAAATG 


600 



(2) INFORMATION FOR SEQ ID NO:736: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 6: 



CCAACCCACC TTACTTGNCT ACATCGAAAT ATTGTTAAGC ATTAATGTTT CATNTCTTGA 
CTAGTGTTCN TTTTTANCTT TGGANAATTA AATAAANTCC CAATAAGNCC GCAAACNCCT 

688 
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a AmA KMkWikn 
AATAANATAG 


GATAAATCCT 


GTATGGGAAT 


AACATTANTG 


NTGAAATACC AGCTACACQA 


180 


AW^ATfnv"* a aia 
NCCNCTGANA 


inn* Mim/wm 

TGACTTGCGG 


GCTATATGGT 


NATNAACCTT 


GGAAGCAGCC 


TCCCAAATAT 


240 


a IVHnunn 1 A 


\- X 1 a»L. X\xA 1 1 




t» a r» a npo a ip a rn 


TCATCTGCAA 


TATGTTTAGC 


inn 




/"» A f* A T* A f^m A A 

oAUAl AV- 1 AA 


1 AuAuAluu i 


Vsl 1\jX 1 lutt 


GTGGCAATAT 


CTGCGACACT 


i £ n 


1 Av^V_~AAA\_ IV 


uLAAl X\A-XA 


A 1 Iv. AvjUvjUL. 


ALuL. X X luAl 


TTCACTTTAG 


AGCGAACAAA 


420 


1 lvjv_ AAl_ AAL 


LAI Xv_AAl Al~ 


UAtLAl 1 vj lb 


fTVTV^ A AT* A ArpA 

i ITjAAlAAlA 


CCGACTAAAC 


CACCAATTAG 


480 


v_AA\-VjrV~AA l\* 


ATAvaCAATAT 


U 1 rCU A 1 TJC I 


m Am. A A Tt A PP*P 

rAi AAl ALC I 


TTGGACACTG 


CATTTAHTAn 

vn x x x 


540 


ttv^r+r* a nv^p A A 
TLCCATC-LAA 


CCGAATGAAC 


CAIXJTATUaG 


A A A m/** iv mm 

ACCAATGATT 


CCAGCTAATA 




600 


» iv rrv"* 7i A.mA/"v» 
AA I vJAA 1 AUG 


at* a smniipxw 
ATAATGACAT 


TTACACCTAA 


m a Ktw^rk a m 
TAATGCTAAT 


ACCAATACTA 


AAATATACGG 

***v** x x nv>uu 


660 


A 1 ALAAv. Ill 


1 AAl iAwii 1 


ArpA ampnTAP 
A 1 AA 1 aJA 1 A\j 


TTTTTTAGCA 


TY^&T tr P f PA A »p 
iwAl i 1 AAAVj 


AAA tw*^ a mm 
AAATGCCATT 


720 


CCGGTTAAGA 


AATACAGAAT 


AATAATCCGG 


CTTAAAATTA 


GCAACCTGGC 


AATACAATCT 


780 


TTAAAGGTCT 


TACTNCTTGA 


ATTTTATCTN 


NTCAATTTCC 


GGNTATGTTG 


GTGGTTCTTT 


840 


AACCGCAGCA 


AATTGGTTGG 


CTTNTCTGAA 


ATCAATTGGA 


TAGGATTATC 


GNCGAACATT 


900 


GCACCTCCAA 


CANCTGTAGC 


CATTGCTAGC 


GCAGGTGGCA 


CATCTGTCGC 


TTGTGCAAAT 


960 


CCGTACACAA 


CAGGGGTGCT 


ATTGGAGAAC 


AGNTCCTACA 


GACGTCCCCA 


TAGATTTTGA 


1020 



(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737: 



CCTGTTAATA 


AATTACCATC 


TTGATTTAGG 


TAATGGTCTT 


TATCTATTTA 


TAGACAGAAA 


60 


AAATAAGGTT 


GGAGAAGCAA 


TTAAAAATGC 


TAAAAATTAT 


TTAGATAGAT 


GGAAACCAAA 


120 


ATATGAGAAA 


AAAATCATTG 


CACAAATAGA 


TATAAAAGTA 


AAAGAGAATA 


GATTGATGGA 


180 


TTTAGATGAT 


CATGATAATC 


AATATGAATT 


TAATTCATTC 


GNTGANGAAA 


ATGAAGAAAC 


240 


AATATACAAT 


GAATTAGATA 


AGTNGATTAA 


TAATAATACA 


AAAAGTCGAG 


GTAATATAGA 


300 


CGGCTTAATA 


ATTGAGATTA 


TGATTAGAGA 


GTATTATCTT GGAAATTGAT GCTATCCAAA 


360 


AAGAAACTTA 


TTCTCAAATN 


GGGTAAAAGN 


AAAAAAAGAA AAAGGATCTT 


TACATACCCA 


420 


ACGGGTAAAG 


AACTTTGGAT 


TNAGACAGGA 


ATGATATTAT 


TAAGATAGTG 


TATTTATTAT 


480 


GAAATTGTGG 


GNAACTTATA 


TTAGGTAGAG 


GGGTGAT 






517 



(2) INFORMATION FOR SEQ ID NO: 738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738: 



ATGACGATGA 


TTGAATATAA 


CCATCAACGT 


ATTGATAAAA 


TTATCTATTT 


TCATTCACCA 


60 


CAAGCTGAAA 


ATAAAAAGTC 


ATCTGTAGCA 


ATCATTATGA 


ACCCTACAAC 


TGGCAATCAT 


120 


TTCCAAGCAT 


TTTATATCAT 


GATTAACGCT 


ATTAAATATC 


CATATCCAGA 


TTCAAATAAA 


180 


AAGTTTCAAA 


TGATTAATGA 


TTGTGCTGAA 


AAATTCGACA 


TACCAATTTT 


AGGTATCGAT 


240 


GTACAGCCCC 


CTCAAGCATT 


TCATGATTTA 


TCGGTTATAT 


TATAATTATT 


TAATTAGTGT 


300 


GTTAAGGCTC 


CAAAAATGGA 


TACCAGAACT 


TCAATAATAT 


TAATTATATA 


TTNCGGGTTT 


360 


CTCTTTGTCG 


NAAGTTTTCT 


TTAAGTATTT 


ATATGGTGAG 


CCATCAATGT 


GTCTAACGTA 


420 


TTTAAGTTGN 


ATTAGTTTAT 


GTACAATTTG 


GTCAACATCC 


TGCTCATTCA 


CATNCATAAT 


480 


TAA 












483 



(2) INFORMATION FOR SEQ ID NO:739: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739: 



TTCCCCGTCC 


TGTAATAACC 


TTCACTTGGC 


ATTCCTGGGA 


ATTTGACCAA 


CATCCTGTTC 


60 


ATTTTNATAG 


ACTTGCTTAA 


ATGGTACAGC 


TGAAAAATCC 


AAAGTAATGT 


CCGTTTTCGT 


120 


CTGGCCTGTG 


AAAATGGATC 


AATGAGGATT 


GCGGTGTATC 


TTACATAAAG 


CAGATATTCT 


180 


TGTAAGATAT 


CTGGCAAGAT 


TTCGAAATGC 


TCAATATCAC 


CTTCATCCAA 


AGTTTCAGGA 


240 


CGTACTGGAT 


AATTTTCCTT 


CACCATAAAT 


TGAAGCAGGT 


TGACTGACGG 


CACTCTAAAT 


300 


GCTTGTAAAA 


TAGCCATCAT 


TTCCTTCCAC 


ATTAAACGCA 


TTTCACGGAT 


TAATGTCATC 


360 


GTTATCGGTC 


CTTCAGGAAA 


TAACTTATAC 


ACATAAACCG 


GTACATCCGT 


TCCTTCGAAA 


420 


ATGGCTTTCA 


ATGAAAAGTC 


ATTCATAAAT 


AGTGGCGGGT 


GCACATAAAG 


CGAACTGTTA 


480 


CGTGTTTCAG 


CATGCAGTGG 


CGACCCANCG 


AATCCGAATG 


TGAATCTTCA 


ATTGCTCAGC 


540 


TAAAGCAGAG 


ATTCGTTGAC 


ACATTGTTGA 


GTNTGAATGT 


GTCGATCCCA 


TGTACAATTT 


600 


CTTTTTTACA 


CCTGTTGTCA 


ACACATGATT 


AGGTGCTTCT 


TTATCAACAA 


TACGTGTATC 


660 


GCCAAGATAA 


GTTGAAAATG 


AAATCACTTC 


GATATCCTGA 


CTGAACTTAG 


ACATAAATTG 


720 


TTCCACGATC 


ATTTGCGAGC 


CAAATGTNGG 


TGANATTAAA 


ATGACATGTT 


TGACACCTTT 


780 


CCAAAGTTTC 


CTAACCGACN 


ATTGCTGTNG 


TGTGTCAATA 


ATAAGCATCC 


TTNCTGTGCA 


840 
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TTGCCATGAC CANCCGTTTC GGTATTCCAC CCTTAACGGT TTTTAACATC TTTAATACAA 900 

ACGATTAATT TCCAAACTTA CCTTCCAGAT GTTGATGCGC CTCGTTTTGG ATTTTGACTT 960 

CAAATTGTTT CTCTTTTTTA TACGCTTGAT ATAAGCGTTT TGATTTTTCT GATGTTGAGG 1020 

CACGTCCAAC CATATCAATC TCATAATCTG ATTTTAAATA GCAAATATTC GCTAATTGGA 1080 

TTGCGACCGG ACCAGTGCCT ATCATTAATA ATTTAGACAT CCGTAATACC TACCTTTATA 1140 

GCTGCTTTTT TATAAAGTGC TATATCAAAA ATTTGCTGTG GTCTCATATG TTTAG 1195 

(2) INFORMATION FOR SEQ ID NO:740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740: 



CTAATGGCTA 


TGGTTATATC 


AAGTTAAGAG 


ATATTATCGT 


TGAACGAATG 


TCAAAGCAAG 


60 


GTATAAATGT 


AGGTAGAGAA 


AATGTAATGA 


TCACTTCAGG 


CGCTTTACAT 


GCCATTCAAC 


120 


TTTTATCTAT 


TGGGTTTTTA 


GGTCAAGATG 


CCATAATAAT 


TTCGAATACA 


CCATCATATA 


180 


TTCACTCTAC 


AAATGTTTTG 


GGAGCAATTG 


AATTTTAGAC 


ATATTGATGT 


NCCTTATAAT 


240 


CAAATTAATG 


AAATTAANAC 


CATCATTGAT 


AGATTTATTA 


ATTNTAAAAA 


TAAAGCGATN 


300 


TATATAGAAC 


CTAGGTTTAA 


TAACCCGACA 


GGTCGNTCTT 


TAACGAATGA 


GCAAAAGAAA 


360 


AATATNATTA 


CTTATAGNGA 


AAGACATAAT 


ATTCCTATCA 


TTGANGATGA 


TATCTTTAGA 


420 


GATATTTNCT 


TTAGCGATCC 


AACTCCTGCT 


ATCAAAACTT 


ATGATAAATT 


GGGAAAAGTT 


480 


ATACATATAA 


GCAGTTTTTC 


AAAAACGATT 


GCACCAGCAA 


TAAGAATAGG 


TGGATTGNTG 


540 


CTTCTGAAAA 


AATAACAGAG 


CANTTGGCAG 


ATGTAAGAAT 


GCAAATNGAC 


TATGGNTCCA 


600 


GTATCNTGTC 


ACAAATGGGN 


GTATATTGAG 


ATGTTGAGAA AATAAGTCTT ATGATAAACA 


660 


CTTAGNAANG 


TTAAGGTATT 


GTTTTAAAAG 


ATAAACGAGA CTTTATGGTT AAACATCCTC 


720 


AATNANTTTA 


TTTAAGGATA 


TAGCACATTG 


GGAGGNTCCA 


AGTGGAGGTT 


ATTGCGNNTG 


780 


GNTAGNCTTT 


AAAATAGATA 


TAGATATTAA 


ATATTTATTT 


TACGAATTGT 


TAAGTAAAGA 


840 


AAAAATATTA 


ATCAANCCGG 


GGTTACATTT 


ATGGNAGNNA AAGAAAAGAG 


TATAAGGNTA 


900 


TCTTTTGGCT 


TTGNATCAAA 


TGAAAATATT 


AAGCATGCGC 


TCTATAAAAT 


TTATACATAT 


960 


GTGAAAAAGG 


TTTAATTAAA 


ACAATAATTC 


GAATCATTAT 


GTGGNATGTT 


AAACAGNTAA 


1020 


ATATAAGCTA 


TGNACATTTA 


ACAA 








1044 



(2) INFORMATION FOR SEQ ID NO: 741: 



li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDE0NESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 



AAAAGAAAAA 


TAAAATATAG 


ATTGAGCACG 


AAGTGATTTG 


AAATAAGGTT 


GTGAAAGGGA 


60 


ATGACAAGGT 


CAGCATTAAA 


ACCATTTAAA 


AATAAACGCG 


TTATGGTTAC 


TGGACGTATA 


120 


CAACGTGTTN 


TGTTTAAAAA 


TTATTTAGAT 


AGACATAGCA 


CATTTANGCC 


GAATGTAAGG 


180 


ATATTATTAA 


AAGATGTATT 


TGTTTCAGGT 


GTATCAATAG 


ATCATTTATG 


GTTATATGAG 


240 


ACAAATAAAT 


ACTATGCATT 


GGCAATGGAA 


CTTATTCATC 


AACGAGTAAA 


ATTTAGTGCG 


300 


AATGTTGGTA 


CCATATTTAC 


AAAATTAAAT 


AGAAATTAAT 


AATTTATTCC 


GCACAAAGAT 


360 


TATGGAATTA 


AGCCGTAAGG 


GTAGGTNTAA 


TTACTGAAGA 


AGCTTACAAT 


CAAAAACAAT 


420 


CAAGTATCAG 


GATAAGATAT 


ATTGAAAAAT 


NACCCGGTT 






459 



(2) INFORMATION FOR SEQ ID NO:742: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 857 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 42: 



ACAACCCTNC 




CAATTAGGTA 


GAGAATTTNA 


CCTAGGTAAN 


TTAATGCGAT 


60 


AAAGCCCAAG 


TTTGTAAAAT 


GTCCNTTGTG 


CGCCAATTTG 


TTCCTGTACN 


TANTGGGANC 


120 


TATTTTAGGA TTCTTATCAG 


GGATATTTCC 


CAAGGGTTTT 


GTTGACNCCT 


TAATCATGCG 


180 


TGCGTGTGAT 


GTTATGTTGG 


CAATTCCCCA 


AGTTATGTTG 


TAACGTTAGC 


ATTAATTTGC 


240 


ATTGTTTGGA 


ATGGGTGCCG 


AAAATATTAT 


CATGGCATTT 


ATTTTGACGC 


GTTGGGCATG 


300 


GTTCTGTCGT 


GTTATACGTA 


CAAGTGTTAT 


GCAGTACACT 


GCTTCTGACC 


ATGTCAGATT 


360 


TGCTAAAACA 


ATCGGTATGA 


ATGATATGAA 


AATTATTCAC 


AAACATATTA 


TGCCGTTAAC 


420 


ATTAGCAGAT 


ATTGCTATCA 


TCTCTAGTAG 


TTCGATGTGT 


TCAATGATCT 


TGCAAATATC 


480 


TGGCTTTTCA TTTTTAGGAT 


TAGGTGTCAA 


AGCGCCTACT 


GCAGAGTGGG 


GCATGATGCT 


540 


TAACGAAGCT 


AGAAAAGTGA 


TGTTTACACA 


TCCTGAAATG 


ATGTTTGNGC 


CAGGTATTGC 


600 


CATAGGGATT 


ATAGTGATGG 


CATTTAACTT 


CTTATCCGAT 


GCTTTACAAA 


ATTGNTATTG 


660 


GATCCCCCGC 


ATCTCTTTCT 


TAAAGATAAA 


CTTCCGCNCC 


TTGTGAAAAA 


AGGGAGTGGN 


720 


GCAATCATGA 


CATTGTTAAC 


AAGCTAAGCA 


TTTGGCGATT 


ACAGATACCT 


GGACAGATCA 


780 


ACCACCGTGA 


GTGATGTGAN 


TTTNNCAATT 


AACTAAGGGG 


TGAAACTCTA 


GGCNTTATTG 


840 


GGGAAAGTGG 


TAGCGGT 










857 
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(2) INFORMATION FOR SEQ ID NO: 743: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:743: 



GGCACGAGCG 


GCATATCTTT 


TTTACTAATA 


CCTCTGCCAT 


AGTCTTTAAT 


ATATAACGAA 


60 


ACATGTTGAT 


CATTTAATTC 


TGTCCCAATT 


TCACTATTAA 


AATTCTCACT 


ATNTGATCAN 


120 


GGCGTATGAT 


AGAATCTGAC 


TCTNAANNAA 


ACGCAANCTT 


GTTGGCTGAA 


AGTTNNCATG 


180 


CTGGTGTTGA 


AACGGCATAT 


AAAGCTGTTA 


TGAAACCAGN 


TGAAGGNACA 


ATACTTACAG 


240 


GTGGAAAAGA 


TGCTGCGCAA 


GCTGCAATAG 


AAAAAGCAAA 


TAATACTGAA 


GATTGTATTA 


300 


GAATTAATGG 


AGNACATTAT 


TGGTAAAAGN 


CAATGAATCA 


CTTGGAAAAC 


ACAACCAAAC 


360 


TTATTAGCTG 


TACTTAAAGA 


AGTGGGTGT 








389 



(2) INFORMATION FOR SEQ ID NO: 744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744: 



AATTGCTCTA 


TTAATTGTGA 


TTTTTTAAAA 


ATAAGCTGCT 


TTATTAGTTA 


TTAATCTTGC 


60 


ATTGATATCA 


CTAATATGTT 


GCTTTTCAAC 


CAGTCCGTTC 


GCTTTTAGCG 


TAGTACCTGT 


120 


TTGAGCGATG 


TCGACAATTC 


CGTCTACCAT 


ATCTACGACA 


CAAGCTAATT 


CAACAGANCC 


180 


ATTCAATTTA 


ATCAATTCGA 


CATCAATACC 


TTTTGATTTA 


AAATATGTTT 


CAGCAGTATG 


240 


AACATAACTC 


GTTGCGATTT 


TACGATAATT 


GGTCGTTTCA 


GGTTTCGCTG 


CAACCGCANA 


300 


ATGACATGCT 


CCAAAAGGCA 


TATTCAACAA 


ATTATTAACA 


TTATATTGGC 


GCTCATCTAA 


360 


TATGTCGCTA 


CCAACAATGC 


CTATGTCGGC 


CATTCCTTGT 


TCCACATAGA 


TTGGCACGTC 


420 


ACTTCCTTTT 


NCTAAAATGC 


ATTCAATATT 


ATCTACACTT 


AATAATAATT 


GGCGTTCTCT 


480 


ATTTTTTAAT 


GTTTCTGATA 


ATGTCGTATA 


TTCAATTACA 


TCTAAATAGT 


TAATTAAACT 


540 


ATCCATTAGA 


CGTCCTTTGG 


CTATGGCGAT 


TCTTAACATT 


GTGCATACTC 


CCTTACTTAA 


600 
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AGTGTTAATC CTAATCCAAA CCCTTCGATG CTTCCTTTGT AATAACCG _ 648 

(2) INFORMATION FOR SEQ ID NO:745: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745: 



ATTAGCGGAT 


GGNTAAAAAT 


ATGGTTTTAG 


TAAGCGGGCC 


ATATTCCCTT 


TTATTANTGC 


60 


GAGTGTTTTA 


TAGCTGACAC 


AACTTNCTTA 


CCTTGTTCGT 


GGCAATCTAG 


TGATATTATA 


120 


TCTGCTGATT 


ATTTTCATGT 


AGGATTCGTT 


CGATATTTTA 


GTAGAATGAT 


TATACCTAAT 


180 


TTATTCTCGC 


TTTTAGCAAG 


TATTATAGTA 


TTGTGGTTAT 


ATTTTAGAAA GGCGATACCT 


240 


AAAACGTTTG 


ATGATAATAA 


TATAAAGCAT 


CCTAAAGATG 


CCATTAATGA 


TTTAAAGCTA 


300 


TTTAAAATTT 


CAGGGATTGT 


CCAAGTAANA 


TTACNTTTCG 


GCCAACCTAA 


TCAAGTGAAT 


360 


TTNCCNAAAA 


TTCCGGGTAT 


CCAATTTTTC 


NCCTGGATCA 


TTGCTTTTAT 


TTTTCTAATG 


420 


TTGGCTCGTA 


AATCCAAATG 


CTGTAAATAT 


TAAGCAAGTC 


ATTAAGGGCG 


CACCTTGGAA 


480 


TATAGTATTA 


TTTTCAATTG 


GTATGTATAT 


CGTCGTATTC 


GGCTTAAGAA 


ATGCTGGCAT 


540 


TACTTTAATA 


TTGGCTAAAA 


TATTAGAATA 


TATTTCCAAT 


TACGGTCTAT 


TTAGCACCAT 


600 


TTTGGGAATG 


GGCTTCATTT 


CAGCGTTTTT 


ATCATCAATA 


ATGAATAATA 


TGCCTACAGT 


660 


TTTAATAGAT 


GCGATTGCTA 


TTGGTCAATC 


AAATGTCCAT 


GGCATGTTAA 


AAGAAGGCCT 


720 


AATTTATGCG 


NATGTTATCG 


GTTCTGATTT 


AGGTCCCNAA 


AATTACACCG 


ATAGNCTCTT 


780 


TAGCTACATT 


NCTGTGGTTA 


CACGTCTTAA 


CACAAAAAGA 


TGTTAAGACT 


TCTTGGGGCA 


840 


CATACTTTAA AACTGGTATC 


ATCATTACAA 


TTCCAGTACT 


ATTTATAACC 


CTCATAGGGT 


900 


TGTATCTAAC 


ACTTATCATA 


TTTTAAGAAA 


TAAAAAGACG 


CTTTCTATTT 


ATTGCGATAG 


960 


AAAGCGTCTT 


TTTATACTTG 


AGCAAGTTAT 


ATCATAACTT 


AACTCTATCA 


AATTACTTAG 


1020 



(2) INFORMATION FOR SEQ ID NO:746: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:746: 
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ALA X\_~rl luun 


^uAjnnnnX 


AAAGAAACCC 


NACAATTAAT 


60 




GGAGTGTGCT 


TCNACCNCAT 


TTGTAAGTAT 

X X VJ X AA13 X X 


TTATCCGTTA 

x x n x v— x x 


ATAATTTCTA 

x* x nn xxx x ** 


120 




GAACNATTAT 


CCAAATTCCA 


TCCTTTCCTA 

X V^N*** X X X V** X *»■ 


ATAAAGTGGA 


CNACGGAGGG 

\w*X^X*X* 


180 


AAi XAALAAn 


GACAAAGATG 


ATGAGGGTAG 


CACTATCAAT 

XXIX X 


CCAACAATCA 


CGGATAGAGG 


240 


CACAAAGATC 


AAATANNAAA 


GCGGTCATCA 


GTCAATCTGA 


TGTGI 1AGCA 


AGAATGACAA 


i An 
300 


TAGAAGAACA 


TGAGCAAGAC 


GGCGAAAAAG 


CTTATCAATA 


TGTTCTTAAC 


GCTGAACCAT 


360 


CAAACTTATT 


CGAGACAAAG ATAAGACACT 


CAAGCAACAT 


TAAAATTAAC 


AACANACGTT 


420 


TCATTAATCC 


AAGTATTAAC 


GACGTAGTAC 


AAGCAATCAG 


AAATGGAAAC 


TAATAAAAAA 


480 


ACTAAAAAGG 


ACGGTATTTA 


ATTATGAAAA 


TCACAGGACA 


AGCGCAATTT 


ACTAAAGAAA 


540 


CAAATCAAGA 


AAAGTTTTAT 


AACGGCTCAG 


CAGGGTTTCA 


AGCTGGA 




587 



<2) INFORMATION FOR SEQ ID NO: 747: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:747: 



TCATAATATC 


TTGTCACTCG 


GTACTGCTAA 


AGTGGTTNTG 


CNAAGTGAGA 


TAGCNGTAGA 


60 


NTCAACTGGT 


GAAGAAATAA 


CGTCTTGCGG 


AATACTAATG 


ANACTTGCGC 


CATNTTNTCC 


120 


TGNAGTAGCA 


ATTCGAATTG 


CAGGTTGTCA 


TAACTTCTGA 


TAATGATTCA 


GGATCTTGTA 


180 


CTTCTTCACT 


GTATTTTGAT 


GAATATTTTA 


ATAGCGCAGC 


ATTATCAATA 


CTTTGATGCG 


240 


CTAATCGTAA 


TAAATCATTA 


CGTTTCACTT 


GTCCACCTAA 


CGCTAATACA 


GGATCCCCTT 


300 


CAGATGTAGC 


TGTTAATAGT 


CCAGTCGTTA 


AATTACTTAC 


ACCAGGGNCA 


CTTGTAACAA 


360 


GTACTACACC 


CGGTTTACCT 


GTTAATCTTC 


CAATACCTTG 


TGCCATCATT 


GCAGNATTTT 


420 


GTTCATGACG 


AGTGACAATA 


AGTTCAGGAC 


CATCATCAAT 


TAAAGCATTA 


AATAGATAGT 


480 


CTATCTTTGN 


ACCCGGAATA 


CCAAAAACAT 


ATTCTNCCCC 


ATTATTTTTC 


AAAGTATCAA 


540 


ATAACCCATA 


TCGGNTGCAG 


GGGTACTTTT 


TATCAAGTCA 


TTTATNTTCA 


ATTTCCTTCC 


600 


CAATNGTGAT 


GTATTTGATG 


GTGGTNTTTA 


CATGGTAATT 


ACAATGGGGG 


ATTGGGGGAA 


660 


TCTTATGATA 


GGNTACGCAA 


AAATTAATGA 


TGGTTTATCA 


AAATTTTACA 


TTTTTCCGAT 


720 


TAGATTNGGG 


GAATACAATA 


AAATTTGCCT 


CCCAANAAAT 


TATTTTTACC 




770 



(2) INFORMATION FOR SEQ ID NO: 748: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: lin ar 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:748: 



GGCACGAGCG 


AAGGTGGACG 


TACTGTAGGA 


TCAGGCGTTG 


TTACTGAAAT CATTAAATAA 


60 


TTTCTAATTT 


CTTAGATTTT 


ATATAAAAAG 


AAGATCCCTC 


AATCGAGGGG 


TCTTTTTTTA 


120 


ATGTGTAAAT 


TTTGTAATGG 


CTATTCGATT 


TAGAAGAACA 


ATAATTGATG 


AAAGACTGAC 


180 


TAATAAAACT 


TATAACTGAT 


AATACTGTTT 


AAATAAAATT 


GTTGAGTCTT 


GGACATTGTA 


240 


AAATGCTCCC 


TTCAAAGTTT 


TCATTTTTTC 


AATGTCTACT 


TTGAAGGGAG 


CATTTCATTA 


300 


GTTTATGTCC 


CAGACTCATA 


TCTTTCAATT 


AATTTAAATG 


CTTAATNTGN 


TTTAAATACT 


360 


TGCTCTAATT 


CAATGATTTT 


TAAAAATACA 


GCTACAGCGT 


ATTTTAATGA 


TTTTNCATCA 


420 


ATATCAAATT 


TGGGATTATG 


GTGTGGCGCT 


GTAATACCTT 


TACTTTCATT 


ACCACAACCA 


480 


GTCAGAAAGA 


ATGCACCTGG 


NCGGACTTNC 


AAATAATGTG 


AAAAATCTTC 


TCCAATCATC 


540 


ATTAAATCTG 


ATTCATTAAA 


GCGTACATGT 


AAGTCATGTG 


TTGCTTCTTT 


TAATAACTTG 


600 


ATATGCTTTC 


TCGTTATTAT 


GGACAGNGCA 


AATACCCTCT 


TAATATTAAT 


TCCAAAATCA 


660 


ATAGGNTAAT 


ATCAATTGCT 


ATTGTTAAAC 


CCTGG 






695 



(2) INFORMATION FOR SEQ ID NO: 749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749: 



CGATTAAAAA 


ACTCTATCCC 


TTTGTGTTTA 


ACACACTTTN 


TGAAGCGATT 


GATAAATTAG 


60 


CAATTGATAC 


TGATATAGAG 


ATGAGCGAAG 


ATGAAATTGC 


ATTTTTAACA 


ATTCATTTTC 


120 


AAGCTGCCAT 


CGAGCGTCGT 


NCCAAAACAC 


AAGTAAATGT 


AGTNATTGNT TGTTAANTAT 


180 


GGTCCTAGGT 


GTGTCGAANT 


TTTTAGAAAC 


TNAAATAAAT 


AACTTATCCG 


AAGAGTTGTC 


240 


AGTAATAAAT 


ACGAGTAAAC 


TAGAAAGTAT 


AACACACTAT 


CACTTTGACN 


ATGTAGATTT 


300 


ATTAATAACG 


ANCCATGATA 


TTCCAAAACA 


AACGCTAAAC 


ATACTTCCTA 


AGCACCTGAC 


360 


GACTATCAAG 


GTTGCACCAT 


TATTTTCTGA 


AGATGATCGT 


CACAAAATCA 


GACACGTTGT 


420 


GAAGCAAAAG 


CAAAATCCGG 


TTCAAGCACA 


TCATCATATG 


GACACTGTCA 


ATTTCCTTGT 


480 


CGGAAATACT 


GAACAAAAAT 


CACGGCATAC 


TGTGCAAATT 


TTGGAAGAAG 


CTCAAAAAAT 


540 


ATTACAAGCA 


CATCATGCGA 


TTGGTAGAAG 


GATATNTCGA 


ATCAGCTTTA 


GAGCGTGAGA 


600 


AATCATCTTC 


TACATACAAT 


AGGTAATTTN 


ATGGGCGATT 


CCTCAAGGAG 


GATCCCGGAA 


660 
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AAAGGTTNAC AAATCACATG TGCTTATTTN TCGCACAAAA AGATTGTTTT TCCCTNGGGC 720 
GACAACAACG ACNTGGCAAA CTTGGCTTTN NCTTAGGCGA NTTCGGATTN AGGCAAAGGT 780 
TTTTCAAAAC AAATTGNTGC AANTNATTTG GTTACTTNGG GTG 823 

(2) INFORMATION FOR SEQ ID NO: 750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750: 



AATTCCGCAT 


AAGTTTAATT 


GGTTTGATAG 


ATATCAAAAT 


ACAATGCCGA 


CCGGTAAAAA 


60 


TCCCTAAAAA 


TTGTTGAAGG 


TAATACACTA 


GCTCCTAAAT 


TCATATACCC 


CCCATGTATT 


120 


TTTAAACCAA 


TTCAAAGTGT 


GAATCTNAAT 


AANTATCTNC 


CTGCATCTTN 


AATATTTTTT 


180 


AATATGAACT 


TTGAGATTGA 


TTAAATGTTT 


TAATCACTTC 


AATGCCTTCG 


ATAAACTCTA 


240 


CAATCGCGCT 


ATTCATATAA 


TTATTTGATT 


TCATTTGTTC 


AGCATATGTC 


TCATTAAATC 


300 


CAGACATAAC 


TTTTTTAAAA 


GCGAAAATTG 


AAATTGGTAT 


CGTTACTAAT 


AAGGCACTAG 


360 


CCATACGCCA 


ATCAATGAGC 


ATTATGTATA 


AAAAGATAGC 


AGCTGNCACG 


NAAGTAAGTT 


420 


TCCTATAACT 


TCAGGAATCA 


TATGTGCTAA 


AGGTNATTCT 


ATTGTTTCCA 


ACCTTAGCGA 


480 


CAAATATATT 


TTTTAATTCA 


CCTATTTTCT 


TAGATTCCAC 


TACGCCTAAN 


GGGAGGCGCA 


540 


TTAATTTTTG 


AGCTAATTTT 


TTGCGAATTT 


CAGATAAAAT 


TTCATATNCC 


GTAATATGTG 


600 


ATAGCATCGT 


TGACGCTCCA 


AAACAACACA 


CTTGTGAAAT 


ATAAGTGATT 


AAAGCAATAA 


660 


AGATATAANC 


CATAATCGAA 


TTANTCGTAT 


ATGCATTGTT 


AATCATCNTT 


AAAATAATTT 


720 


TAAAGACTGC 


CCAATATGGA 


ACTAATCCAG 


AAAAGACNCT 


GATGATAGAC 


AACAANAG 


778 



(2) INFORMATION FOR SEQ ID NO:751: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751: 
CTGTCAATGA ATATGATGAT ACTGATAAAA TTTTCTCTAA CCCATCAAAC AAGAAAACAG 60 
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AAGATTATAT 


TTCAGGAAGG 


TTTGGTTGAT 


ATATAATGGC 


AATAATTAGA 


CAGCGATATC 


120 


AGGAGCAACT 


TGATGATTTA 


ATAAAGGAAT 


TACGTCGGTT 


AGGTGCAAAT 


GTCTATGTGA 


180 


GTATTGAAAA 


TGGTATAAAA 


TCATTAAGTA 


TTGACGATAG 


AGGCTTTGNA 


CGACAAACAG 


240 


TTAAAAACGA 


TAAACATATC 


AATCAATTAA 


ATTATGATAT 


TAATGAGCGA 


GTTATCATGT 


300 


TAATTACAAA 


GCAACAGNCC 


ATTGCGAGTG 


ATTTGCGTAT 


GATGATTTCT 


TCAANTAAAA 


360 


ATCGNCTCCG 


ATTTAGAAAG 


AATAGGAGAT 


AATGCATCGA 


GTATTGCCAA 


TATTCGATTG 


420 


CGTACAAAGA 


TTACAGATGA 


TTATGTGTTA 


ACCCGTTTAA 


AGACAATGGG 


TAAATTAGCT 


480 


ATGTTAATGC 


TAAAGGACTT 


AGATCAAGCA 


TTTAAAAAGA 


AAGATACCGT 


ATTAATAAGA 


540 


GAAATAATTG 


AGCGTGATGA 


AGATATCGAT 


GACTTATATA 


GTCATATTAT 


TAACGCAACG 


600 


TATCTTATTG 


ATAACGATCC 


ATTTGTCGCT 


GCACAAGCTC 


ATTTAGCAGC 


AAGACATTTA 


660 


GAACGTATTG 


GTGATCATAT 


TATTAACATC 


GCTGAAAGTG 


TTTATTTTTA 


TTTAACAGGT 


720 


ACACATTACG 


AACAAATAAC 


TTAAAGTTAT 


TACTATAAAA 


TCCCTTACGA 


TAAATATATA 


780 


TTTCTATCAT 


TCATAAACCC 


TCAAAAAAAA 


CCAAGATTCT 


CACAAATTAG 


NAATGNGTGA 


840 


AAANCTTNGN 


GNTATATTTT 


GGTTCTTACT 


TATTAAATNG 


GTCTCGCATC 


TTAGGNTATT 


900 


TGGNTTGGTC 


AATTTCATCT 


TG 








922 



(2) INFORMATION FOR SEQ ID NO: 752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752: 



TCACACTCAA 


TTCTACATCT 


AATATCAATT 


TATTACCAAT 


AGAATTAGCA 


GCAACAAGAT 


60 


ATTACNTNAN 


ATTACTCCCC 


GAATTTAAAA 


TATAAACAAT 


GTAATAATTA 


AGGGNATAAA 


120 


TTGATTTATC 


CTGTAACTAA 


TTAAATGTGT 


ATGGTATCTT 


CTAACATCAT 


GAAACGCATC 


180 


TTAATATAGA 


TATGTACATT 


TAAACCTTGA 


GAGATAGATT 


ATAGTAAACT 


TTAGTACNAA 


240 


GTATGAATTG 


CTGAACTCCA 


ATGACTACTA 


TTTGTCCGTT 


ACAACTAAAG 


TTTCAATCTA 


300 


TCTCTTTCTT 


TTTGTGTTTA 


AAACACGACT 


TTGTAGTAAA 


TATTGAACAT 


CATATTTATT 


360 


AGCAAGTTCT 


GCTAAAAAAT 


CTAATAAAGT 


TTGGGTGTCG 


ATTTTACCAT 


TAATAAAATC 


420 


AGCTGTAAGT 


AGTNTTGAGT 


GTTTTCTTTC 


TTCTTCATTC 


GCACATTTTT 


TAAAATATCC 


480 


CCACATATGA 


GCAAAAGCAT 


TACATACACT 


TCCTATACTC 


GGNTCAANTA 


ACAAAGCATC 


540 


ATCAATCATT 


TGTTGGACTT 


GATGAAGTGA 


TAAATCCGTT 


TTTAAAGCAT 


TCCTTATCAT 


600 


TTGGTAAGAC 


TGTTGACTGT 


GTAACAATAC 


ATGATATTTT 


TCCTCACGCC 


AAAGCTGTTC 


660 


GATATGGCCA 


CGTTCTTTCA 


TAAAATTACC 


TCCGATAATT 


AATGTTAATA 


TACATGCATA 


720 


TAGTATAGCA 


ATAAAAATGA 


ATGCGCTATC 


AAAAATGGTG 


TAGGTGTTTA 


TATAAAATGA 


780 


TAG 












783 
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(2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753: 



GGCGATGTAA 


AAAACGATGT 


AAGTGTCCTG 


AGGATAAAAG 


AATCAATATC 


TTACCCTGTT 


60 


AGCCACGTGT 


TAATTAATGG 


TATTCGCTAC 


AAGATAATTG 


ATACAAAGAC 


ATACAGACAC 


120 


GAAACGTCAT 


ATTATATCGA 


AGAGGTCAAT 


TGATGAATAT 


AGACGGATTA 


GACGCACTGT 


180 


TAAACCAATT 


TCACGATATG 


AAAAACAACA 


TCGATGATGA 


TGTAGATGAT 


ACTTTACAAG 


240 


AAAAAGCCAA 


AGAATATGTA 


GTACGAGCGA 


AATTGAAAGC 


TAGAGAAGTA 


ATGAATAAGG 


300 


GTTATTGGAC 


TGGTAATTTA 


TCACGCAATA 


TCAGATATTA 


AAAAACTGGC 


GATTTGCAAA 


360 


TACACTATCA 


CATCGCATGC 


AGCTTATAGT 


GGGCTCCTTA 


GAGTTTGGGG 


ACTCGATACA 


420 


ATGGAGGCAG 


GACCCTCCTA 


TGTGGCAGTA 


TATTGAGGGT 


AATAAGAA 




468 



(2) INFORMATION FOR SEQ ID NO:754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:754: 



CCAATTGATA 


TTACGGCGAT 


TGTAACNGTT 


GCTGATAATG 


GTGGGAGTAC 


NGGGNAAAAT 


60 


CAGAGATGAA 


ATGGTTATAC 


CAGCACCAGG 


AGACATCAGA 


AATGTGATTG 


CAGCTTTAAG 


120 


TGATTCTGAG 


TCAGTTTTAA 


GCCAACTTTT 


TCAGTATCGC 


TTTGAAGAAA 


ATCAAATTAG 


180 


CGGTCACTCA 


TTAGGTAATT 


TATTAATCGC 


AGGTATGACT ANTATTACGA 


ATGATTTCGG 


240 


ACATGCCATT 


AAAGCATTAA 


GTAAAATTTT 


AAATATTAAA 


GGTAGAGTCA 


TTCCATCTAC 


300 


AAATACAAAG 


TGGTGCAATT 


AAATGCTTGT 


TATGGAAGGA 


TGGAGAAATT 


TGTTTTTGGA 


360 


GAAACAAAAT 


ATTCCCTAAA AAACATAAAA 


AATTGNTCGG 


CGTGTTTT 




408 



(2) INFORMATION FOR SEQ ID NO: 755: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:755: 



AATTGNCGGG 


NCGTCATTAN 


TTTAACTCAG 


TANCACTTGC 


ATTCATTCCG 


TCAAAGTTTT 


60 


CATTAGCTAT 


CTTCATCGCN 


TTANCCATTG 


CAATATCATT 


TGCACATTAA 


CTGTGCCACT 


120 


TCTAACGCCA 


TATTCTTGNC 


CACCACCATG 


GNCAGATGGG 


TCAACATTTT 


GNATGTGATT 


180 


TACAAGTAAG 


ACGCCTTGTC 


CTTTTAAACC 


CATTAAACTT 


TGTGCCCACN 


TAAACNTAAT 


240 


ACTATCTATG 


TTATTTGAGA 


TCCCATTGAA 


ATTTTTGCCG 


GAATGCTTGT 


GAACCCGCAC 


300 


CTACATGAAA 


ATGTGCCTTA 


GGATAATTTT 


TTATAACTTT 


AGCCATTTTG 


TGGAATAGGC 


360 


TGNATTTTGT 


CCAGTTACAT 


TATTTACAAT 


ACAATACATT 


GTNACTAAAC 


CCGACTTTGT 


420 


CTGACACCTA 


ATTCGGTTGA 


AGTGNCCTAA 


GTTAATACTG 


CCATCTTTCT 


TNACATCAAC 


480 


ANANTTAACN 


TTAAACCCTT 


CGTGTGCTCC 


CCAAANATCT 


TACCAACCCC 


TAATNCGGAC 


540 


GGATGCTCTA 


ACACGGATGT 


AATTATTTCC 


TTCGCTGNAC 


CAAATTTACG 


ATAGGCAATA 


600 


CCTTTTAAAG 


CAAGATTATT 


GGATTCAGTT 


GCACCACTAG 


TGAANACAAC 


AGCATAATTT 


660 


CTTTTTGAAT 


TAATCATTGC 


ATTAATTTGG 


GCTTTTGCTT 


GTTGTAGTAA 


TTGATTTGCC 


720 


TGCAAACCAG 


CTCGTGCCGC 


TCGTGCC 








747 



(2) INFORMATION FOR SEQ ID NO:756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756: 

CTACATAAAG TCTGGATGAT TATATGTACT AACCTTTGTT TCACATTGAC TATCTGTTTG 60 

ACATAAAATA AGCTTTGCAA AATTAATCGC AACTTGTTTC ATCGTTTGTG CATCATCACC 120 

TTCAAATAAA TAGGCATGCG ATAATTTATT TGAATGATAT GCATTCGTCA ATTGTTGCTG 180 

TTCATCCATT CATTACAAAC TCCCTTTTGC TTTTATATAA AAAGGTTGCC AAAGAGCAAC 240 

TAATTACTGC AACAAATAGC GACCATGATT TGGTGATTAT TTTTAGAACC ACAAAAAATC 300 

AAAATCTTAT TGGNATCTTG AACTAGGTCG CAGGGCAAAG CNAACCCAAA ATTGGTTAAA 360 

TAACTTTATT AATACACTAT TTNTANGTAT CTCTTAATTT GGATGGATNG NNTTTATTGG 420 
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AATTNNAAAT TGGATGGGAA TGNATCAACT TGGCATNACA AATNCAAGAG AACCGNCAAC 480 
TTCCAACTTT CAAACTTGAT TATTGGATTG TTCCGGATTC CC 522 

(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757: 



CCAATCGGTC 


TTATCTTTCA 


ACACGTTTGA 


TTGTANCGGA 


TATCACCCGG 


TTCATAAATC 


60 


CNAAATCGTT 


GTAATATATT 


TCTCTTTATA 


TTCATTATTT 


CTAGGAATCC 


ACCCCCNCCG 


120 


TGCGGGCAAG 


TTTCTGGGAA 


ATTTAACAGC 


TATCCAGCCT 


TTCAACAAAT 


AAATCTGAAA 


180 


TTCGATGTTC 


TAAAATTTCT 


GCTTCTTGAT 


GTACTTCTTC 


CCAATTATAT 


TTCAATATTT 


240 


CTATTAAAAA 


TAATTCTAAT 


AGTCGATGTC 


TCTTAATGAT 


ATCAAGCGTA 


TGCGTTAAAC 


300 


CATCCTCTGT 


TAATCTAACA 


CCTTTGTATG 




AACATAGCCT 


GCTTTTTCAA 


360 


GACGTCCTAC 


CATTTCACTT 


ACAGATGGAG 


GCTTAATATT 


TAAAAATTGA 


GATAAGATTT 


420 


TATTTGTCAC 


AAAGTTTTTA 


TCGCCATTAA 


TTCGTAAGGA 


TTTGCCTTTA 


AAATAAGTCC 


480 


TCTTTTTCTT 


CAGNTAACAT 


ACTTTCACCT 


CAACAGACAT 


TGTCTATATT 


ATATCACGAA 


540 


TTTACTTGAC 


ATGATAAATA 


TTCTCAGTTT 


ATTANACAAT 


TAATTAGGTT 


AGCCTAAACT 


600 


TTTAATTAGG 


AGGTATAAAC 


GTTTGTTAGA 


AACANAAGAT 


TTAAATCTGT 


TTTTAGGTAA 


660 


TAAGCATGTA 


CTTAAAAACA 


TTTCCTTATC 


GATACCAGCA 


CGAGCGGCGA 


AATAATTGGT 


720 


ATCATGGGCC 


CGAATGGTGC 


TGGTAAATCT 


TCCCTTATCA 


AGTCTTTAAT 


TGGTGAATTT 


780 


AATGCTACCG 


GTACTAAATN 


GTTNTATAAC 


AAACCTATAC 


AACAACAACN 


GCAACAGATT 


840 


ACATATATTC 


CACAAAAAGC 


ACATATTGAT 


TTAGATTTTC 


CTATAAGTGT 


GGAACAAGTG 


900 


ATTTTATCAG 


GTTGCTACAA 


AGAAATTGGA 


TGGTTTAGAC 


GACCTAATAA 


ATCAGCAAGG 


960 


GATTAAACTC 


AAACAGTTAT 


TAAGCGATTT 


AGAATTAGAA 


TCTTTTACGT 


CATCGACAAA 


1020 


ATTTCAAGAA 


TTAAGTGGGT 


GGGACAAATT 


ACAACGGTGG 


TGCTTANGTA 


GGAAAGAGCA 


1080 


TTGGATGGTC 


CCGAAAAGTG 


AAGGTTTATT 


TTCTTTAGAT 


NGAGCCCGGT 


TGGTCGGGAA 


1140 


TTTGGATTTT 


AGGGTAGCGA 


AAAAATTAAT 


CAATGACAAA 


AAATCCGAGA 


ACTTTAAAAC 


1200 


AACAAAGGA 












1209 



(2) INFORMATION FOR SEQ ID NO: 758: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic UNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758: 



ATCCATCACT 


AAAGCCATGG 


TAAAGCATTA 


TAAGAATCGC 


CGAATATAGG 


TATGATTTTC 


60 


CACATTTTAA 


TTAGTTAAAC 


GTCAGTGTAT 


TCGAAATGTA 


CTAAGTGGAC 


GTGTAGGTTA 


120 


TCACCTCCTT 


GGAAAATGGT 


ATTAGGTTTA 


TTCCCAAAAG 


AAGACAAAAT 


TAAAGCAATG 


180 


GATGCACTAG 


NAACGCGTCA 


ATATCTTAGA 


TAAATATAAT 


CAACGCTCTG 


ATGAATTATC 


240 


AGGTGGCCAA 


CAACAACGTA 


TATCTATTGC 


ACGTGCGCTA 


TGCCAAGAAT 


CCGAAATTAT 


300 


TCTTGCAGAT 


GAACCAGTTG 


CTTCATTAGA 


CCCATTAACA 


ACGAAACAGG 


TTATGGATGA 


360 


TTTAAGAAAA 


ATCAACCAAG 


AATTAGGCAT 


CACAATTTTA 


ATTAATTTAC 


ATTTTGTTGA 


420 


CTTGGCAAAA 


GAATATGGCA 


CACGCATCAT 


TGGTTTACGT 


GATGGTGAAG 


TTGGTCTATG 


480 


ATGGTCCCTG 


CATCTTGAAG 


CAACAGATGA 


CGTATTTTAG 


GTGAAATATA 


TTGGACGGTN 


540 


CAAATTAAAG 


AAAGGATGGA 


AAAAGCTTAG 


GAGGTGGAAC 


TTAACAATGC 


CTTNTAGGAA 


600 


ATACCTACAA 


AGGTATTGAC 


TCCCCNTNTA 


AAGAAAAGGT 


TTCTTTTAAA 


ACGAAGTTTN 


660 


ACCTCCANGT 


NATCAATGGT 


GCTTTATCAT 


TGGGAGTTCC 






700 



(2) INFORMATION FOR SEQ ID NO:759: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759: 

CTAAAAAGTT TTTCCTGGTT GGTTAAGAAG CCAGATTTGA ATGCGACCAA CCGACCTGCC 60 

ACCAAAGAAT TTTTAGAAAC GGCGTTAAAG NACGAAGAAA TATTCCNGCT ATTGAGAATA 120 

TTACGGATTC AACTCAANCT AAAATGGATG CTTATAAAGA AGTTAGACAA TCAGCTACAG 180 

CTAGAAAAGC GCACAATGCT ACAGTTTCAA ATGCAACTGA TGAAGAAGTA NCAGAGGCAA 240 

ATGCAGCAGT AGATGCAGCT CAGACAGAAG GTTTAC ATGA CATCCAAGTT GTTAAATCAC 300 

AACAAGAAGT GGCTGATACT AAAGCAAAAG TATTAGATAA AATCAATGCA ATTCAAACAC 360 

AAGCAAAAGT TAANCCTGCA GCTGATACAG AAGTAGAAAA CGCATATTNA TACACGTAAA 420 

CAAGAAATCC AAAATAGCAA TGCTTCANCT ACAGAAGAAA AAGAAGCT 468 

(2) INFORMATION FOR SEQ ID NO: 760: 



(i) 



SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:760: 



GAATTCCTAT 


TTACCTTGCT 


GGACAGCNAC 


AATTTTCCCA 


TTAACCTTAC 


ATAGGATANC 


60 


NACGCCTTGA 


CCTACTTTTT 


CAATATGTGG 


CGCTCTTATA 


AAGACACCTA 


CAATATCTGT 


120 


AGNGATACCT 


TTAATATCTA 


ATTCTGTTTC 


AAAGNTGNCA 


ACTTGTCTAC 


CGAATGAGTT 


180 


TCGGTGTACA 


GGAATATNCA ACTTGTTAAG 


GTATCCTTCT 


TCACCAANTA 


TATCTTGGGN 


240 


TAGAACTATT 


AATCCTGCGC 


ATGTCCCAAA 


CATAGGTAAA 


GGTGANTTGT 


GGTGAAGNCT 


300 


CTTTANATCC 


ATATAAATTC 


ATTAATCGAC 


GNNACGGNGG 


GGGNCTCGNC 


ACCAGGNNAT 


360 


ATTANGCCNC 


GNNNNCTTCT 










380 



(2) INFORMATION FOR SEQ ID NO: 761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 761: 



AGTTGGNCCC 


AATNAAGAAG 


AAATCGAGGG 


CCTTAAATAT 


TACCCTGNTG 


GCCGAGTACC 


60 


ACACCGTNAC 


GTCGACTAAT 


GAATTTATAT 


GGAGTTAAAG AGNCTTTNCC 


AAAATTCAAC 


120 


TTTACCTATG 


TTTGGTACAT 


GCGCAGGAGT 


AATAGTTCTA 


GCGCAAGATA 


TAGNTGNTGA 


180 


AGNAGGATAC 


CNTAACAAGT 


TGAATATTAC 


TGTNCANCGA 


NACTCATTCG 


GTAGACNAGT 


240 


TGACAGCTTT 


GAAACAGANT 


TAGATATTAA 


AGGTATCGCT 


ACAGATATTG 


ANGGTGTCTT 


300 


TATAAGAGCG 


CCACATATTG 


AAAAAGTAGG 


TCNNGGCGTA 


GATATCCTAT 


GTNAGGNTAA 


360 


TGAGAAAATT 


GTANCTGTCC 


ANCAAGGTAA 


ATATTTAGGC 


GTATCATTCC 


ATCCTGAATT 


420 


AACAGATGAC 


TATAGAGTAA 


CTGATTACTT 


TATTAATCAT 


ATTGTTANNA 


AAGCATAG 


478 



(2) INFORMATION FOR SEQ ID NO: 762: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 762: 






ACCAAGATTG 


GGANTNCCTG 


TAAATCCCCG 


GATAATTGAT CCAATGGGTC 


CAAGTACNCN 


60 


ACCCCNCTGA 


TGATNCCCCA 


CATGTTTATG 


TAAGTGGTGA GNATANTCAN 


TAGTAGAAAA 


120 


TGATTTAATG 


AAGAATGTNC 


CATTAAGCCN 


AANTANAAAT GTATNTNAAA 


TGATAGATAA 


180 


ACAAGAGTTT 


ATGNCTGTGT 


CTGGAATGAA 


TCGCATGGCT TATAATGATC 


AATATATTAT 


240 


AGGTCAAAGA 


GGAGACGAAT 


TTATTCTTTA 


TAAATTTGGA GATGAGTCAA TGCGTGTTTA 


300 


CAATACTGAA 


TTTGAAATGC 


AACAAGACTT 


AAATGAATTA GGGCAAAATT 


TACAACTAAA 


360 


ACCCGAAAAT 


GCTTATCAAT 


AGGAAATTAT 


TGCATAAGAA ATATATTTTA 


GTTATCCTGT 


420 


CACTAGAATA 


AATTTAAAAG 


GTTGGTGTAG 


TTCAAATGGC NGAACAATCN 


AAACNGAAAC 


480 


AAGCTAATGA 


ACAACAAAAG 


GCCCNGAACT 


TATTCGCACG TTGGAGACAA 


CTTCA 


535 



(2) INFORMATION FOR SEQ ID NO:763: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 519 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763: 



GATCCAACGT 


TACCGTGATG 


AATAAAAGCG 


CACAAGTTGG 


GGCGCAGCCN 


CAACAATTAA 


60 


GATACAAGCG 


AGCCATTTTT 


GTGCTAATTT 


TTGCGCGTAA 


AAATGTAACG 


TCAAAGATNC 


120 


ACCGTATGTA 


CCAACAATAT 


GTTAAGAGAT 


ATTAAAAAAT 


ATGAGGCACA 


CACGGATCCC 


180 


AGCTGTTGAA 


CACAAATTCG 


ATGCATTCCA 


AACAGANTTC 


CATATTTCTG 


ATNATGATCC 


240 


NAGCCTTGTA 


TGNCTGGTCA 


AGTANACANA 


CGTATATCGC 


ATTAGGCAAT 


ATGATGNCGA 


300 


CAGCCGCANT 


GTTAGGTATT 


GATTCATATC 


CGATGGAAGG 


TTTTAGTCTG 


GATACAGTGA 


360 


CAGACANNTT 


AGCANATNAN 


GGTATCTTAG 


ATACTGAGCA 


ATTTGGTTTA 


TCAGTGATGG 


420 


TTGCATTTGG 


CTACAGACAA 


CAAGATCCAC 


CGAAAAATAA 


AACACGCCAA 


GCATATGAAG 


480 


ATGTTAATGA 


AGGGGTTGGA 


CCAANAGAAT 


AAATAGNAG 






519 



(2) INFORMATION FOR SEQ ID NO:764: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 753 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764: 



CTGTTACTAG 


CGTCATTTGT 


ACTCATTACA 


CTTGGACAAT 


CTGTAGAACA 


ATTACGTGCT 


60 


GCAATTATTT 


ATGTTGTCTT 


GAATATTATT 


GGTTCATGGC 


TATTCTTATT 


AGGTATAGGT 


120 


TTACTTTATA 


AAACAGTAGG 


TACATTAAAC 


TTTTCACATA 


TTGCAATGCG 


TTTGAATGAC 


180 


ATGGGAGATA 


ATCGCACTGT 


TACAATGATT 


TCATTAATCT 


TCTTAGTCGC 


ATTTAGTGCG 


240 


AAAGCAGCGC 


TGGTCCTTTT 


TATGTGGCTA 


CCCAAAGCCT 


ACGCTGTGTT 


AAATACTGAG 


300 


CTTGCAGCAT 


TATTTGCAGC 


GTNAATGACC 


AANGTAGGGG 


CCTATGCATT 


AATTCGGATT 


360 


CTTCACTTTA 


CTATTTGGAT 


CAAACATAAT 


GATCTCATAC 


ATCCATTGCT 


AGCAACTATG 


420 


GCTGCTTATT 


AACTTATGGC 


CATCCGGCGC 


TATAGGGTGT 


CAATTGCTTA 


TTAAAGATAT 


480 


TAAAAAGATT 


GCAGCTTACC 


AAGTCATAAT 


CTCAATAGGA 


TTTATCATTT 


TAGGNTTTAG 


540 


GAACAAACAC 


GTTTGGAGGT 


ATTAANCGTG 


CAATATTTTA 


TTTGGGTAAA 


TGACAATGGN 


600 


TGGTAAAAAC 


ATNGCTTATT 


CCTTTATTAT 


TGGGGAGGTT 


AGNTNTACAT 


TACAGGCTAT 


660 


CGACAATATC 


AATATTTGGG 


AATGGCTTAG 


CTAAAAAAGA 


ACCTTTATTT 


GGGAGTTGCG 


720 


TTTATTATAA 


TGATTTTTGG 


TANTTGGNCG 


GCG 






753 



(2) INFORMATION FOR SEQ ID NO: 765: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765: 



GAAGAAGAAT 


CCGACATANC 


CGNAAGTTCC 


TNCATCTAAT 


TTGGGGAATG 


TTCCCCTCCC 


60 


CATGTTAGGG 


AACCACCACC 


ATCAGAATGC 


GCCTAATAAT 


TGTCNCCTCC 


GAATGCTTTT 


120 


TGTGTTACCC 


GTCGTCGATT 


GTTTTNCCAG 


CCAAATAATT 


TAANCCGTCT 


AGATCATCAC 


180 


TATCTTTTTC 


AATAGTAATA 


TCATATTTNG 


GATGATTTAC 


TTTTACCNCT 


GTTTCGAATA 


240 


AGAAGCGCCG 


GCCTTCTTGT 


ACATATTGAC 


CTAAAGAATG 


TAAATCAGTT 


GTGTAGTTGG 


300 


CACTTGAAGG 


ATAGATACCT 


TTGAAGTCTT 


TACCTTCTGA 


TTCACCAAAT 


AATTGTTTCC 


360 


ACCATTCATT 


AAAGTATTGC 


ATAGATGGTT 


GCATAGTTAA 


TCAACATTTC 


TGTNGTATAA 


420 


CCTTTTGCAG 


ATAAAATGTG 


TCGANTTGTC 


GCATATTGGT ATNCAATGTT 


GTCTTCTAAT 


480 


TTATCTGAAG 


ATAATTCTTC 


ACGTGCTTTT 


GCAGCACCAA 


TCATCATAGC 


ATCGAG 


536 
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(2) INFORMATION FOR SEQ ID NO: 766: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 : 



GTCGTTACTA 


CTTNTAATTT 


GCCATTTGAC 


TTATCACTGC 


TTTGTTTACC 


ACCAGTACCA 


60 


CATGCAGCAA 


CTAGAAGTAN 


TAAGGCTAAT 


AATAAAGGTA 


CTAATTTTTT 


CATGTTAAAC 


120 


TTCCTCGTTT 


CTTTCTNTTC 


GTAAATTATT 


GTGAAAAATA 


ATGTGATGAT 


ATAAATTACA 


180 


AACGTACAAA 


GTACGATTGT 


CGNACCACTA 


GGAATGTTGT 


AATTNTAGCT 


GTAATAAAGT 


240 


CCGACAATTG 


AACTTATGAC 


ACTTATTAAN 


CTTGCTATGA 


TCATCATTGA 


GTATAGTTTT 


300 


TTACTAATTA 


AAAATGCTGT 


NGATGCAGGT 


GTAATTAATA 


ATGCAACTAC 


AAGAATTAAT 


360 


ACCTACCNNA 


TGGAATCACT 


TGCTACTGTT 


ACTACTGAGA 


GTAACAACCA 


TGCACAATGT 


420 


ATGTAATAAC 


GTCGGTTTTG 


AGACCACTCA 


TTCTTCTTAA 


ACGTTGCCCC 


GATTGTAGAC 


480 


TTCATTTAAT 


GGCGATAGAA 


AAATACAAAT 


TGGTTTNNGG 


CAGATGAACC 


AGCCACACGG 


540 


GTGGGTTTAC 


C 










551 



(2) INFORMATION FOR SEQ ID NO:767: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767: 

TNAATGNACC CNCGAATNAA CCTACACCTG TCGAATCCAA ATAACTCACA TTTNCTNAAT 60 

TAACATAAAT ATCCCCGAGT TCCATCTTGT CTCATAGTTG NTNNAACCTC TTCTAATTCA 120 

GGCACAGTAT AAACATCTAA TTCTCCACCG ACTTTAACTT CGTAAAATTT ATCTTGAGTG 180 

GTTGTTTCTA TATTAAGATT CATTTCATTA CACTCCTACT TATTANAAAT ATTTATCTTC 240 

ATTTTAAGCN CTGTATCCAC CATATTTATA TTTCAAATTA TTATATACCC ATCGAAATAA 300 

TTTCTAATCT TCTTTTTTAA ATTAATTTAC TCTTTTTATA ATCAAAATAG TCATATCATC 360 

TTTTTTATTT GGGTTTTGAA GCCTTAAAAT TGCTTCATAG ATAATTNGAA CAATATCTTG 420 
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TGGGTGCAGA TGTTTATGTT TTTTAATATA TTCTAAAAGT TTGTGTTTAT CTATAAAGGT. 480 

ACCTTCACTA TTTCTAGCTT CAGTCACACC ATCCGTTAAA ATGATAATTA AATCATCAAG 540 

GTATATAGGA ATTTCTTGTT GTTGATATCG TGTGTGTGAA CTGATTCCTA ACACTCTACC 600 

TCTAACTGAA ATTTCTNCAA ATTCTTCTTT TACAGCGCGA TAAATATATC CAGGCTCATG 660 

ACCAGCTGAA CTACAATACA ATAAATGGNT CATCTCTTCA TATAAACCAT AAAACATGTG 720 

ACGAACATAT TGTGATTAAT ATTCCTTTTC ACAACACGAT TTAA 764 

(2) INFORMATION FOR SEQ ID NO:768: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 7 68: 



CACAAGCTCT 


AAACATTAGT 


GTAAACCAAT 


GCTTAGAGCT 


TTCTAATTAT 


TTTATGCTTT 


60 


AAAAGATAAT 


GTGTTATCTN 


CGATGNCCTT 


ACCGTCTTTA 


ATAACTTTTT 


CCTGCGTGAT 


120 


TGATACCCAN 


NATGATATGG 


AATATATTCA 


TGATTTGGTG 


CATCCCAAAT 


TACTAAATTA 


180 


GCCTTATCAC 


CTGTGTTAAT 


TGTACCCGCG 


TTAATATCTA 


TTGCTTTAGC 


AGCGTTGACC 


240 


GTAACAGCAT 


TCCACACTTC 


ACTAGGCGAA 


AGCTTTAATT 


TTAATGCTGC 


AATGGCCATA 


300 


ACAAGTTGCA 


AGTTATTCGT 


TACACTGCTG 


CCTGGATTAT 


AATCAGTAGC 


TAACGCGATG 


360 


GCACCATTAT 


TATCAAGCAT 


ACCTCTCGCA 


GCTGCATAAT 


CTTCTTTACC 


TAAATAGAAC 


420 


GTCGNTGCAG 


GTAAGAGGAC 


AGCTACAGTA 


TCACTATTTC 


GCAACTTTTC 


TTTTCCTTTA 


480 


TCACTAGAAG 


CTACTAAGTG 


GTCTTGCTGA 


TATTGCTTGT 


TCATCAATTG 


CTTAATTCCA 


540 


GTCCGCCTAA 


CGGGATCAAT 


TTCATCCCGC 


ATGTATTTTC 


ACTTTTAAAA 


CCT 


593 



(2) INFORMATION FOR SEQ ID NO:769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:769: 
GGGGACCAGC TTACNACCTG TAGCTNCAGG NNNAAGCATA NATAGGATAN GGAGCCNACC 60 
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AATGAATAAC 


TTTAATAATG 


ACAATCCTNA 


TTGATACTTA 


CCAACAATAT 


TTGGNGNAGt 


120 


TTGAAGCGCA 


TTNCGNGCGT 


GTATTACAAG 


ACGATCAATA 


TATCGAAGCA 


TTAGAAACAT 


180 


TGATGGATGN 


CTATAGTGAA 


TTTATTTTAA 


ATCCTATTTA 


TGAACAACAA 


TTTAATGCTT 


240 


GGCGTGACGT 


TGAAGAAAAA 


GCACAATTAA 


TAAAATCACT 


GCAATATATT 


ACAGCGCAGT 


300 


GTGTTAAACA 


AGTGGAAGTC 


ATTAGAGCTA 


GACGTCTATT 


AGACGGACAG 


GCGTCTACCA 


360 


CAGGTTACTT 


TGACAATATA 


GAACATTGTA 


TTGATGAAGA 


GTGTGGACAA 


TGTAGTATCA 


420 


CTAGCAATGA 


CAAATTATTG 


TTAGNTGGCN 


CAGGTGCATA 


TCCAATGACG 


TTAANNCAAG 


480 


TAGCAAAAGA 


AACAGGTGCT 


TCAGTTATCG 


GTATTGATAT 


TGATCCCACA 


AGCCGGTGNA 


540 


CCTAGGGCGC 


AGAATCCGNT 


AACGTCTTTA 


GCAACCAAAT 


GAAGATATAA 


CANATTACGG 


600 


CATCAAAAAG 


GGTATCTTGA 


ACTTAAAGAT 


ATCAAAGATG 


TGACGCATAT 


CATNTTCCAA 


660 


GCTCCGACAA 


ATCCCTTTAA 


AGTACAGCAT 


CTTAGAAGGA 


ATTATATTGA 


TTTAACAAAA 


720 


TGAAAATGTC 


GNNGGGTTGC 


AATGCGCTTT 


GGGTGGATGG 


CATCAAAGCA 


ATATTTAATT 


780 


ATCCCGTCAC 


AAAGAAACA 










799 



(2) INFORMATION FOR SEQ ID NO:770: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:770: 



CTCNTTGCTC 


TTTTAGTCGT 


CTTCATTATA 


TCCATCTCAT 


GACAAGCAAT 


GAAGTTTGTC 


60 


ATACCACAAT 


GAAACTATAT 


TCACATTTTG 


AAATTGCTTT 


CATATATGCA 


AAACACATTA 


120 


CACTGGATAT 


AGCATTCGGA 


GGCGAACATA 


AATTTCAGAA 


TATTGTAAAA 


ATATAACGAT 


180 


ATATTTATCA 


TTTGGACAAA 


ATATAGTGTG 


ATCTACTAAT 


TTATAACATC 


TATACATAAT 


240 


GTGAAAAAAT 


TTTTAAAATA 


CAATTAAATA 


TCGGTGTAAT 


AAATCCCTTT 


ATACATAGGA 


300 


GTTGTATACA 


TGAAACTAAA 


TAACTATTCT 


TTAAAAGTTA 


AAAACAAACA 


ACTAGNTGNC 


360 


AATTGTGATT 


TAAATTTCTA 


TCTTGGTCAG 


ATCAATCACA 


TTGNTGGTAA 


AAATGGTGTA 


420 


GGAAAATCTT 


TATTAGCTAA 


AGATTTCTTA 


CTAAATAATA 


GTGGAAATAT 


CCCTAAGTCC 


480 


ATTTCTCAAA 


ATGGTAACCT 


TAATATCAAG 


NTCCATCAAA 


TATTCCCTAA 


TGGATATTAA 


540 


CAAAAAGATT 


TTTTATTATC 


ATTGTTAAAA 


TCAAAATGTG 


GGGAACAATC 


GACAAAACAT 


600 


TCGATAAGAT 


TTATTNACAA 


TACTNAAACA 


TCCGAAGCAA TACCCGTtTT 


TACGGTACTA 


660 


CTTAAAAAAA 


CTTGGAGTGG 


ATGGGG 








686 



(2) INFORMATION FOR SEQ ID NO: 771: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 541 base pairs 

708 



WO 97/30070 



PCI7US97/02318 



(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771: 



CTCCTAAGTC 


AATAAAATAT 


TTATGATTTA 


CATGCAACTT 


ATAATTATGT 


GACATATAAA 


60 


TGCATAAAAA 


ATATAATCCT 


AATNACTTGA 


TAGTGAGAAT 


CATTATCAAT 


TAGGTAACAC 


120 


ACAATATTCT 


AGAATTTTAA 


ATTTGAGGAG 


GAAGCGCTTT 


TGATTGAAAA AAGTCAAGCA 


180 


TGTCACGATT 


CATTGTTAGA 


TTCTGTAGGG 


GAAACACCTA 


TGGTTCAACT 


NCATCAACTA 


240 


TTTCCGAAAC 


ATGAAGTGTT 


TGNAAGGNTA 


GAGTATATGN 


ATCCTGGNGG 


GAGGATGAAA 


300 


GATCGACCTG 


CCAAGNACAT 


CAATGANCAT 


GGAATTAATC 


ATGGATTAAT 


CACTNAGAAT 


360 


ACACANNTAN 


TTGAAAGTAC 


TTCTGGCTAA 


TTTAGGCATT 


GAGTTGGCAA 


TGAATAGCTA 


420 


AATTCAAGGG 


ATTAATACTC 


ACGNGTGTTG 


GTGGTCCCTA 


CCTTATCACC 


NACAAATTTG 


480 


GGAGATTATT 


AATAGTTATG 


GGTGGCAANG 


TNGGAAATNG 


GTGGAGGAAC 


CTGNTGCACA 


540 


A 












541 



(2) INFORMATION FOR SEQ ID NO: 772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772: 



AATTGTGANA 


CCANATACAC 


ATGTGTCATT 


GNAAGTGAAN 


TANATGGTTT 


GNNAATACCA 


60 


CAATACATAT 


GCACTTCNAA 


ATAGTAATGA 


TTATTAAGAA 


ATTATGGAAG 


GAGGAATATC 


120 


ATATATAGTA 


CGTGNATCNA 


AAATAGATCC 


AGTCAATATA 


GTAGGTATTG 


GTATAGACTT 


180 


TACTTCATCT 


ACTATTATTT 


TTACCGACGA 


AAACCTTAAC 


CCGGTACATA 


ATTTAAAACA 


240 


ATTTAAAAAC 


AATCCACATG 


CGTATGTGAA 


ACTTTGGAAA 


CATCATGGTG 


CATATAAAGA 


300 


AGCAGAGAAA 


TTATATCAAC 


CTGCTATTGA 


AAATAATAAT 


ANGTGGNTAG 


GCCATTATGG 


360 


ATATAATGTT 


AGTAGTGAAT 


GGATGATTCC 


CAAAATAATG 


GAAGTCATGA 


ATCGAGCACC 


420 


AGATATTATG 


GAAAAAACGG 


CTTATATTAT 


GGAAGCGGGC 


GATNGGATTG 


TANTTAANTT 


480 


AACTAATAAA 


AATGTACGCT 


CGAATTGTGG 


ATTAGGGTTT 


CAAAGCATTG 


TGGGGAAGAA 


540 


GGAAACAGGG 


TCCTCAATTA 


TGGATTTATT 


TGGATAAAAT 


AGCACCCCCA 


CATTTATCAA 


600 


AAAGGTCATT 


CCAAAGATAA 


AAGTATCTTG 


CAACCNGGNT 


GGTTAATATT 


GGGGTGAAG 


659 
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(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 483 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773: 



ATACACACAT 


GTNATCGGAA 


GATNTATTAA 


ATCGTGTAAC 


AGCTTTGGTA 


AGTAAAACGC 


60 


CAGAGGTTAA 


CATTGATATT 


GACGCAATAG 


CAAATAAAGT 


AATTGAAAAA 


ATAAATATGA 


120 


AAGAAAAGGA 


ATCAGAAATC 


GATGCTGCAG 


ATAGTAAAGT 


ATCAGCAAAT 


GGATTTTCAA 


180 


GATTCCTTTT 


TTAATACAAA 


AAATAGGAGG 


TCATAAAATG 


ACTATAAATT 


TATCGGAAAC 


240 


ATTCGCAAAT 


GCGAAAAACG 


AATTTATTAA 


TGCAGTAAAC 


AACGGTGAAC 


CGCAAGAAAG 


300 


ACAAAATGAA 


TTGTNCGGNG 


ACATGATTAA 


CCANCTNTTT 


GGAGGAAACT 


AAATTACAAG 


360 


CAAAAGCAGA 


AGCTGAAAGA 


GTTTCTNGTT 


TACCTAAATC 


AGCACAATCT 


TTGGGTGCAA 


420 


ACCAAAGAAG 


TTTCTTCACG 


GGTTCAANAA 


AAACGTTAAC 


TATTAAGAAG 


AAAAACTTTT 


480 


GGC 












483 



(2) INFORMATION FOR SEQ ID NO: 774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774: 



GGTTGATAGT 


TATGNCCTAT 


GTTGNACGTT 


ATATTGANCA 


GTTTTTGAGA 


CCCNNCAGTT 


60 


AAGAAATAAT 


ATCCAANCCC 


TACCTTTTAA 


TGCTAGATGA 


ANAANTGNNA 


AATTTAGATG 


120 


ATTATATGCG 


TTATTTAATT 


ACTAAAAAAG 


AACNACTTAG 


CAAGTTAATT 


GACAGTCTAA 


180 


TGCTAACATT 


AGAAAATAAA 


TATATTGATA 


TTGCTGAAGC 


ATTTCANATT 


CAATGTGCAA 


240 


GAGAAATCAA TAATCAAGAA 


ATTGAAAATA 


TTAANTCAGA 


GTTGAATAAA 


GTTGAAGCAT 


300 


ATTATGCACA AATTGAAACT 


CAAATTCAAC 


AAACTTCAAC 


TGAAAAAATA 


GCAACAGAAA 


360 


AAACATCGTA 


TCTAATAAAT 


TATATGAACG 


CTGTGGCATA 


GAAAGGCGGC 


GAAACATGAC 


420 


ACACANATAT 


ATATCAACGC 


AAATGTCGAC 


CAATTTTCAC 


TGCATTAATG 


ATNATTGCCA 


480 


ANTTTTACTA 


CATATTTTTT 


GAANAAATTG 


GCTTTTTACT 


CGTTCTATTA 


TTGGGATGTG 


540 
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TATTAGTTTA TGTAGGATAT CTTTATTTTC ATAAAATACG TGGCCTTTTA GCGTTTTGGA. 600 
TAGGCGCGCT ATTAAG 616 

(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775: 



CCATCGCTTT 


TAATTGATCC 


ACTTGATCNT 


TCATTAATGA 


TATTAGTGGA 


CTTATTACAA 


60 


TTGTTGTACC 


ACCTAACAAT 


AAACCTGGTA 


CTTGATAGCA 


TATAGACTTA 


CCTCCACCAG 


120 


TTGGTAAGAC 


ACCAAGCACA 


TTACGATGGT 


CTAATACTTT 


GCTAATAATT 


TCTTCTTGTC 


180 


CTGGTCGAAA 


TGTTTCATAC 


CCAAAGTAAT 


GCGATAATGT 


TTGTTGCATC 


ATAAATTGAC 


240 


CCCTNATTGT 


TGTTCTTTAA 


TTTCTTCTAA 


CTCACTCCAT 


CTNGTGATGT 


CTAAATCATA 


300 


TTGGAATTNC 


AAGTNGGNCT 


TTTTCTTCGN 


TTAGTTCTTT 


AATTTTCCCA 


TAATCTGCAC 


360 


TTGCCTCAAT 


CATGAGCACA 


TCAATTTCTT 


CCATTCTTAC 


TTCCGCTTGT 


TCTATGCGTN 


420 


TCATCAATTG 


GTCAAATTCT 


AATTNNNCTT 


NATATTGATA 


AACATNTTNC 


TTACGGACAG 


480 


GTGTATTAGG 


ATTTAGGTTG 


GTGGTTCAAN 


GNGGGTT 






517 



(2) INFORMATION FOR SEQ ID NO:776: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776: 

CGCAGTAGAT CCAAATGCTA TCAGACCACT TGTTGTAGGT AAAATGAAAG ATGGTACNAT 60 

ACACCNTTCC AAGTGAAACA TGTGCAATAG ATGTGTTAGG TGCCAGANTT TGTCCAAGAT 120 

ATTCATGCAG NTGAATATGT CGTGATTANC CGATAAAGGT ATTACCAGTT AANTCTTAAT 180 

ACACATCATA CGACAACTGC AATTTCTGCG ATGGAATATA TTTATTGTGC TAGACCAGAC 240 

TCAACAATAG CTGGTAAAAA TGTCCATGCA GNACGTAAAG CTTCTGGTAA AAAATTAGCC 300 

CAAGAAAGCC CTGTAAATGC TGATATGGTC ATCGGTGTAC CCAANTCATC GCTATCAGCT 360 
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GNGAGTGGGT TATGCTGAAG AAATAGGTNT GCCATATTGA AATGGGACTA GNTTAANNAT 420 
CAATATGTTG CNAAGAACAT TTATTCANNC AACTCAAGAA TTACGTGAGC AAGGTGTGAG 480 
AGTGAAGTTA TCTGCGGTGG AAGATATAGT AGATGGGAAA AACGTNCC 528 

(2) INFORMATION FOR SEQ ID NO: 777: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777: 



GAATTCCCAA 


TAATAAATTT 


TNGGATAACN 


CNAAAGCNAT 


TTTAAAAGAA 


GTCATGGGAG 


60 


AGAATCGTGA 


TTTCGAGCCA 


ACTGAACAAA 


AAATTGATAA 


CTTCAAACAT 


TCATACGGTA 


120 


CAATTTTGTT 


TTATGAAGAT 


CAAGATGTTG 


TAAGTGGTTT 


ACAAGAACAA 


ATGCCAAACT 


180 


ACTATGATAA 


TTTTGGAATA 


TGGTCAACTC 


AAACAAATGC 


CATGCATCAA 


TTTGGAATTT 


240 


GGGACTGGAT 


TAGGTACTAA 


AGGGATNGGT 


GGTTCANTAC 


AACAATATTA 


TCCACTTGNC 


300 


GATGAGATTG 


ACTTCAAATN 


AATTTAATAT 


TCCTAAAAGT 


TGGNAACTTA 


TTGNTCANAA 


360 


TGGCAAATGG 


GGGGATNTTC 


CG 








382 



(2) INFORMATION FOR SEQ ID NO: 778: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778: 



GAATTCCTTA 


NGTTAACGTG 


TGCCATGTAG 


TACTACACCT 


CTTTTTTTAT 


CGGATTATCT 


60 


AACTCAATAT 


TTAAATATTG 


CTTTACTTTA 


TTTTCCATTC 


TCATATTTAA 


CAACGCTTTG 


120 


ACAGTAATTG 


TTGGTCCAAA 


CAAAATATCT 


TTTAACTCAT 


TTGNATGTGG 


TAGTGTTGGG 


180 


TTAATAGCAT 


GCGCTACTAC 


TTGCTGTACT 


ATGTTAAATA 


ATTCGGTTTC 


TTCAACACAA 


240 


TCATACTGAT 


TAAAATGATG 


GATTAATTCT 


GCCATTTGAT 


TTTGGATAAC 


AGCATGTTGG 


300 


GATTTTGGGA 


AGCACTGCAT 


CTATAGAATC 


AGCTATTAAA 


CTATCAATTG 


GTAATTTTAA 


360 


TATCTGATAC 


ACGAAGGTTG 


GGATGTTTCT 


TAATCAAATC 


TTGGAACCAC 


CCAATCTTTC 


420 
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AACTAAAATT GGATGTCAAA AA . 442 

(2) INFORMATION FOR SEQ ID NO: 779: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779: 



GGCACGAGAC 


CACNCCTCTT 


TTTTTANCGG 


ANTATCCAAC 


TCAATATTTA 


AATATTGCTT 


60 


TACTTTATTT 


TCCATTCTCA 


TATTTAACAA 


CGCTGTTGAC 


AGTAATTGTT 


GGTCCAAACA 


120 


AAATATCTTT 


TANCTCATGT 


GCATGTGGTA 


GTGTTGGGTT 


AATAGCATGC 


GCTACTACTT 


180 


GCTGTACTAT 


GTTAAATAAT 


TCGGTTTCTA 


CAACACAATC 


ATACTGATTA 


AAATGATGGA 


240 


TTAANTCTGC 


CATGTGATTG 


TGAATAACAG 


CATGTTGGAA 


TTGTGGCAAT 


CACTGCATCT 


300 


ATAGAATCAG 


CTATTAACTA 


TCAAGTGGNN 


ATTTTAATAT 


CTTGATACAA 


CGANGGNGGG 


360 


NATGNTTTCT 


TTAAATCAAC 


TCCTTAGAAN 


CCACCNAANT 


CTCTTCAAA 




409 



(2) INFORMATION FOR SEQ ID NO: 780: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 80: 



GTGATGTGGA 


NGAGCNTCAT 


TTGTTGTAAA 


TATTNATTGA 


TTTCAGGGGG 


AAATGGAATA 


60 


TGCNCGGTGG 


CGGGAACATC 


CACCAAATGA 


TGAGCCAAAT 


CCGCAGATNC 


CGAAAGAGGA 


120 


TGGCTCCGAG 


NACCAAGNAA 


AACTTNAAGA 


AGGAGCGTAT 


TGTAGGAACA 


GCTGGCGGTG 


180 


GCATGGTTGC 


AGTTACTGTA 


ACTGGTCATA 


AAGAAGTTGT 


CGACGTTGAA 


ATCAAAGAAG 


240 


AANCTGTAGA 


CCCAGACGAT 


ATTGAAATGC 


TACCCAAGAC 


TTAGTGTTAG 


CANCTACTAA 


300 


TGAAGCGATG 


AATAAANCTG 


ATGAGCTTAC 


CCAAGAACGT 


TTAGGTAAAC 


AATACTCNAG 


360 


GCNTAAACAT 


CCCCTGGAAT 


GTGATCATAG 


ATGCATTATC 


CAGAACCTAT 


ATCANAGCTT 


420 


ATTGATAGCT 


TTATGAAATT 


GCCAGGCATT 


GGGCCANAGA 


CAGCCCAACG 


TCTGGCTTTT 


480 


CATACCTTAG 


ATATGAAAGA 


AGACGATGTT 


GTTCAGTTTG 


CCAAAGCATT 


AGTAGATGTT 


540 



713 



WO 97/30070 



PCI7US97/02318 



AAGAGAGAAG TAACATATTG TAGCGTATGT GGTCACATTA CTGAAAATGA TCCATGTTAX 600 
ATTTGTGAAG ATAAACAAAG GGATCGTTCA GTTATTTGNG TTGTGGAGGA TGACAANGNT 660 
GTCNTNGCTC GTGCC 675 

(2) INFORMATION FOR SEQ ID NO: 781: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781: 



GTCGCATCAG 


ACGCACTTGC 


AATGTTACNA 


GTGACNAGCG 


AAATATAAAG 


AAATCCATGA 


60 


CCATGAAATC 


GTTATTGTTA 


AAAAAGATGA 


AGTTATTATT 


AAAGATGCAG 


ATGGAAACGT 


120 


TGTAGAACGT 


GATTCATATA 


TTGCTGAAAT 


TGATGCATCA GATGCTGAAA 


AAGGTGTTTA 


180 


TGCACACTAC 


ATGTTAAAAG 


AAATTCATGA 


ACAACCAGCA 


GTAATGCGTC 


GTATTATTCA 


240 


AGAATATCAA 


GATGCAGAAG 


GTAANTTGNA 


AATTGGATCA 


AAGACATNAT 


CAATGANGTT 


300 


AANGAAGCAG 


GACCGCATTT 


ACGGTTATTG 


CAGCAGGGTA 


CAAAGCTACC 


CATGCAGGGT 


360 


TTAGGTAGGG 


TAAAGGAATT 


TTTTAGGAAA 


AAATGGGCTG 


NCGTACCAAC 


TGAAGGTACA 


420 


NNGTTGCATC 


AAGAGGTTGG 


GCTTACAAAC 


ATGCCATTAT 


TATCTTGAAA 


AACCATTGTT 


480 


CCGTTTATAT 


TTCTCAAATC 


AGGTGGAAAC 


AGGAGGTAGG 


CCGNGCAGGT 


ATTAGTTGGA 


540 


NACTTAATAA 


ATTAGGTCAT 


AAATCATTAA 


CAATCACTTA 


ATGGTGGAGG 


TTCCAACTTT 


600 


ATCACGTGAA 


GGAGACCCAC 


ATTGGTTA 








628 



(2) INFORMATION FOR SEQ ID NO: 782: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:782: 

ANTTTCCCCT TGNCCACCNA AGANATAGTG GATAANAAGA NCCCTGAGAA GAATAAGAAT 60 

GTTTCTNNNA AAAACACTCG CAACTCTGNC CATATATTGG TCGTATATCG AANCTTTCTT 120 

TACCCCACAG NTTGTGATAA TTAGCAACAA TTTTATCAGA GTTATATNTT AAATGGGGGT 180 
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CCTCTCAAAC 


CCATTCATCT 


AATAGGTAGT 


TTGTNTCACT 


AAGATGGTAA 


TAGTGTTGTQ,. 


240 


TNAATATGTG 


NATAGGCTTT 


ACGAGCGTGT 


ATTTGCTCCA 


TCATCGNCAT 


AAATGAATAA 


300 


ACTGCTTTTT 


TCCTTAAGTC 


AGTCGTATGT 


AGCATAACTA 


AAGGCATGCC 


ATCATCTCCT 


360 


NGATGGGTAT 


CTAAGCCTGC 


TAAACCAGCT 


AATGCTTTTT 


TAAAGGGGTC 


TGGTTCAGCT 


420 


TCNGATAANG 


NCTTCCAACG 


GGCAATGTCT 


TTTGATACTT 


TAAATTCTGT 


TTCAACCCAC 


i OA 

4B0 


ATTTGAGATA 


TATTTTGTCT 


CCAAAACATA 


TTCGTCATAT 


CTTCTTGTGT 


GTTCCAATTA 


540 


ACAGCTATCA 


TGAATATTAA 


TTTCCTCCTG 


TAATGTTACT 


CAAATAATTG 


ACAAAATTAC 


600 


AGACATAGCA 


AGGCTTAATC 


ATCTGCTAAT 


TCAAGACCCA 


GATGATTAGC 


TTGTTNGGCA 


660 


ATTTCAACAA 


TTTAATNGGT 


TAGATTAGAA 


CAACTTGGTC 


ATTCTTCTTA 


CACTTAATAA 


720 


TTATTCTTAG 


T 










731 



(2) INFORMATION FOR SEQ ID NO: 783: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 83: 



GGCACGAGAC 


GTATGATGCC 


GGCTCAAGAA 


CCACCTTATA 


CACGTTTAAT 


CGATTCAGCT 


60 


AGAAGACAAC 


CTGAACAAAC 


TGAAACGTAT 


GTCAAAGGTT 


CAGTCGTTGG 


TTTCTTTACA 


120 


CCAGAATTAT 


TCCATGGTAT 


CGGATCAGCA 


GGATTTCATG 


TACACTTTGC 


GAATGATGAT 


180 


CGTAACTTTG 


GGTGGACATG 


TCTTAGATTT 


TGAAGTAGAA 


GATGTTAAAG 


TAGAAATCCA 


240 


AAATATAGAA 


ACATTTGAAC 


AGCATTTTCC 


AATTCAAGAT 


AAAGATTTCA 


CTAAAGCAAA 


300 


TATTGACTAT 


AAAGATATTG 


CAGACGAAAT 


TAGAGAAGCT 


GAATAATGAA 


TCCAGAAATA 


360 


TAATGACGGT 


TTATGAAAAT 


TGACTTTCAT 


AATGCGCGAT 


TTAGAAATGA TAGTTTGGTA 


420 


AATATTGATT 


AACCATGACT 


ACAATAGACA 


AATATTATTA 


TTATTACGTC 


TTAGGTAATA 


480 


AAATAAATCC 


CTCACTATTA 


GCAGTAGTGA 


GGGGATTTAT 


TAGGGTTCCA 


GATAT 


535 



(2) INFORMATION FOR SEQ ID NO:784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:784: 



GAATTCGGCN 


CGAGCTTACN 


AAATAATTTA 


NNGACCTGAT 


TGGCGTTGAA AATCTCCAGA 


60 


TTTACCGCCA 




CAAGGTAGGT 


TTCGCCAATA 


ATCATACCTT 


TATCAACTGC 


120 


TTTCGTCATG 


TCGTAAATGG 


TTAAAGCCGT 


TGCTGATGCA 


GCGGTTAAAG 


CTTCCATTTC 


180 


AACACCGGTT 


TTGGCAGTTG 


TAGAGACAGT 


TGTTTGAATG 


TTTAAAGTAT 


AAAGGGGTGC 


240 


ATTTGTTTCA 


TCCCAGCTGA 


AGTGAACATC 


TATGNCAGTC 


AATGGTAATG 


GATGGCACAT 


300 


CGGAATAATT 


GTTGATGTAT 


TTGTGGGAGG 


CATAATACCA 


GCGATTTGAG 


CAGTGNTCAA 


360 


CTACATAGNC 


CTNGGTATTG 










380 



(2) INFORMATION FOR SEQ ID NO: 785: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 85: 



GAATTCGACA 


AATAAACAAC 


TTGAATGAGC 


TTGCTTTAAT 


GTTATGTTTT 


ACGTAATTTT 


60 


TACAATTGAT 


GAGGAAGCAT 


TCCCTTTAAT 


AATTAGGAGG 


TCAAGACATG 


ACAAAATTTA 


120 


TTTGTGTAAC 


AGGTGGCGTA 


GTTTCATCAT 


TAGGGAAGGG 


TATTACAGCA 


TCTTCTCTAG 


180 


GTAGATTATT 


AAAAGATAGA 


GGTCTAAATG 


TAACAANTCA 


AAANTTCGAN 


CCATACTTAA 


240 


ATGTTGACCC 


AGGTACAATG 


AGTCCTTATC 


AACATGGTGA 


AGTATTCCGT 


AACGGATGAT 


300 


GGTGCAGGAA 


ACTGACCTAG 


ACTTAGGACA 


TTACGAN 






337 



(2) INFORMATION FOR SEQ ID NO: 786: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786: 



GAATTCCTAA CGAATTAAAT TTTTACCTGC TTTATCTTTT TGGCTTTGTA GTCTTAAAAC 
TTCTCCAACT TTATCAAGCG AAGCAGGTAA GCCAATACGC ATTGAATTAA CCATTGTGCA 
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AATCCATTCT TCAGGTGGCA TCTGTTTATT AAAATGTTTA GCAAGACAAG TTCTTTCGAA. 180 
ATTAGCATTG AATGCATACT TT 202 



(2) INFORMATION FOR SEQ ID NO: 787: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOKOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787: 



GGNACGAGGT 


GGCATTTCTA 


AGGAAGAAGC 


AACAAGACGG 


TTATTAAACA 


ATGGTGACAT 


60 


TGTTAAACGA 


GCAATCAGAG 


ATAGACAACC 


TTAGGAGGGA 


TTTAAATGAC 


CAAAGAACAA 


120 


CAACTTGNAG 


AACGAATTAT 


TGCTGCAGNA 


GGTGGTATGG 


ATAATATAGA 


TAGTGTCATG 


180 


AACTGCATGA 


CACGTGTGCG 


TATTAAAGTA 


TTAGATGAAA 


ATAAAGTAGA 


TGACCAAGAA 


240 


CTAAGGCATA 


TTGATGGTGT 


CATGGGTGTT 


ATACACGATG 


AACGCATTCA 


AGTTGTGGTT 


300 


GGCCCTGGNA 


CAGTCAATAA 


AGTGGCTAAT 


CATATGGCGG 


NATTAAGTGG 


TGTTAATCTA 


360 


GGTGACCCAA 


TCCCACAACA 


TCACAAATGA 


TAGTGAAAAA 


ATGGNCTTAT 


AAATCATATT 


420 


GCAGGTTGAT 


AAAGCAANGG 


CGAATNNNNG 


AAGCACATTC 






460 



(2) INFORMATION FOR SEQ ID NO: 788: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:788: 



GGCACGAGCG GTAGGTTCAT CCGCTAAAAT AATCGACGGA TTCGTATATA ACGCTTTGGG 60 

CTATCGCCAC ACGTTGTTTC TGACCACCTG AAATTTCTGA AGGTAACTTA TTAAGCAATG 120 

AAGTTAGACC TAATTGTGAC ATAAGTTGCT GATAGTCTTC ATTAGACATA ACATTCTTAT 180 

TTTTCTTTTT CAATAATGTA AATTGTTGCT TTACCGTTAA AAATGGTACA AGGTTTGTAG 240 

CTTGTAAAAT AAAACCTATT TCAGACATTC TAACTTTTGG CAATGNTTTT TGCTTCATTG 300 

TCGGTAATAT CTTGGGTTAT TGATTAAAAT TGTGCCCAGA TTGTCGGNGT TTGGTAAAGC 360 

AACCTGCCAT AGGTTAGGAA ATTGTACTTT NNTACCAAGG AGGCCAGAAG GGTCCCAACC 420 
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CAATGCTTAT TAATATCAAC CTTTATTTAT CTCAAAAATT GGGTTTCTTT TAACCCGCTT. 480 
CAAT 484 

(2) INFORMATION FOR SEQ ID NO: 789: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789 : 



GAATTCATAG CTTGATTATT TTNTGTTATA GGACTAGAAT ATACACATAT TATTAGAGCA 60 

TCTGTGNATT TTATCTCAAG AAGCGAGGTT AATGAACAAT GAATATGCAT ATTTTATATN 120 

ACTTNCGANC TAAACATAAT TTAGAAATTG ACGANTTAGC ACAGCAATTA NNTGAGAAAT 180 

ATGGCACTAA ATATGAAGCA CATCANATGG GGGAATGGGA GAATCATCAT CATGACCCTA 240 

AATTTANAGA TGCCATGCAT TTAGNTGACT TCTGTGGATG CACCATATGA AATGTTTNTA 300 

GAAAGTAAGG TTAAAGAATA TCAGAAACAT TTAGAAGAAG TCG 343 

(2) INFORMATION FOR SEQ ID NO: 790: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790: 

GGCACGAGAC GTTGGGCGCA ATCTGGTTTT ATCTCTGCTA AAGGTAAGGA AACACCTAGN 60 

AAATTTAATG GCATTTAAAG ATGGAACAAT TAATCCTAGA AAGAGTAATC AACTTAAAGA 120 

TTATGTGTTT ATTGATGACG GATGGGCGAA ACATGGAACT TATTGTGTTG TCAGACGTAT 180 

TCAAATACAC ATTGAAACGT GGGATCGTAC TGCACTGGAA GAACAAGAGG CTACATTTGG 240 

GTCGGGAAAC GACATAGTGG GTGCGCCGTT AACAGGTGGG AAAGAGTTTG GATGAAATTG 300 

ACTTAAAAGC GAAAGATAGT CATGGCGAAG TATTATTATT GGATAAAAGA TGACCCATAC 360 

GGAGACTTAG CGAAAGAAGC AAATTCGTCA ATTTTACGTT GGGAGCCTTT ACAATGTTGG 420 

NTGGGACGGA TTTGACGCAC AGGGTTAACT TTCGGAACAG GCTTGGTGGG TCCAATGGCT 480 

T 481 
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(2) INFORMATION FOR SEQ ID NO: 791: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791: 



GAATTCCTAA 


TTGACTTACC 


TGAACCCGTT 


TCACCACTTA 


AAACAGTTAA ACCATCAGAA 


60 


AATTGAATTT 


CTAATTCTTC 


AATAATAGCA 


AATTGCTTGA 


TTGATAAGGT 


TTGTAACATA 


120 


AACTCATCGC 


ATCCTTATAA 


CAAATTGAAA 


ATTCTTGACT 


TGATTTCATC 


ACTTGCCTCT 


180 


TTGCTTCGAC 


AAATAATTAA 


ACAAGTATCA 


TCACCACAAA 


TTGTGCCTAG 


TAOTTCTTCC 


240 


CAATTGATTT 


GGTCTAATAT 


AGCTCCAATA 


GATTGTGCAT 


TACCAGGTAA 


TGTTTTTAGA 


300 


ACAAGTAAAT 


TATCAGTACC 


ATCTATATTA 


ACAAAGGAAT 


CCATTAAATA 


ACGTCCCAAT 


360 


TTTTCTTAAA 


GGATGGAATT 


TTCTTATCAT 


TTGGGTAAAC 


TTATAAACAT 


ATTGGACTGA 


420 


AGGGTATTGG 


GTACTTTAAT 


AAGTGGTAGG 


TCTTTTAATA 


TCACGAGAAA 


CAGGTGGTTG 


480 


GAGTGACATT 


TAAATCATAT 


CGGNTAATCG 


GNNTACTAAT 


NCACTTGG 




528 



(2) INFORMATION FOR SEQ ID NO: 792: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792: 



GAATTCCTTT 


TGNAGCAGNA 


TGGTTGATGA 


TGCTCTACAT 


ACCGAAATGA 


TTGTTGATGG 


60 


CACACATTCT 


CATCCGGCAT 


CGGTTGCAAT 


TGCTTACCGT ATGAAAGGTA ATGAACGTNT 


120 


TTATTTAATT 


ACCGATGCAA 


TGCGTGCAAA 


AGGTATGCCT 


GAAGGAGAAT 


ATGATNTGGG 


180 


TGGACAAAAA 


GTAACTGTTC 


ANTCGCAACA 


AGCACGTCTT 


GCAAATGGTG 


CGCTTGCTGG 


240 


TAGTATTTTA 


AAAATGAATC 


ATGGGTTACG 


TAACTT AAT A 


TCATTT AC AG 


GTGATACATT 


300 


AGNTCATTTA 


TGGCGAGTAA 


CAAGTTTAAA 


TCAAGCCATT 


GCATTAGGTA 


TTCGATGATA 


360 


GAAAAGGTAG 


TATTTAAAGG 


TAAATTAAGG 


ATGCAGGATC 


TTTGGTTATT 


CTTAGATGAT 


420 


GATATTGAAT 


TGTAAAATCT 


TACAAATAAA 


CAAAGCAAGG 


CTCACACATT 


TAGCTTAATA 


480 
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AATTATCAAT AATTAAATCG TATTGCAAAT AGGATTTTAA TCTTTTGNTA CAATNAGCAC 540 
T 541 

(2) INFORMATION FOR SEQ ID NO: 793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 793: 






GAATTCCTCT 


AGGGTATTAA ACATAGGTAA 


ATCAATTTTC 


TATCGCCAAT 


AGTATAAACA 


60 


TCATCCATAT 


TAATCTGAAG ATAGCTACAT 


AATTCTTTAA 


CAGCATAGCC 


TTTATTTGTA 


120 


TAAGGCGCCA 


TAAACTCTAT TAAATATGGT 


NTGTTGTGAG 


GTTTCTATGT 


TTANTCATTT 


180 


AATTTNNCAT 


TATATAAATC GAATAATTAT 


AAAAAGTCAT 


CATACTTTTT 


ATTTTTTAAA 


240 


AATATNTGAA 


AATTTAATAG TAATCCAATT 


ATAAGATCAA 


ATATCATTAT 


CATTATAATC 


300 


AAGGCAACTA 


TAATCACTGG TGACACTCGG 


TTACATAACA 


AGCTAGGTAA 


TCTCCAGATA 


360 


AAATTGGACA 


TTTNCAACCA ANTCCCATAA 


CTTTAACAAT 


ATTTACAAAG 


GTAGGNAAAA 


420 


GAATTAATAA 


TACCTATACT TGTAAATTAA 


ATTGGCAAGG 


GTATATT 




467 



(2) INFORMATION FOR SEQ ID NO: 794: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794: 

GAATTCCTAA TTAATTAAAA TTTTANCGGA TTTCAAGAGG NCTTTTTATA ATGACGCCTT 60 

ATTTGNAAAA TGTCATATGT GGTTTAGCCT CTTTATAATA ATTTAATCTG TTCTCAAGTG 120 

TGCCAGTATG TAATTCTAGC TTATGACCAT CAGGGTCAGT AAAGTAAATT GATTGTCTAT 180 

CTCTAATATC TCTAACTCTN CCNCCTAAAA TATTCACGTT ATTATCTTTT ANCCTCTGAT 240 

GCCAATATTT AAATTCGCTG TCATCTATAG TGAAAGCTAT ATGTGTATAT GAAAAGTGAA 300 

TTTCATTACG TGGGTATATC TNCNTCTTCA CCATAAAGCA AANCCATAGG GGCTGCAAAG 360 

CTCAACAANA AGCAG 375 
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(2) INFORMATION FOR SEQ ID NO: 795: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795: 



AATGTCTCAT 


AAGTACCTTG 


CACAATACCT 


TGAGNAACCA 


ATATATATCC 


AACTACCGGC 


60 


TGTCATCTGT 


CCATACATGA 


TTAAACCTTT 


TTTATCTAAT 


TCATTAAAAT 


GATCCCAGTT 


120 


TGCCCATTCA 


GGCACTAATA 


CTGAATTTGA 


AATTAATACA 


CGTGGCGCTT 


CTTCATGTGT 


180 


TTTAAATACA 


GCAACTGGCT 


TTCCTGATTG 


TACTAACATT 


GTCTCATCTG 


ATTCTAATTC 


240 


TCGTAACGTT 


TTCTCTATTG 


CTTCAAAAGC 


TTCCCAATTA 


CGTGCTGCTT 


TTCCAATACC 


300 


ACCATAAACA 


ACTAAATCTT 


CTGGTCTTTT 


CAGCAACTTC 


TGGGTCTTAA 


ATTGTTGGTA 


360 


TTAACATTCT 


TAAGTACTGC 


TTCTTGGTTC 


CCAACCCTTT 


ACACTCAAAT 


ACTCAAACCC 


420 


TTTNTTTGGC 


TTGGAATTTT 


NTCTTCAATA 


AAATTCGCTC 


CTGGTCTTTT 


TAAGAAGGTT 


480 


AATTCCACTA 


AATTTAAACG 


CTTACATTAT 


TATCTTCAAT 


ATTCAATATT 


GTATT 


535 



(2) INFORMATION FOR SEQ ID NO: 796: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796: 



ATTAGTCCGA 


TTTGACATAG 


ATCCCGAAAA 


ATAATGGCTA 


ATAGCCACTT 


ATAGTCGAGA 


60 


TTATGAACTA 


TATAATTTAT 


TTATTATTAA 


ATGGATTATA 


TACTTATTGT 


GTTTCTTAAT 


120 


ACCTTTTTAT 


ACTTTAATAT 


CTTGAATTTT 


TTCTAAATCA 


TTACTAATAT 


CTTTTAATTC 


180 


GTCTGTGGCA 


TTTTTCACCC 


AAGCCCCTTG 


TGCCGCATTT 


TTAAAATCAT 


CTTCAAAGTT 


240 


TGGGAAATGA 


CTTACCATTT 


GGTCAAGTAA 


TATTTTCTCA 


TCACTTTTCA 


GCTTATTTGG 


300 


ATGATCATGA 


TGCATACGAA 


CATGTTCAGG 


CGCAATAACC 


TCACTAATCT 


GTTCTTTTCC 


360 


TTCATGTGGT 


CCAATACTGC 


TAAATCTGGT 


AAACCACTAA 


AATCAAGTCC 


CACGCTTATT 


420 


CATCTCCTTC 


AATTCTGTCA 


GATTGTTTTA 


GATATTCTCT 


TCTTAACTCA 


TCTTTATAAT 


480 
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CTTCTAATTT CATTTCGGGA TGTCGGATAC GTGGATTCGT TCTCTATCGA AAGATTCTAT 540 
AATACGTGTC ATCCGGATCA AAGCGGCTTC ANTATNATTC AATTTATGGT NGGAGATATA 600 
GGAATGATCA AACATTGGGT CAAAATTCTT GGG 633 

(2) INFORMATION FOR SEQ ID NO: 797: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:797: 



CTGTGATTTA 


TCATTCGATT 


GCATGATTAG 


AGGGAGGGAT 


TAAACGTGAC 


ATATCATGAG 


60 


CGTGTTTTAG 


CATTAAGAGC 


AGANAGTAAN 


AGNNCCGCAT 


TTGATTTTCG 


ATTCGGNAGA 


120 


TTTATTTACC 


NAAGAAGAAT 


GGCTAAGTAT 


GTCTCTTGCA 


GAAAGACNCA 


AAGCTGANAA 


180 


AGCATTTCGA 


CACGNAGTTA 


AAAATATGGA 


CGATGTAAGA 


ATGCCCTTCT 


CAAGTGTCCA 


240 


TGACGCCCCA 


AGTAAAATTA 


TATAATGTTG 


TATATTCTGA 


TANCGGCATT 


AAACGTAATT 


300 


TTAAACNAGC 


TGAAAATGAA 


GGATTCTAAT 


ATCATTTCGT 


TTATATATAG 


CAGACATGAT 


360 


AGAATTTTAT 


ATGTAAATCT 


TGTAGGTAAT 


CGTTTTNNAA 


ATAATATAAG 


TATGACTAAA 


420 


NCCACATCCA 


ATATAGGACG 


GGGCTTTTAG 


TATCGTTCCA 


GATTTGTGGG 


GAGTTAGGGT 


480 


TTACTTAATT 


AAAGGGGTCT 


GCCCTCAATT 


GCTCACCAAC 


GAGGGCAAGT 


TACATCAATC 


540 


ACACTTCAAT 


TGCCGCCGAC 


TAGGGGTAGT 


AATCATTGGC 


AATAAGAACT 


AGTTAGTGAC 


600 


TAACGGATTT 


ACGTTCCATA 


AGCAAAGTGA 


TACAAACGCT 


CAGCATCAAT 


GTAAGGCATA 


660 


GAATCAATAG 


TTAGGTAACC 


ATCTTTGATG 


TCACCAATTA 


ATCCTTGATT 


TACACCAGTG 


720 


TTCACGTAAA 


TATCATAGCT 


ACTTACGTCT 


GAGAAAATAA 


GCGCTTCAAT 


CTCTTTTCTC 


780 


ATAATCATCA 


CTCCAATGTT 


TATATATTAT 


TTATATAAAC 


TCTCTCGTCT 


CTCTCTATTT 


840 


ATTAACTTCT 


TACAAGTCTA 


ATATTACATG 


AATTTCCAAA 


TAAATAAAGA 


GGTTTGTGAT 


900 


GTATTTCACA 


AATTTATCAC 


TATTTTGGAA 


AATTAAGAAA 


TAGTTAATTA 


TATAAAAGTT 


960 


TAATAAGTCA 


GAAAATATGA 


TAAAATGTAG 


ATGTTCTTTA 


GACATTAAAA 


GCTTCTAACA 


1020 


TGATACGGAA 


TATGAGGTTC 


CTGTATCACA 


TTAGAAGCTT 


TTTTGTTGCG 


GTGCTTATAT 


1080 


TAATAATGGC 


ATGAATGAAC 


GCATGCTGTC 


TTAGTTTCAG 


GAATCGGCTA 


GAATATTAAA 


1140 


CTTCCGTTCT 


TTGAATATTT 


NTTAATTTCT 


TACTAATATC 


ATCCAGTTCA 


TTCTTCGCCT 


1200 


TTTTAACCCA 


ATCTCCTTGA 


GCTACATTAT 


CAAATTCACC 


TTTAAATTTT 


TTGANATGGC 


1260 


TACCTATATT 


TCACAGCTCA 


CTTTAGATAT 


CGAACTTGTT 


TCTTCAGAGT 


TACTTGTTAT 


1320 


ATTACCAGGT 


TTTATTTGAT 


ATTCAGTTTG 


ACTCATAACT 


TGTATCCTCA 


CTTGAAGTAT 


1380 


AGTGTACAGA 


ATCTGTAAAA 


TAAATGTTTT 


CTTCTTTATT 


TTCTACAAAG 


GTAAATTCTA 


1440 


GATTTTTTGA 


CCCTACAGAT 


GAGCCTTTTA 


AATCACCATC 


NCCTTTTATT 


AAG 


1493 



(2) INFORMATION FOR SEQ IO NO: 798: 

722 



WO 97/30010 



PCT/US97/02318 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798: 



GAATTCCTGA 


TAGATATGGT 


TTTAAACGTG 


ACTTCAAGCT 


ATATGAATGC 


GATGACTGTT 


60 


CAGCATGTTC 


TTTGAGGCAA 


CAATGCATGA 


AACCAAATTC 


GAAATCCAAT 


AAGAAAATCA 


120 


TGAAGAATTA 


TAATTGGGAA 


TACTTTAAAG 


CCCAAATTAA 


TCAAAAGCTT 


TCTGAACCAG 


180 


AAACGAAAAA 


AATCTATAGT 


CAAAGAAAAA 


TTGATGTAGA 


GCTTGTTTTT 


GGGATTTATG 


240 


AAGGNTATTT 


TGGGTTTCAC 


TCGAATGTCA 


GTTCGAGGAA 


TTAAATAATG 


TTAAACGAGA 


300 


GCTAGGTTTT 


GGNTTTAATG 


GCACTTAATA 


TTAAGGAAAA 


TAGCAGGTCA 


ACGAGCTGGT 


360 


ACAATTATTA 


AAATACATAT 


CAAAAAAGCT 


GGATTTCTTA 


TCAAANTTAA 


TTAATAGGAA 


420 


TCAACTT 












427 



(2) INFORMATION FOR SEQ ID NO: 799: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799: 



AATATAATTT 


TGTTGGAATA 


TATTTCGTAN 


AAAGTGGGGT 


NATTTAATTT 


AATTGATTAA 


60 


TTCTAATATA 


TGATAGTATA 


ATGAAATGTA 


GATAGGTATT 


TAATTTAACC 


AGAGGTGAAA 


120 


TTGAGATGTG 


GAATTTTATT 


AAATGTGTGT 


TTAAATTCGT 


ATTTAGCTTA 


GTTGCTATTA 


180 


CAACATTAGT 


TGCTGGTGTT 


GGTGTAGTAG 


CATTTGCTTA 


TATCTTTAAA 


AAAGATTTTG 


240 


AAGATATTGA 


AAGAAAAACT 


AAAGAAATTA 


TTTCTGATAT 


TGAAAGTAAA 


AATAACTAAT 


300 


AACATTTAGA 


GGCTGGGACA 


TAAATCCCTA 


AAAAACAGCA 


GTAAGATAAT 


TTTCAATTAG 


360 


AAAATATCTT 


ACTGCTGTTC 


TCTATTTATA 


CAATACTTCG 


TAGTTGATGG 


CTTCGCTTTC 


420 


CTAGGTTGCC 


GTCTCAGCCT 


TGGTCGTCGN 


CTGGCNCTGC 


TCCCTCAGGA 


GTCTCGCCAT 


480 


TAATACNNCG 


TATTNACATG 


TNATTTTACT 


NTTGANATNC 


TTTNAAAAAA 


TAAGACACTT 


540 


TGCCCAACTT 


GCACATAGAT 


GTAAAATTCA 


ATATGATGAA 


GTTTCTGTGT 


TGGGTCCCCT 


600 


CTTATAATTT 


AATAAATACC 


ACTAAACTAA 


ATTAACGAGG 


TGCCTTATGT 


ATAAAAATTA 


660 
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TAACATGACC CANCTACACT NCCAATAGAA ACTTCTGTTA GNATCCCTCA AAATGATATL 720 
TCNCGATATG TTAATGAAAT TGTTGAAACN ATACCTGATA GC 762 

(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 



CGTAATTACA 


TTTTAGGCTT 


GATTTTCTAT 


CGCTTCTTAT 


CTGAAAAAGC 


GGAACAAGAA 


60 


TATGCAGATG 


CCTTGTCAGG 


TGAAGATATT 


ACGTATCAAG 


AAGCATGGGC 


AGATGAAGAA 


120 


TATCGTGAAG 


ACTTAAAAGC 


TGAATTAATT 


GATCAAGTCG 


GTTACTTCAT 


TGAACCACAA 


180 


GATTTATTCA 


GTGCGATGAT 


TCGTGAAATT 


GAAACGCAAG 


ATTTCGATAT 


CGAACATCTC 


240 


GCAACGGCGA 


TTCGTAAAGT 


TGGAAACATC 


AACACTAGGT 


GAAGAAAGTG 


AAAATGACTT 


300 


TATCGGACTG 


TTCAGCGATA 


TGGACTTAAG 


TTCAACGCGA 


CTAGGTAACA 


ATGTCAAAGA 


360 


ACGTACTGCA 


CTAATTTCCA 


AAGTTATGGT 


TAATCCTTGA 


CGACTTACCA 


TTCGTTCACA 


.420 


GTGACATGGA 


AATTGATATG 


TTAGGTGATG 


CATACGAATT 


CCTAATCGGG 


CGCTTTGCGG 


480 


CGACAGCGGG 


TAAAAAAGCA 


GGCGAGTTCT 


ATACACCACA 


ACAAGTATCT 


AAGATACTGG 


540 


CGAAGATTGT 


CACAGACGGT 


AAAGATAAAT 


TACGTCACGT 


GTACGACCCA 


ACATGTGGTT 


600 


CCGGTTCATT 


GTTGTTACGT 


GTTGGTAAAG 


AAACGCAAGT 


GTATCGTTAT 


TTCGGACAAG 


660 


AACGTAACAA 


TACTACATAC 


AACTTAGCAC 


GCATGAATAT 


GTTATTACAT 


GATGTGCGTT 


720 


ATGAGAACTT 


CGAGATCCGT 


AATGATGACA 


CATTGGAAAA 


TCCAGCCTTT 


TTAGGCAATA 


780 


CATTTGATGC 


GGTTATTGCG 


AACCCACCAT 


ACAGGTGCGA 


AATGGACAGC 


AGATTCAAAA 


840 


TTTGAAAATG 


ATGAACGATC 


AAGTGGGTTA 


CGGGAAGCTT 


GCGCCAAAAT 


CCAAAGCAGG 


900 


CTTTGGCTTT 


ATTCAACACA 


TGGTACATTA 


CCTAGACGAT 


GAAGGGTACC 


ATGAGGCAGN 


960 


CTCCCNCATN 


GCGTTATTNT 


TCCCTTNTGC 


CNCAGAAGGG 


GTGATTCNCN 


CGCCTTTTTT 


1020 


TTTGAAGG 












1028 



(2) INFORMATION FOR SEQ ID NO: 801: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801: 



AGAAAANCCN 


ACATCCjTuAI 






AAACATCTTO 


AttfSACAAGfif* 


60 


GGAATCCATA 


Jk TV Of* TV 1 wiup 

CCAAGCAl 1 1 


r*r* t Zk o & r*r* r* Zk 

iA> 1AuAUV.Ua 




TTfyi A AMPfi A 
1 1 vranAHLun 






AATCTCCGCT 


TTCATCTTCA 


TCACCAC-L. 1 1 


IV Zk Zk Zt/YTYSTT 


1 1 v,A IL1 X i 1 


1\- 1111 1 V-/i 1 


1 nn 

lOU 


CATTTTCACC 


NCTGTTAGCT 


TTTTCGTAGT 


TTTTAGTAAT 


(JAVjtoTAl ITJlj 


TL I AA 1 1 vJAQj 


O A ft 

Z4U 


GAtItVTCTGA 


TGGTTCTTCA 


CCTAACATAA 


1\~L.ItL.AI~1. IT- 


A 1 11.1.1 JTjlA 


AA luAALLAb 


1 Aft 


AACTTACAAG 


TTTGTCAATT 


GCTTCAGCAT 


ATTGAAGTGG 


GTCTTTTTTA 






CAATTTCTAT 


TCTTGTATCT 


1 rCAAA TAv. A 


TGCTTTGTGT 


TATTGAGTTT 


rrVY"l» rVT> TV Am 

CViCVj 111 AA 1 


Jl 1 ft 


TCGTTCTGAA 


IV l 1 11 1 TAA 


TAAAGGTGTT 


AAAUAoAAL. 1 


1 V Iv-AAAA 1 A 


V-AAItCVj Ivj 1 1 


Jl O ft 

4oU 


TTTTTCCAAA 


TCAAGCTGTT 


TCTCCGTAAA 


TCCAAACCTG 


GAGGTAATAC 


CCAATCATCC 


C il ft 

b40 


AAACGCAACA 


■i^M"Tl J i'ATTG 


CATCTCTCAT 


TAGCTCACTC 


X % WWW* Hi TV /"»T\ TV 

AATTCAAGAA 


iv iv iv t~*f+r% iv m/ B *f f \ 

AAAGGCATGT 


f Aft 


TACTATTCTT 


ACCACCATTA 


GATAATTCCT 


C AT AATC AAA 


ACCTTCTATC 


AAAGGCGCGA 


£ ft 
660 


TTGCTAGTTG 


ATTTTTATTA 


AAAGTATTGA 


ATAATTTATT 


TGTAAACGGC 


TTGTAATTTT 


720 


TCAATATTCT 


TTTCGTCATA 


TGCGCTAGAG 


GCAGATTTCA 


AAATCCCTCT 


TATTTGATAG 


780 


TTTTTTAATT 


GTGCACCTAT 


CATTCTTCCG 


AATATTTTCC 


CGTAATCTTC 


GAATAGACTT 


840 


TCTACAAAGT 


GTGTCACTTT 


ATTGTTGTTG 


TACTTTAAAT 


ATATGACCTC 


TTGCATTGTG 


900 


AAAGTACGTT 


GATAAGTATA 


ATCTTTAACC 


GTTACATCTT 


TGAATATATC 


ATCATACaaA 


960 


GCGTACTCTT 


CTCTGTAAAA 


GCTATCTGCG 


ATAAGTAATT 


CTTTGCTGTC 


ACTTACTACG 


1020 


ATTAAAACCT 


CGTTATCATA 


AATTAGTTTA 


TATATAACTT 


GTTGCCAAAA 


ACTATCGCTT 


1080 


GATAAGTCAG 


TATTTGGTTT 


TATATTTAAC 


TTGTAGTAAA 


CATCATTCTT 


TT 


1132 



(2) INFORMATION FOR SEQ ID NO: 802: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 737 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802: 



AATTCAAGAA 


CGTACCATAC 


ATTTAACTGA 


GGATATCTCA 


ACATTGATTA 


CAACAGCACT 


60 


CCGGAACGAC 


AGTACAACGC 


AAAATAACAA 


CATTGGAGAG 


ACAGAAGATG 


TACTTAATAG 


120 


AACCGATTAG 


AAATGGAGAA 


TATATTACTG 


ATGGTGCGAT 


TGCACTGGCT 


ATGCAAGTTT 


180 


ATGTTAACCA 


GCATATCTTT 


TTAGATGAAG 


ATATTTTATT 


CCCTTATTAT 


TGTGATCCAA 


240 


AAGTGGAAAT 


TGGACGTTTT 


CAAAATACTG 


CTATAGAAGT 


GAATCAAGAT 


TATATAGATA 


300 


AACACAGTAT 


TCAAGTAGTT 


CGCCGAGATA 


CTGGTGGTGG 


CGCTGTGTAT 


GTTGATAAAG 


360 


GTGCCGNNAA 


TATGTGTTGT 


ATTTTAGAAC 


AAGACACTTC 


AATTTATGGT 


GATTTTCACC 


420 


GATTTTATCA 


ACCAGCTATA 


AAGGCATTGC 


ATACATTAGG 


TGCAACAGAT 


GTGATACAAA 


480 
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GCGGTAGAAA TGATTTAACA TTGAACGGCA AAAAAGTGTC AGGCGACNGA AGNGTNGAAQ 540 

GGTGTTAAAA CGGCGGTTTC TGGTGCGATT CAAGGTGTAG TTGGCTGGTT AACTGAATTG 600 

TGGGGCANCA TCCATTCTAC NCTANATCCA ATAATGGCAT GCTCACCAGT TATGGTGAAT 660 

TATACACCCA AGTTCATNAT CGATAGGCAC GTAGGTCACA TCACATCAGT AATGACTATC 720 

ATACAGCGTT AGGTGGA 737 



(2) INFORMATION FOR SEQ ID NO: 803: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803: 



TCCTGCAATA 


ATTTTTGTCG 


GTACTACGAC 


ATTTCTATAA 


GTTTGAGAAA 


TATCAGCGTT 


60 


TTTGAATACC 


TCTTTAACTG 


CTTGAATTTC 


TTCTTTAGTT 


GATATTTCAT 


TTGTATCTAC 


120 


GACATGTTTA 


AGGAATCGAT 


TCATTTTTCT 


CGGACGATAA 


TACTTACGCT 


TATTTAAAAA 


180 


TTTATCTTGT 


TTTTCAGGAT 


CCCAGTTTCG 


AATAATATGG 


GCATATTTTC 


TAAATAAACG 


240 


TTCTTCCGGT 


AATTCACCTT 


CAATAGATGT 


TGGATTTACC 


AACATAAGTG 


ATGATGTAAA 


300 


TTCAGGATAA 


CGTACTGAAA 


TATCCGCACC 


AATGATTCCA 


CCCATTTCAT 


GGCATACAAA 


360 


TGCAACTTCT 


TCAATGTATT 


AAATATTTAA 


GTAATTCAAC 


AATGTCATCA 


GAAAAATCTT 


420 


TTATTTCAAT 


GTGACGAGGT 


TTATCAAGAA 


TAGCCATGTC 


CACGGTAAAT 


CAATTAACAC 


480 


AACTTGAAAT 


GATTGTGGTA 


ATTGTGCTGC 


TAATNTATTA 


AACACAGAAT 


NATTATCAAG 


540 


TACAGTATGA 


ATCAATACGA 


TAGGATAGCC 


TTCACCTAAT 


GTACTGTAAT 


GTATCGATGT 


600 


TCCATCTTTT 


TCTAGTAAAT 


AGATCCATAA 


TTTTCTCCTT 


NGATTATCAT 


TTTTTATTGG 


660 


TAAAATGGTT 


AATAATTCTT 


CCACTGTTGT 


AATTGTAAAA 


TCAACTTCTT 


CTTCGAGTGG 


720 


CTCAATTTCA 


GCATCTTCTT 


GTTTATACCA 


TACACTAACC 


ATGCCCATAG 


NTCGTGCTGG 


780 


AGNTACATCA 


TTTAACGCAT 


CATCGNCAAC 


ATACATAATT 


TGGTCAGGTA 


ATACCCCNAG 


840 


GNGGATCAAT 


CATNTCTTCA 


AAAATTTTAG 


GATGTGGNTT 


TACGAAAACC 


AACTGNNTCT 


900 


GATGNNGGAT 


AAATNATTAA 


TAACAAGCAT 


CAANCCANGT 


GAATGTAATC 


GAAATTGGTT 


960 


AATCTTAGAT 


TTACCNTGTG 


GG 








982 



(2) INFORMATION FOR SEQ ID NO: 804: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804: 



ATGACGCCAC CACCGAGTGT ACTCAATGGA 
CTACTATAGC CATTTTCCAT AGAATGTGCA 
GCTATTTTTC CAAAGTTTTG AGCTATTTCA 
TTTACAACAT GTGCAATCGT TTCAGCTTTA 
CCTCTGCTTT ATTAATATGA TTTTCATCAG 
TATTAATGAC ACCAATATGA ACATTCAGCA 
ACAAATCATN TTGGTCATAA ATGTTATATA 
TAGATTTCAA TTTCTTAAAA CAACGCCAAT 
CAATATTTCA TCGNCAATTT GTTTAAAGTC 
TAAAGGATAT GATTGTACAA TCGCACCACC 
GAAACCCATT ATTTATAANC CGGNTTAANA 
GTGNAANATA CATATCATTA TTNCGGATAT 
ATGTGGCCAC TNTNAGACCT GCATCAGTTT 
ACANTNAACG CCACTTTAGC TCCTCCAAAT 



CCTTTGAGAC 


CACTAATGAC 


TGTTGCAGCA 


60 


GCTTTTTCAC 


TACTGTAGTA 


ATTTAATTGC 


120 


TCATACTTTT 


GACTAATGCT 


TTCCAATTCT 


180 


ACACTAATTT 


TTCCACTCAA 


CGTCATTACT 


240 


TATCGAATCC 


AAATAATTCT 


CGTATAAATG 


300 


CTTTATCTTT 


TAATGGGTAT 


CGAATGAGTA 


360 


AACTTGGTTG 


AGGGTCACCT 


TGATGATTGG 


420 


CGTATCGATA 


CTTTCATTAT 


TTAAGTTCTC 


480 


CCAATCATTG 


GATTTTTCTT 


GTCTCATCAA 


540 


ATGNAATGTC 


TTTATTGATA 


ATNCGNNTTG 


600 


AAATATATNC 


ATCTTTTNCA 


TAGCTAANTG 


660 


TTACAATNGG 


GNTAATATAC 


TCTCTCACTA 


720 


ATCTGGNCAT 


CCTCTGCTAC 


TACTCCTAGC 


780 


ATGG 






814 



(2) INFORMATION FOR SEQ ID NO: 80S : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805: 

AACAAGATTT ACCCCTCGTT TTTGAGACNA NTGCGTTTAA TNACAGCACC GGAAGTCCTG 60 

TTCCTAGAAT GTCACGTCGC CTTTAGCATA AAGACTTAAT ACNTTAATAT CTCTTAAAAG 120 

ACTATCTGAT AGTAGTGCAT TTTTGCCAGT CACCCAATTT CGAATTGTAA TTCCACCCGC 180 

TACTTAACTC AATAGCATCT AAATCGGTTA TATGATTATT TGTTCTCATC ATACTTTCCT 240 

CCCTCGATGG AAATTACATT AGTATCCGAT TTTAAACGGC CATTTCCATC TACCATTGAC 300 

GGGGAAACAT CATCTTTATA AAACTGTACA AATTTTGTGC TAGGATATGA TTGCATAATG 360 

TCATTTATAG ATTTTTGAAG GTTCCAAGCA TAGTTTTTTA CTGTTTCTCT ATTTTTTTCT 420 

TTATCATAAT AAGCAATGAA CTATGATTTC ATATTTTTTT GTATTTGTGG ATATTTGTTA 480 

ATCAAAGATT GTAACTGTTG ATTGAATGCA TCACCTTTTA AGTGCATAAC AGGTTTNNGA 54 

AGTC 54 
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(2) INFORMATION FOR SEQ ID NO: 806: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806: 



GGGGGGCCGC 


CCGAGGAGGG 


GNTTTCAGTT 


TNCAATACCG 


ANCCCATTTC 


TCCAAGATCC 


60 


GAAAGGANCG 


GAAGNAACCG 


ATTGAACCTT 


ACCGTAANCC 


CAACCCGTTT 


ACCCAATCTA 


120 


TAAGACCGTG 


CCAGTTGTTA 


AATACNATTT 


TCTGNCATTT 


CCAGGGGTCA 


AGATGATACC 


180 


NCTAAAAAGC 


CTAANNGGCC 


TTGTNACACC 


CTNGNCCAAC 


AACCTACTCT 


GAAATTTATG 


240 


ATTATCCAGT 


GGANTGAAAA 


GTTTGATACC 


CATATTGGCA 


AAGATTTANC 


CTGAGTTTGC 


300 


CAAAAGAGCA 


AGATGTAAAA 


ATTGCCATAG 


AGTTACATGC 


AGGATTTTTA 


GTTCATACAC 


360 


CATATACGAT 


GTTAAAGTTA 


CGTGAGGCTA 


CAAATGAATA 


TATCGGTGCT 


AACTTAGATC 


420 


CTAGTCATTT 


ATGGTGGCAA 


GGTATTGATC 


CAATTGCTGC 


GATTCGCATA 


TTAGGCCAAG 


480 


CAAATGCAAT 


TCATCACTTC 


CATGCTAAAG 


ATACGTATAT 


TAATCAAGAA 


AATGTAAATA 


540 


TGTATGGTCT 


AACTGATATG 


CAACCATATG 


GTAACGTTGC 


GACAAGAGCA 


TGGACATTCC 


600 


GNACAGTTGG 


TTATGGACAT 


AGTCCATATG 


TATGGGCAGA 


TATCATAAGT 


CAACTTATTA 


660 


TTAATGGATA 


TGATTATGTA 


TTAAGTATTG 


AACATGAAGA 


TCCTATTATG 


TCAGTAGAAG 


720 


AAGGTTTCCA 


AAAAGCTTGN 


CAAACTTTGA 


AATCTGTTAA 


TATTTACGAC 


AAGCCAGCAG 


780 


ACATGTGGTG 


GGCATAATAC 


GAACTCGAGG 


TTAGTCTGAA 


GTTTGTCTGA 


AGTAAGACTG 


840 


GTGGCAGGGT 


TGAATGAATG 


CATATGTCGG 


CAAGNCATTG 


CAAAAATNTC 


ACACTTAANT 


900 


CATGTCATGG 


GTGTAAAGAG 


GTGTACTTTA 


TATAAGTGTT 


TAGCGATGGC 


ATCCCNTTCA 


960 


CAGGNACAAT 


CCCCACCATG 


AAAGAGTATA 


TACCT 






995 



(2) INFORMATION FOR SEQ ID NO: 807: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY:, linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807: 
CCGGATAACC CTTCATTTGA TATATCCAAG AGGTTTGCCG AAAATAAANA TTGGTGGCCC 60 
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r^IVJl TV TV TV I^T* TV 


TTTGCAGTTT 


fTATTYSAGCC ATTGACCCCA 

IjIaI X Vj/iUvV- iunv.v.v<un 


AATTGATTTA ATAGCCCAAT 


120 


ATCCAGTAi-A 


ALlAIAluul 


APAAATYSCAC TTGATTGGGG 


TCGTCAACGC 


ATGTATCGTG 


180 


7V/~» xniqvriivr'PP 
Al_Al X lAiiV-vj 


uAv. 1 ALil Aft. I 


7xf2fyiY"3TY , AAA RfJTATRCTlAA 




x vtv_v_x\x v_ 


240 


GTATTGTTTG 


GCTTTAATAA 


TGCAGTGACA TCAATTGTAA 


CAATCGCAAC 


AGTGATTATT 


300 


GTCAGCATTG 


TCTTAGCCTA 


TATTATTGGA ACACTAATTA 


ATAAGTTTAA 


TTTAATGAAT 


360 


ATCAACTTTA 


AAATGCCGAG 


CAAAAAGAAT CATATTAaGG 


AGAGTGTTTA 


ATTATGGCAA 


420 


AGAGCTATGA 


TTATCAAAGT 


GCTTTCGATA TTATCGGACC 


AGTTATGATG 


GGCCCTTCAA 


480 


GTTCTCATAC 


GGCTGGTGCA 


GTAAAGATTG GGAATNC 






517 



(2) INFORMATION FOR SEQ ID NO: 808: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 665 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808: 



TCTTCATNTA 


ACTTTTTCAC 


TTACCATATC 


TGATAATAAT 


TCTTTTACGC 


CAAGTAAGTT 


60 


TGTAGATCCA 


CCAGTTACAA 


TAAACCCACC 


ATTTACTTGT 


GTTAATCCTA 


AATCTTGTAA 


120 


AACATCAAAT 


ACTTCGAAGA 


ATATTTCTTC 


TACACGCGCT 


TCAATAaAGT 


CACTCAAATC 


180 


TTTGTGAGTA 


TACTGTACTG 


TTTCATCACT 


ATCAACCTGT 


TCAACAGTGA 


AGATATCTTG 


240 


ATCTGAAGCA 


GAATCATAGA 


ATGCATGACC 


ATATTGGTGT 


TTAACTTTTT 


CAGCAGTTTC 


300 


ATAAGAAGTG 


TTTAATCCTT 


GTGCAATATC 


GTCTTGTAAT 


ATCACGCCCT 


GCCATTTCGA 


360 


TAGAATCAGC 


ATCTACTAAT 


TCACCGCGTT 


CATAAAAAGC 


AACTTGGGGT 


AACGTCTTCA 


420 


ACAATATCAA 


TGACACATGC 


ACCTAACTCT 


TTNNCAAGTA 


GCTTGTNAGG 


ATTGGACCCA 


480 


TAGTTATATT 


GCATCAGAGG 


TAAACATCTT 


TATACATCAA 


CAACCACATG 


CTTCTTCGGC 


540 


ATTTTATCAA 


TATTAATTAA 


AATCGATTTG 


TGGGATTAGA 


AATTACGCCT 


GCTTCCAACC 


600 


CTTAATGATT 


GTCTTTGGAA 


ATATTCCTTT 


AGGGCTTTGA 


AACCTCAATT 


TCTTTTATCG 


660 


ATTTT 












665 



(2) INFORMATION FOR SEQ ID NO: 809: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809: 



AATGGGAATA 


GATATGACAG 


AGAGCTTTAA 


TGAAACAATG 


GAAAAGTTTA 


ATACAAGAGA 


60 


TAAAAATCGA 


TTCGAAAGAA 


AGTGAATTTA 


AAAATATATG 


CTTTATTTGG 


ATTTGATTAA 


120 


GATTAAAAAC 


ATAGACCAAT 


GAATTGCATC 


GCATGAAATT 


ATATTATTTT 


TAAAGTAAGT 


180 


AGAACTAGAT 


AAATGATTAT 


ATATACAACA 


CCTGTTTTTT 


AGATATTTGA 


AATTATAAAC 


240 


TCAACATTAC 


TATAAACTTA 


AATTTAAACA 


ATTATCTACG 


CATCGAGAAT 


AATAGAAAGA 


300 


GAAATAGAAA 


GAAAGGATGG 


GCAAGCGAAT 


GAAGATAGGT 


ATAACATGTT 


ACCCCCCCAT 


360 


GGGTGGCTCT 


GGAATTATTG 


CACAGAATTA 


GGAATTAAAT 


TAGCAGAGCG 


AGGTCACGAA 


420 


GTACATTTTA 


TCACTTCAAA 


ATATCCCCGT 


TTAGAATAAG 


AAAACCATTG 


CCGAATATGA 


480 


TTTTTCATCA 


AGTTGAAGTG 


AATCAATATG 


CAGTATTTCA 


ATATCCACCG 


TATGATATTA 


540 


CGTTAAGTAC 


TAAAATCGCT 


GAAGTTATTA 


AAGAATATGA 


TCTTGATTTG 


TTGCATATGC 


600 


ACTATGCTGT 


ACCTCATGCA 


ATTTGTGGAA 


TTTTAGCTCG 


TGAGATGTCA 


GGTAAAGATA 


660 


TAAAAATTAT 


GACAACGCTA 


CACGGGTACT 


GATATTACTG 


TTNGGGCTAT 


GATCATTCAC 


720 


TCCAAGGTGC 


AATTAANTTT 


GGCATTNGAA 


AAAAGTGATA 


TTNGNGAAAA 


GTGTG 


775 



(2) INFORMATION FOR SEQ ID NO: 810: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 870 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810: 



TTGATTTTTC 


ATGGAAATGA 


GATGTACTGT 


ATCCCATTTC 


ATTTAGTTGT 


GCTTTATCAA 


60 


TTCTTAAACC 


TTCAAAGTCT 


CTAATATACA 


TCGTGTCCAA 


TAGACCATCT 


TTCCGAAATG 


120 


TTGCAATAGC 


ATTTTGCAAG 


TGAGCCTCTA 


ATGCAATGCC 


ATATTTAGTA 


ACTAGTGGGA 


180 


TTACGAGACC 


AAGCAATGCT 


TTACTATAAG 


TTTCAATCCA 


CGATTTCGCC 


GATGATTCAA 


240 


AATCAGACAA 


TGATGCAGCT 


GATTGATAAC 


GTTTAATCAA 


TGTCACGATA 


GGTGATTCAT 


300 


TGTTAAATGG 


GTAGGTTGCA 


ACTAAGCTTG 


AAGGAATCAT 


TGGTGTGACT 


TCTTGTGGAA 


360 


TCATNTGGTA 


TTTTUTTTNN 


TCTTAAATAA 


TGTACCTAAT 


TGTTCACTTC 


CGTCAAGNTG 


420 


ATAGTCTTAC 


TTCCGNCTTG 


CTCATTATAA 


AAATGAATAC 


CAGCAACTTC 


AACAATAATT 


480 


GGTGGATGCA 


CTAGGACTTA 


ANTATGACAT 


CTTTCTCCAA 


AATATCAATT 


AAAATACGTG 


540 


TCATAGGTGG 


ACCATTGTGC 


GTCGGTTGGT 


CTGATAATGT 


ACGAATNTCA 


CCCTGTAATA 


600 


TGAACGGTNG 


TCGACAAGTT 


GATGTGTGGC 


GACATAGCTG 


GGTATTTAGG 


AACTAATGTT 


660 


CTGAAAGATA 


AACCGGCATA 


ATAATCCAAC 


GNNTGTGTTG 


CTTCAATGAT 


TANTTCTTCA 


720 


TCTACTTCTG 


NTTGATAATC 


AGAATGTAAT 


ACGTCATCTA 


ATNGCCATGG 


ATGTACAATC 


780 


ATAATGTGAT 


AATCATTAAA 


GTNAAAGGTT 


GGCGTANCCT 


NCATCTNCTA 


ATTGGTTTAA 


840 
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TNANGTCCCG AAATAGNTGA TGGAACAGGC _ 870 

(2) INFORMATION FOR SEQ ID NO: 811: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811: 



ANCTATANNA 


TTTAGGTGGC 


GGNGCTNATN 


AANAAATNGC 


CATAGAGGTT 


CAATTGGCTA 


60 


ATNATAAANT 


CATCCAGAAA 


NAATACGTGA 


ATATTTTACC 


TAGTAAAGAA 


GGGTACNTGT 


120 


TACCAATTAA 


TAAANATGTG 


TACGAAGCAA 


TTAGCAAAGA 


ACGATTGAGA 


ATNATGGTTA 


180 


TGAAGCTGAT 


CAGAATGTAC 


GTATGACTCA 


TTATCATAAT 


GTAAGTCGCA 


AACAACAGGA 


240 


AGTTNTATTA 


NAAGGTCAAA 


TCNACCCCCN 


TAATACTTAT 


CATAATAAAG 


AAATTTATGC 


300 


ATTTGCAGGT 


TATCTAAACN 


ATTGATTTAA 


GAGGGTAGGT 


GTTTATTGCG 


AAATCTATCA 


360 


ACCCAATNTN 


AATGCAAATA 


NTGGCGGCAA 


NTACTATNAC 


ATATNACTTT 


ANGTNGNTCA 


420 


TGGCATCTTC 


TTTTGGNATA 


GGANCTTAAC 








450 



(2) INFORMATION FOR SEQ ID NO: 812: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812: 



GGCACGAGCT 


ACCCATTTCA 


AATTATTCAT GAGCCATATA 


TCCTTTTCCT 


ACTGTTGTTT 


60 


CGATTGCACT 


ATCCATACTA 


ATTTCAGATA ATTTTTTTCG 


TAAACGATTC 


ACATTTACTG 


120 


TTAACGTATT 


ATCACTAACA AATGCTTCAT CATCCCATAA TGNAGTGAT A t ATTGTNTCTC 


180 


TCGACCCGAT 


TTGATTTTTT 


TTGCTAATAA GAATTTCTAA 


TATANTCATT 


TCTGTTTTGG 


240 


ATAGAAAAAT 


CGTATCGACA 


CCTTTGTGTA TACTATCTTG 


CGATAGATCA 


ACGACAGCAT 


300 


CTTGCCAAGT 


CAATGTACGT 


TTCTCTTCAG CTGTAAACTC 


ATAGACACGA 


CGATAAATCG 


360 


CTTGTAATTT 


AGCAATTAAT 


ACATTGGTAT AGAAAGGTTT 


TTGCATATTA 


ATCAATCTGC 


420 


GCCAAGTTCC 


ATACTCATCA 


CTTGGATCCA ATTGGATTAT 


CNCGAGATGA 


TANAAATAAT 


480 
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ATTGGGNCGT 


TGGAAACTTC 


TCTCATTTTT 


CCTGCACCAA TAAAAACCCA TCATATTTAG 


540 


GTAATTGAAC 


AACCCAATAT 


AACAATTTCA 


nrmn'p & & & & r* 

uuMu 1 AAAAL 


1 W IT— AAA LXj 1 


ATCCATTACT 


600 


TTGCCGAAAT 


CTTCAATACC 


AGCAACATTA 


AAA L-A 1 1 


VjlviC IAATTL 


TTTTTTCAAT 


660 


TCTTGANACA 


AAGTATTGTC 


ATCTTCTACT 




UCATCCATAT 


CACCCAATAT 


720 


CATTTAGTAT 


ATTTCATATT 


TTCTCCTGAG 


Mi lAAl\.ViAL 


TCTUANAATC 


TTTAATCCGG 


780 


CAGTCAACTT 


CAAAGCCAGG 


ANTCATCATT 


TTTTGNATTG 


GTGCTTGTAT 


m k /"tin ik ik m » it 

AAAGTAATAA 


840 


ATCGNNCNCA 


ATAATCGAGG 


GATATAATCG 


I A I AUA 1 \aT A 


TATAAACGAC 


TACCGACTCT 


900 


TTGATTAATC 


TAAATTCTAA 


CTTCCCTTGA 


V» X AAC l\a iVj I 


ATTTTTTCAC 


CTAAACTCCC 


960 


ACTCCAATNA 


AAGTAAAGTT 


ATTATTCCGT 




I lv- 1 ATNNTA 


AATACTGCGA 


1020 


GCGGTCGCAC 


CTTATTCTTA 


ATATATATCT 


1AAA1 1\»ATC 


ATGTGATTTT 


TCTGTTTTCA 


1080 


CACAAGTTCC 


TTTAGTGTAA 


CCCATCCATG 


CAATAAAATG 


GTTTACAACG 


TTCTTTAATG 


1140 


TCCATCCCTT 


TGGTAAAACT 


ACCTTCATCG 


TTGATCTAAC 


ATCATCATAC 


TTTGAAACTT 


1200 


GTAATTCTAC 


ATTAACTAAA 


GAACGTTTAA 


AAACTAAATT 


TGTTGTTTCT 


ACAGGTGTGC 


1260 


CATATGCACC 


TAGGCGTTCA 


ATTGTTTCAT 


TATCATAACG 


ACTCCCAGGT 


ATGTAGATTA 


1320 


CTTTTTTTAC 


TTGATTGATA 


GCCGCTGCTC 


GACCAAAATT 


ATCTGCTGCG 


ATTAATGTTA 


1380 


AATCTCTTGC 


TGATGATTGT 


GTTATTTTGG 


CA 






1412 



(2) INFORMATION FOR SEQ ID NO: 813: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813: 



TAGAAGACAT 


AACAATTGGT 


GTTTCTTCAT 


TACTTTTCAT 


CCATTGGCTG 


ATATTATCAT 


60 


GAATATTATG 


CGTCATCATG 


TTGCACCTCA 


CTCTCAGCTT 


TTAGTGCTTT 


AATTTCATCT 


120 


CTAACAATGG 


CTGCTTCCTC 


AAAATCTTGG 


ATTTCAATAA 


Gn-rrn-cAA 


ATATTCATTC 


180 


TTTTCTTCGA 




TAAAGCTATC 


TTTTTATGTG 


AAGAATGTGG 


TGTCTTTCCA 


240 


ACGTGCTCAA 


ATTGTCCACC 


TTGAACTCTG 


CGGACGATAT 


CAATGATGTC 


ATCTTTAAAT 


300 


GTTGCATAAC 


AATTAGCACA 


CCCAAATTTA 


CCAACATGTG 


CAATATCTTT 


CAAAGTCATA 


360 


TGACATGATG 


GGCAACGTTT 




AAACGCTACT 


TCTTGAAAAT 


TAATGCCATG 


420 


TTTCGGAGCT 


AAATGTTNGG 


TAAAATTTGG 


TTAACAACAA 


ATGCTTCTTC 


GAAATTATCT 


480 


TGATGTCCTT 


GATATTCAAG 


GCTGNTCCAA 


TAGCTTGATT 


CCACGGATTG 


GTGCCCCTCA 


540 


GCACAAGTGT 


GACACACCAT 


TTNCTCTTCT 


NGTTNTATTT 


NNACTTGGTA 


ACTTTAACTT 


600 


TTAATTCCGC 


TTCATTAAGT 










620 



(2) INFORMATION FOR SEQ ID NO: 814 : 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1033 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:814: 



CCGAAGTAAT 


ATAGAAAATA 


CATNCGNTGA 


TTTCTGGGCC 


TTCCTATCAG 


GTTGGGCGCA 


60 


ATCATTTATT 


NATTTTCCAG 


CTAACGTAGC 


AGCATTGTCT 


ATCGTATTTG 


CGACACAGCT 


120 


AATTAATTTA 


TTCCATTTAT 


CTATAGGTTC 


GTTAATACCA 


ATAGCAATCG 


CATCTGCGTT 


180 


ATCTATTGTG 


TTGATAAATT 


TCCTAGGTTC 


AAAAGCAGGC 


GGAATTTTAC 


AATCAGTTAC 


240 


TTTAGTAATT 


AAACTGATTC 


CAATCATCGT 


TATTGTAATT 


TTTGGTATTT 


TTCAATCTGG 


300 


AGATATCACT 


TTTTCATTAA 


TTCCAACCTA 


CAGGTAATTC 


AGGAAATGGT 


TTCTTTACAC 


360 


CAATTGGTAG 


TGGTTTATTA 


GCAACTATGT 


TTGCATATGA 


TGGTTGGATT 


CATGTAGGAA 


420 


ATGTTGCGGG 


GGAACTTAAA 


AATCCTAAAC 


GCGATTTACC 


TTTAGCGATT 


TCAGTTGGTA 


480 


TCGGTTGTAT 


TATGGCTGTG 


TATTTATTAA 


TTAACGCTAC 


ATTTTTATTA 


ACGTTGCCAA 


540 


TAGAACTACT 


TGCTGGTAAC 


TTAAATGCAG 


CTTCAAGATA 


CATCAAAAAT 


ATTATTTGGT 


600 


GAAAATGGCG 


GTAAAATTAT 


TACCAATCGG 


AATATTAATT 


TCTGTTTATG 


GTACGGATCC 


660 


AATGGCTATA 


CTATGACTGG 


TATGCGCGTA 


CCATATGCAA 


TGGCTGAAAG 


AAAATTATTG 


720 


CCATTTAGCC 


ACTTATTCGC 


AAAATTAACA 


AAATCTGGCG 


CACCATGGTT 


TGGCGCAATT 


780 


ATACAACTTA 


TAATCGCTAT 


CATCATGATG 


TCAATGGGAG 


CATTTGATAC 


AATAACAAAC 


840 


ATGTTAATCT 


TTGTTATTTG 


GTTGTTCTAT 


TGTATGTCAT 


TTGTTGCTGT 


AATAATTTTA 


900 


AGAAAACGTG 


AACCAAATAT 


GGAGCGTCCA 


TATAAAGTAC 


CGTTATATCC 


GATCATACCT 


960 


TTAATTGCTA 


TTTTGGCAGG 


ATCATTTGTA 


TTGATTAATA 


CACTGTTTAC 


ACAATTTATA 


1020 


TTAGCAATCA 


TTG 










1033 



(2) INFORMATION FOR SEQ ID NO: 815: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 : 

CAAATCACTT AACATAGCAT CTTGTACCAG CTAAATATTC ATCACTTACA GGTAATGCCG 60 
GTCTGACATT TGCCAAGCCT CGATATAATT ACCACAAATC TTCAAATGAA GATGGCATTA 120 
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CCAATACGTC 


ATTGTCATTT 


CTCTCTCGAC 


GCATATCATT 


GATTAAATAT TCAAGTCTCG 


180 


CTTTATTTGA 


TTTTAACGTT 


TCCATTTAAG 


CCTCCGGTAA 


TGCTAGGAAT 


GCTTCTTTAT 


240 


CGACATCATC 


CAATTTGAAC 


AACCAATTTT 


CTTCTGGGTT 


TTCAGAGTTT 


AAAATTGTCG 


300 


GTTCTTCTTC 


CGCTTTTGTA 


TTTCGCTCAA 


TAATCGTTCC 


TGACAATGGC 


GTTTGCACAT 


360 


CAATGACCGT 


TTTCGATGCT 


TCGATACTCA 


CAATTTCATC 


ATCCACTTTA 


ACTTCATCTG 


420 


GACTTACGAA 


TTCAACATAA 


CCTACTGTCC 


CAATATCATC 


TTGCAATTCA 


GGTGTCATAC 


480 


TAAACACATA 


CAAATCTCCT 


ACTTTTTCTA 


CCCATAAATA 


ATTGGCTAAC 


TTTTTCATCT 


540 


TATCTTCATC 


CTTTCTCTTC 


TATTTAAAAT 


GCTACAAATA 


AATTTCCGCG 


AGTAATTTTA 


600 


ATGTTTTACA 


ACGCTGTTCA 


ATACCTGGAA 


TAAGCGGTGC 


TACTAACACC 


TCATCAACTT 


660 


CAAACGTAGC 


AATGAAATCA 


TCTAATTGTG 


CTTTAACCTG 


TTCTTGTGTG 


CCTGCAATGA 


720 


TGCGTGCTTG 


ATGTACTTGA 


ATCATCTCTT 


TGTCTCGATC 


ATTAAGCTTA 


TACTTTTGTG 


780 


CTGTGTCTAC 


TGAAGGAAAA 


TGTTCAAATT 


CTGCAAATTG 


GTAATTTACC 


TAATAACCAA 


840 


ACATCTAAGG 


GATGTTGGAA 


TGNTGGT ACT 


TCCGCTTCCG 


GTATCTGGTA 


CAAATGACAA 


900 


ATACAGGATG 


CCATCAACCT 


TGGGGGTCCA 


TTTTTATCCG 


NTGGATGTTG 


GNAATGTTTT 


960 


TGGGAATATC 


AAAGTNATCC 


CTCGNTGGAT 


GTATCGGATT 


ATATCTGGG 




1009 



(2) INFORMATION FOR SEQ ID NO:816: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816: 



GAATTCAACA 


TAACCTACTG 


TCCCAATATC 


ATCTNGCAAT 


TCAGGTGTCA 


TACTAAACAC 


60 


ATACAAATCT 


CCTACTTTCT 


CTACCCATAA 


ATAATTGGNT 


AACTTTCACA 


TCTTATCTTC 


120 


ATCCTTTCTC 


TTCTATTTAA 


AATGCTACAA 


ATAAATNTCC 


GNGAGTAATT 


TTAATGTTTT 


180 


ACAACGCTGN 


ACAATACCTG 


GAATAAGNGG 


TGCTACTAAC 


ACCTCATCAA 


CTNCAANCGN 


240 


AGCAATGAAA 


TCATCTAATT 


GTGCTTTACC 


CTGNTCTTGT 


GTGCCTGCAA 


TGATGCGTGC 


300 


TTGATGTACT 


TGAATCATCT 


CTGTGTCTCG 


ANCATTAAGC 


TTATACTTGT 


GGTGCTGTGT 


360 


CTTACTTGAA 


GGAAAATGTA 


CAAATTCTGC 


AAAATTGGTA ATTTACCCTA 


ATACCCANAC 


420 


ATCTTAAGGC 


ATTGGTTGCA 


ATGCTGCTAC 


TTCCGCTTCC 


GGTATCTTGC 


TACAAATGAC 


480 


AAATACAAGG 


TGGCCATCAC 


CGTGGGGGTC 


CAATTTTCTC 


CGGTGGATGN 


TTGGGAAATT 


540 


GTTTNGTGGG 


GAATATCAAT 


GGTCATCCC 








569 



(2) INFORMATION FOR SEQ ID NO: 817: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 915 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817: 



GANTTCAGCA 


CGCGCAATAT 


TGGGAGATAT 


TCCTAAAAAT 


ATTGAAATAC 


ATTATTATGA 


60 


ATCATTTGCT 


CAAACACACC 


AAGGACATGG 


TACAGATGTC 


GCTATTGTGG 


GTGGAGCAAT 


120 


GGGATATTCA 


ACATTTGATA 


ACCGAATTAA 


ATCTTCATTA 


GAAATTGCAG 


CTGATGATGG 


180 


TATTAAAATA 


GATATCATCG 


AAGAAGAAGG 


CGATAGTATT 


GGTCAACATC 


CAAACTGTGC 


240 


TTATATTAAA 


TCATCTCGTA 


ATGATGGTCG 


CTATATTGAA 


ATTATCGGTA 


TTTCAATCGG 


300 


TGGTGGTACA 


ATTAAAATTA 


AAGGTATTCA 


ATATAAATGG 


ACTAGAAGTA 


GATTTAAATC 


360 


ATGGATTACC 


AATTTTAGGT 


GGTTGGACGG 


AAATATTGAC 


TTAAAGCACA 


AGTTAAATCA 


420 


ATTTCATTAA 


TGATATCAAC 


GATATGGAAA 


TTAGACTTGT 


AAGGATTGAT 


TTATTAAGAT 


480 


AGATGAAGAT 


AAATGCTGGG 


TTGGGATACC 


CATAAATTAA 


GCAATCTCAA 


GATCGGACAT 


540 


TAAATCAAAT 


TAAAGGAGAA 


ATACAAGTGA 


CGTAAACGCT 


TCCTATATAA 


ATTAGAGGGG 


600 


GAATAAACAT 


GTTTGATTCA 


ATTAGAGAGA 


CTATAGATTA 


TGCCGCAGAA 


AATAATATGT 


660 


CATTTGCGGA 


TATCATGGTT 


AAAGAAGAAA 


TGGAATTAAG 


CGGTAAATCA 


CGTGATGAAG 


720 


TGCGAGCGCA 


AATGAAACAA 


AATTTAGATG 


TCATGCGAGA 


CGCAGTAATC 


AAAGGGACGA 


780 


CAGGTGATGG 


CGTTGAAGTG 


TAACGGGCTA 


CACTGGTCAT 


GACGCTGCTA 


AACTACGTGA 


840 


CTATNATGAA 


ACACATCANG 


CTTTGGCTGG 


ATATGAAATG 


ANTGACGCAG 


NCAAAGGTGC 


900 


CATTGCAACA 


AATGA 










915 



(2) INFORMATION FOR SEQ ID NO: 818 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818: 

GTAAGTGGTC GCTATGAATA ATACCTCATC GTCATGATGG GTTGTAATCC CTAGGATATG 60 

ATTTATGTCC AAATAGACAT ATCTTGGATA TCAATTCAAC ATATATATAT GTATTATCAC 120 

GTACTCATTG TGATATATTC CAAAGATTTT TAAACCTATA CCCTGCACCC ATTAATAANC 180 

TATTATCGTT GTTGACCTGC CCTTTAATAT GTATACAGGT TCTGACGTAA GAACTGTCGG 240 

AGATTTAGGG AGTATTAAAC AACCTTTACC TCACTTTTTA ATCCCTAATG TGCCCTTTAA 300 
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TTTAGAAACA 


CTTCACATTA 


TTTTCCCTTA 


CTCTTTATCT 


ATGGCTATTG 


TAGGTTTAGX 


360 


AGAAAGCTTA 


CTTACAGCTA 


AAATTGTAGA 


TGACGCTACA 


GATACTTATA 


GTAGTAAAAA 


420 


TAGAGAGTCA 


CGTGGTCAAG 


GTATCGCTAA 


CATGATCACC 


GGTTTATTTG 


GTGGTATGGG 


480 


TGGTTGTGCC 


ATGATTGGTC 


AATCGGTCAT 


CAATGTTAAA 


TCAGGAGCAC 


ACAGTAGATT 


540 


GTCTACATTT 


ACAGCCGGTA 


TCGTACTTAT 


ATTTATGATT 


ATTGTACTTG 


GAGGACTCGT 


600 


AGTGCAGGTT 


CCAATGCCAA 


TTTTAGCAGG 


TATTATGGTT 


ATGGTTTCTG 


TTGGTACATT 


660 


TGATTGGAAT 


TCTTTTAAAT 


ATATTAAAAA 


AGCACCCNAA 


AACAGATGCA 


GTTGTTATGC 


720 


TCATTACGGT 


GATTATCGTT 


TTAATGACTC 


ATAACTTAGC 


TCGTGCC 




767 



(2) INFORMATION FOR SEQ ID NO: 819: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819: 



TCAACAATAA 


ATAATCGTCT 


TTATTAAACA 


ATTCTACTGT 


CACTTTTGTA 


CCGTTTGAAT 


60 


GACGCAATGT 


ATTCGAAATT 


GCTTCCTGTG 


TAATTCTGAA 


CAAATGATCT 


TCAATACCTT 


120 


TAGGCACTTT 


AAAATCTTGT 


ATTTCATGCA 


CAACTTTCAT 


TGGCACTTTT 


TTGTGTAAAT 


180 


CAATAACTAA 


ATCTTTAATA 


CCCTCACCTA 


AAGATTTGTC 


TTTTAAACCA 


AGCGGTCTTA 


240 


AATGTAACAG 


CAAAGCACGC 


ATTTCTAACT 


GCGAATCTTG 


AACCATTTTC 


TCTAAAATCG 


300 


GAATTTGTTG 


GTCTAATGGT 


GGTTCTAACT 


TCCGTTTCTT 


TGATAGCAGA 


AAGCATCATA 


360 


CTTGCCGCAA 


AAAGTTGCTG 


ACTAACAGAA 


TCGNGAAGTT 


CTCGAGCTAG 


TCTTTGACGN 


420 


CCATCTTCAA 


TAATCTTTTT 


AACTCACACA 


TCATTAATAT 


TATAATTTTC 


ATTGGNTAAG 


480 


TTTTGAGTTT 


TAAGTCGCAA 


CTTATGCAAT 


TCTTGATTTA 


AAGGTACGAG 


TGTATGGTAT 


540 


AAATCTAACG 


TTTCACTATA 


TATTTCTATA 


TTTGGATCAT 


TAATGCCAAC 


TGTTTCGCCT 


600 


TCCATTGAAC 


GCTCAATTTG 


CGTCTTAATC 


CAATCATTTT 


GCTGATTGAT 


TTTGTATGCG 


660 


AGTACCGAAC 


CAACAATAAT 


ACACAATAAT 


ATGATGATGA 


GATTTAAAAA 


TAAAAAGACT 


720 


GGNATTCCGA 


ATATTGGTGT 


ATAAAACATA 


CCTTGGAAAA 


TAGATGATAT 


TTACAAAAAC 


780 


TTTATCGATG 


AATAGAAATG 


CAGCTAGCAT 


GCTATATACT 


AAGATGAGCA 


TGGAACCAAT 


840 


TGTTCTAATG 


TAGGGGTTCA 


TCGATAAATC 


ACCTCTACGT 


CTCCGATAAA 


CGTTGATACG 


900 


TAGATATTAA 


CTGTATAGTT 


ATCCGGTTTC 


ATCATTTCTT 


CAATATGAAT 


ATTGTTATTT 


960 


TCAACTTTAT 


ATGATTTTTC 


ATTCACGTAA 


GTACTTCCAT 


AAAAAGCAGC 


TCGTGCC 


1017 



(2) INFORMATION FOR SEQ ID NO: 820 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1318 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820: 



ATTAAAAAGG 


TAGGGATAGA 


TGTGAAGAAA 


ATCATTAGTA 


TCGCAATTAT 


AGTTTTAGCG 


60 


TTGGTATTAA 


GTGGTTGTGG 


TGTCCCTACG 


AAATCAGAAG 


TGGCTCAAAA 


GTCATCGAAA 


120 


GTTGAAGTGA 


AAGGCGAGCG 


ACCAACAATA 


CATTTCCTAG 


GACAAGCAAG 


TTATGAAAAT 


180 


GATATGAATA 


TCGTTAAAGA 


TCAATTGGAA 


AATGCAGGAT 


TTAACGTGAA 


GATGAATATC 


240 


CACCCAGATT 


ATGGTAGCTA 


TCGCACACAA 


CGTCAAGCCG 


GCAATTATGA 


TATCCAAATT 


300 


GATGACTGGA 


TGACAGTGTT 


TGGGTGACCC 


GAACTATGCT 


ATGACGGNAT 


TATTTAGCTC 


360 


TACAGGATCA 


AATAGTTTAT 


TGANAGATAA 


ACATGTAGAC 


CAGGTGGTTA 


AATAAAGCTT 


420 


CTACTCAAAA 


TGAAGCAGGT 


TGTTAAACAA 


ACATATTATG 


CAANTTGANG 


ATGANGTTGG 


480 


TTTTGGTAAA 


GGGTATATGG 


CGCCTTTATA 


TGGATCAAAA 


AAGAATTTNG 


TATATGACAA 


540 


TAAAGNGTCA 


GATAAAAATA 


GTGTTGGATT 


GCCAAATTCA 


CGTGCATTAA 


TATGGCAACA 


600 


ATTTGATTAC 


AACAATAGTA 


GAGAACGACN 


CGATNCGCGG 


CCACTTGTGA 


TGACACAACA 


660 


AGATGGTGAA 


ATTCCTACAT 


TGGATCCAAT 


ACGTTCAATT 


GCGCCGNCAG 


TATATTCAAT 


720 


TAATATGAAT 


ATGTACACAA 


GGTTATTATT 


ATTAGATGAA 


AATGATCACT 


TAACAACGAA 


780 


AGGTTCGTTA 


AGTCGTGATT 


ATGCTGTGAA 


TAAGAGACAA 


TAAAGCATTT 


TATTTCTTGT 


840 


TAAGAGATGA 


TGATTATTTT 


GCGAAAGTGG 


TCAATGGACA 


AGCACGTCAA 


TACTGGAGAG 


900 


CGTGTATCGG 


CTGAAGATGT 


TAAGTTTTCT 


TTAGATAGAG 


CACGTGATAA 


AAAGTCTTGT 


960 


GCCTAACAAT 


AATACTTACA 


ATATTGCACA 


AANCATATTA 


AATTGACATC 


AAAGATATTA 


1020 


ANAGATGNGG 


ACATCGATCA 


AGTTGCGTTN 


AAAGAGAAAG 


ACAAAGGATT 


GATAAATCAA 


1080 


TCTTATGATA AGTTNGGANT 


AANGCTTATT 


AACGTCAAAT 


CCGGNNAGCG 


ACAGATNGNC 


1140 


TCAAAANGGT 


AAATAATAAA 


GGCGGTATTT 


ATCACATTGT 


TAAAATTACG 


ACAGATCAAT 


1200 


CGATGCCTCG 


AGAGGTAAAT 


TACTTAACAC 


ACTCTTCGGC 


AGGCATTTTA 


TCTAAAAAAT 


1260 


TTGTTAATCA 


AGTAAATCAA 


GAATATCCCA 


CAGGATATGG 


GGATAGCAGT 


CTCGTGCC 


1318 



(2) INFORMATION FOR SEQ ID NO: 821: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821: 
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TTAAATTACA 


AGTTGCGGAT 


CAAGTCAGCA 


CGCTAAATCG AATTACAAGT GCTTTTGTJC 


60 


GCCTACAATA 


TAATATCGAT 


ACATTACATG 


X lAUiLnl JLv, 


iuAAUuiLL 1 


GGGATTTCTA 


120 


ACATGGAAAT 


TCAAGTCGAT 


ATTCAAGATG 


Al AC-A.TLAV- 1 


rCATATATTA 


attaaaaaat 


180 


TAAAACAACA 


AATTAATGTT 


TTAACGGTTG 


AATUU l AUvJA 


CCTTGTTGAT 


AACGAAGCTT 


240 


AATTTTAAGA 


CAAAGGCAAT 


GATGCGCTAA 


TTAGTTATAG 


ATATATCATA 


GGCTGCTAGT 


300 


TAACATCTGC 


CACTATTACA 


AAGTTATATT 


TCAGAATTTT 


CGAAAAACAA 


AACATTTAAT 


360 


TATTTGGAGG 


AATTTATTAT 


GACAACAGTT 


TATTATGATC 


AAGACGTAAA 


AACGGACGCT 


420 


TTACAAGGCA 


AAAAAATTGC 


AGTAGTAGGT 


TATGGATCAC 


AAGGNCACGC 


GCATGCACAA 


480 


AACTTANAAG 


ACAAATGGAT 


ATTGATGTAG 


GCATCGGTCT 


TCGCCCAGGT 


CGGTTCTTTT 


540 


GACAAAGGCT 


TAAAAGAAGA 


TGGATTTGAT 


GTCGTTCCC 






579 



(2) INFORMATION FOR SEQ ID NO: 822: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822: 



CTATGATAGN 


ATCGGAAAGA 


TGTATAAAGT 


TATCTAAAAG 


TTATACGACA 


CAAGTACACG 


60 


AGGCACATCG 


CTATGCGGTG 


TGTCTTNTGG 


TATGCAATCA 


AAGAGGTGTA 


AGAGATGACC 


120 


AAGCATAATA 


ACATTTATTA 


GCATGGNCGT 


AAGTCATATC 


AATACGATTG 


GTTCTATCAT 


180 


TCAAAAGCAT 


GGAAGAAGTT 


AAGAGAGATA 


GCATTAGATA 


GGGATAATCA 


TCTTTGTCAA 


240 


ATGTGTTTAC 


GTGAAGACAT 


NGTAACAGAT 


GCAAACATAG 


TGCATCATAT 


TATTTATGTT 


300 


GATGAAGATT 


TTAATAAAGC 


TTTAGACTTA 


GATAATTTGA 


TGTCAGTTTG 


TTATAGCTGT 


360 


CATAACAAAA 


TTCATGCAAA 


TGATAATGAC 


AAAAGTAATC 


TTAAGAAAAT 


TAGAGTATTA 


420 


AAAATTTAAA 


TAAAAAAATA 


ATTTATTTTT 


ATAGCCCCCT 


ACCCATCGGC 


TTAAAATGTT 


480 


TTTTCGACGG 


GTACCGGCGG 


GGGCCCTTCG 


CTTGCAACGC 


GGATAAACTT 


TTATGAAAGG 


540 


GGGTCTTTAT 


ATGAAATTAA 


CAAAAAAACA 


GCTGAAAGAA 


TATATAGAGG 


ATTATAAAAA 


600 


ATCTGATGAC 


ATATTAATTA 


ATTTGTATAT 


AGAAACGTAT 


GAATTTTATT 


GTCGGTTAAG 


660 


AGATGAACTT 


AAAAATAGTG 


ATTTGGATGA 


TAGAGCATAC 


AAACAAGGCT 


GGTGCCGAGC 


720 


AATATTGTTA 


AGAATCCATT 


AAGCATAGAA 


CTGACAAAAA 


CAGNTCAAAC 


ACTAAATAAC 


780 


TTACTCAAGT 


CTATGGGTTT 


ANCTGCAGCA 


CAAAGAAAAA 


AGATAGCNCA 


AGAAGAAGGT 


840 


GGATTCGGTG 


ACTATTAAAG 


TTTTAAATGA 


ACCTTCACCA 


AAACTATTAA 


CAACATGGTA 


900 


TGCAGAGCAA 


GTCACTCAAG 


ggaaaataaa 


AACAAGCAAA 


TATGTTAAAA 


AAGAATGTGA 


960 


GAGACACCTT 


AGATATCTAG 


AAAATGGAGG 


TAAATGGGTA 


TTTGATGAAG 


AATTAGCGCA 


1020 


CCGCCCTATT 


CGATTCATAG 


AAAAGTTTTG 


TAAACCNTCC 


AAAGGATCTA 


AACGTCAACT 


1080 


TGCATTACAA 


CATGGCAAC 


ATTTTATTAT 


TGGCAGTTTG 


TTTGGTTGGG 


TTCATAAAGA 


1140 


AACAAAACTG 


CGCAGGTTTA 


AAGAAGCTTT 


GATATTTATG 


GGGGCGAAAA 


AATGGTAAAA 


1200 
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CAACTACTAT ATCTGGTGTT GCTAACTATG CTGTTTCTCA AGATGGAGAA AACGGCGCTG 1260 

AAATCCATCT NTTAGCAAAC GTAATGAAAC AAGCTAGGAT TCTATTTGAT GAATCTAAGG 1320 

CGATGATTAA AGCTAGCCCA NAGCTTAGAG AAAATTTTAG ACCTTTGAGA GATGAAATTC 1380 

ATTACGATGC ACTATATCTT AAAATATGCA CAGGCTTCAG ACAGTGATAA GTGGTTGGTT 1440 

AA 1442 

(2) INFORMATION FOR SEQ ID NO: 823: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 : 



AAGCCGACCG 


ATGAAGGTTA 


TTTGAAATTA 


CATAAGACGT 


GGTTTTGTAA 


ATCAAAGCTA 


60 


TGTCCAGTTT 


GTAATTGGAG 


GCGTGCTATG 


AAAAATAGTT 


ATCAAGCTCA 


AAAAGTGATT 


120 


GAAGAAGTAA 


TTAAGGAAAA 


GCCAAAAGCA 


CGTTGGTTGT 


TTTTAACACT 


TCCAACAAAA 


180 


AATGCGATAG 


ATGGAGATAC 


TTTAGAACAA 


AGTTTGAAGC 


ATCTAACTAA 


AGCATTTGAT 


240 


AGGTTGAGTA 


GATATAAAAA 


GGTTAAACAA 


AATCTTGTTG 


GATTTATGCG 


TTCAACAGAA 


300 


GTTACCGCTA 


ATAAAAATGA 


CGGTAGTTAT 


NATCAGCATA 


TGCACGTNTT 


GGTATGTGTT 


360 


GAAAATGCAT 


ATTTTAGAAA AAAAGAGAAT 


TATTTAACTC 


AAAGAAGAAT 


GGGTTAATTT 


420 


ATGGCAAAAA 


GCATTACAAG 


TTGATTATCG 


ACCTGTNGGT 


ATTATTAANG 


CAATTACCCC 


480 


ACCAAAAAGG 


CGATAAAGAT 


ATTTGGATTC 


CGGCAACAAN 


GAGACATCAA 


ANTATCCGGT 


540 


CAGTCATCTT 


ATTT 










554 



(2) INFORMATION FOR SEQ ID NO: 824: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824: 

CTGTGCTTCA TCAAAAACAA TATAATCAAA TTCATTTTCA TCAAATTGCT TAAAATTATC 60 
ATCCCAAGAT AATGTTGGAA GGGTTCCAAA TAAAAATTTG CCACCGACAN CTCCATGTTT 120 
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TCCTTGCCAN 


CCAACCCAAA 


ATCACTATCA 


TTTTTAATNG 


GTAAATACTT 


TNTNGAAATT 


180 


CTTCCCTTAG 


CTCTAATTGN 


AAAATCCCCT 


CATTATGGGG 


CAATAAATAA 


AAATTTATTA 


240 


GGGTTTACCT 


TCTCTAACAA 


TCTAATGCAC 


ATAAAATCGT 


TTTACCCTGT 


ACCCAGTTGC 


300 


AAGATATTAT 


TAACGCCTTA 


TCTTTGGNTT 


TATCCCTAAT AGCTTTTAAT 


GACCTTAATG 


360 


CTTCTGCTTG 


CATTAAATTG 


GGGACAATTT 


CCACTGATTT 


TTTCACCTTA 


TCAGCTAGCA 


420 


GCATTTGAGT 


TTGTTCAACC 


TCCGCTAATT 


TTTCTAAGGA 


GCGGTACTCA 


NATGATTCTT 


480 


TATATGAATT 


AATCCATTGC 


TCAGTCAGTG 


GGGTACTCTT 


TTGCCATAAC 


AATTCAAATT 


540 


CATTTTTTAC 


ACTATCAACT 


AAATCACCAT 


TTTTCATAGT AGACAGTAAA 


ACATTATGCT 


600 


CAGAATTAAC 


CTTTAACGCA 


TTAGATGTTA 


AATTAGAGCT 


TCCTATTACC 


ATAGAACTAT 


660 


AATCCTTATG 


TTCAAAAATA 


TATCCTTTGG 


CATGGAATCC 


AGCAATATCA 


GTTAATCTTA 


720 


CCTCTACATT 


TTTTAATTTA 


AGTAATTCTC 


CATACATTTT 


AGGACTATTA 


AATCCTAAGT 


780 


AATTAGATGT 


TAATATTTTC 


CCTTTAACAN 


CCTTATTGCT 


TAATCTAATA 


GGTGAGCCTT 


840 


TAGCTGGCTA 


ATCCGCTTCT 


TGTATTAAAG 


CTACAGAAAA 


TAGACGTTCA 


CATTTNGAGG 


900 


TCACTATATG 


GTGGAAGG 










918 



(2) INFORMATION FOR SEQ ID NO: 825: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825: 



CCATTTAAAA 


GTATTGTAAA 


ATCATCCACN 


TTNTATAAAC 


CAACCACNTT 


AACNTTTTTG 


60 


ACATTTGTTA 


TCCGATGAGA 


TTAAAAGATA 


TCAATNAATA 


CAATTTTTAN 


AATTAATGTC 


120 


ACTATGTTTT 


CCGATAATAT 


NACCCAATCA 


TCGNAATGTT 


ACCCATTTAT 


AAAATGANAA 


180 


ATCNTTGACA 


TAGGTANAGG 


GAATGTATAT 


TGGTCNCGGA 


TCACTTAAAT 


TAAACCCANA 


240 


TCATGTCATC 


TGGTAATGTN 


TCAATGTTAA 


TTGCTCCTGA 


AGCGGCGTAN 


ACTTTAATCT 


300 


TCCATGTTAA 


ATGAGTAAAT 


TGATGCGTCA 


ACTCNAAAAT 


AGGTGTTTCT 


NCTGGNTGAA 


360 


TGTCATGACC 


GATTTTTTCA 


NTCATTTTAC 


GTCTANCATG 


CTCACTATCN 


AACATAGGAN 


420 


ATTGCCACAT 


ACCATACNAT 


AATTNTTCCC 


TACGCTTTTG 


CAACAGATAT 


TGACCTTGAT 


480 


TATTTCTAAT 


TAANAAGACG 


GATTGCTCAA 


TTACNTTTTT 


ACTTACATTT 


TTAGATTTAA 


540 


CAGGTAACTT 


TTCAAATGGA 


CCTTTATCAA 


ATGCCTCACA 


GTTTTCTTGN 


ACTGGACNAA 


600 


ATAAGCATAA 


TGGATTTTTT 


GGTGNACAAA 


TTAATGCCCC 


TAATTCCATC 


ATAGCTTGAT 


660 


TAAACGTTCC 


AGCTTCTGTA 


GTAACATACG 


GTAACAATTC 


TTGTTCGTAC 


GATTTCCTCG 


720 


TCGATTGTAA 


TTTAATATCT 


CGATAGTCAT 


CATTCAATCT 


AGACCATACG 


CGAAAAACAT 


780 


TTCCGTCTAC 


AGTTGCTAGT 


GGTACATTAT 


ATGCAATGCT 


CATTACTGCA 


GCTTGTGTGT 


840 


ATGGGCCAAC 


ACCTTTTAAC 


GCTTTAAATT 


GATCAGGATC 


TTTGGGAACT 


AAGCCTTCAT 


900 


ATTTATCANA 


AAC TTCTTTA 


ATCGCCGTAT 


GAAAATTTCG 


AGCTCTACTA 


TAATATCCTA 


960 



740 



WO 97/30070 



PCT/US97/02318 



AGCCTTCCCA 


ATACTTTAAC 


ACTTVAXVX 1 


1-1~IjAAvjV 1 1\3 


AL XV AAAAV, X 


XV VAVAu X Xjo 


xuzu 


GAAATCGGNC 


ACCAAAACGA 


i oA 1 nn 1 n\J 1 


V.nn X Aril— 1 \7 1 


X X X AAV X X vj X 


v> XV X v7 X IVj X A 


1 nan 


ACATGACCTC 


ACTTAACCAA 


x rp jk rp IS /T* 1\ /^/~* 
ATA1 Au 1 AL.O 


uAl Iviva XVV* X 


X X\7 XVIjV VA X 


uuLAl X XV IV 


1 1 JIA 


TTTGATTTTC 


ATCAAACCAG 


I vjl A IV AAA I 


1 X IV 111 AAA 


nt X AuAL X \iV 


X\aAX A VAX X X 


x/uu 


ATAAAACCCT 


TTCCTCACCA 


X * Ik rnrn j\ jk mqv-< 

AAAX 1 AA1 Av» 


TCTTTACTCA 




ATTGTACATT 


1 O t A 

lZbU 


AAAATCATGG 


TTAGTATGTA 


AGTTAATTTA 


X X AX NTLrVLi 


AAA ITvjVj A i X 


ATAATAGTAT 


1 1 O A 

1320 


ATATAATATT 


ATGAAATGAG 


TGAAL. 1V» A I A 


XVAjAL-At- 1 vjL. 


AAvAL. ATAT V 


GCAATTGGGG 


1 O O A 

1380 


TGGGCCTTAC 


AGCACTTGCA 


ACTCAAGATC 


CAGCAATGGC 


TTCTACGTTT 


GGTGCAACAG 


1440 


CTACAACCCT 


TATCGTTGGT 


TCATTAATTC 


CTGATGGGGA 


TANTGTNCTT 


AAATTANAGG 


1500 


ACANTGCAAC 


ATATATTTCG 


NATCATAGAG 


GNATNACGTC 


ATNCCATCCC 


CTCCCACAAN 


1560 


NNTATGNCCA 


GTCNCNTTTA 


CANTTTNTAT 


NTNTTCACGT 


CACTNTNGCT 


GGTANGCATC 


1620 


CCNCCTCACG 


TATGGCTTGT 


GG 








1642 



(2) INFORMATION FOR SEQ ID NO: 826: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 6: 



CTAATCGCTT 


ATGCAACAGC 


TCAAACAGAA 


TTATCGATGA 


GGCAATATTT 


CGAATCAACA 


60 


GCTTATCGTG 


TGTTCAGTGA 


TCAAGCAGGT 


ATTTTAGGTG 


AAGGTGTACA 


AGTAGCTAAA 


120 


GAAGTTATAG 


ATACAATTAT 


AGATACACAA 


TTTCATGGTG 


TCGTTTGGTC 


AGAGCGATTA 


180 


TGGACTAATA 


CCGAAGCAAT 


GAAGCAAGAA 


ATAGAAGAAA 


TAATTGCCAA 


TGTAGTTATT 


240 


AGAGGTCGAC 


ATCCTAACGA 


ATACGTTAAA 


GATATGCGTA 


AGCACCTAAA 


TAAATTCGAA 


300 


GGCACAGCAC 


GACAAAAGGC 


CGCAGCAATC 


AAATCATTGG 


TTTATACAGA 


ATCAGCACGT 


360 


GTTCACGCAC 


AATCAAGTAT 


TGACAGGATG 


AAAGAAATCT 


CTCCAGAAGG 


ATATTATATG 


420 


TATATTGCAA 


AAATCGATAA 


TAGAACAACT 


AAAGTATGTA 


AAGGGCTTAA 


TGGAGAAGTA 


480 


TTCAAAGTTT 


AAAGACGCTA 


AAATTGGTGT 


TAATTTCTAT 


CCTATGCATA 


TCAATTGTCG 


540 


CTCAGATTGC 


GCTTTATTAC 


CTAAATCCAT 


GTGGCCGAAA 


AAAACAAGAA 


CAATAANCCG 


600 


AAAAACAAAA 


TACTTTGGAG 


GGAAAGTGAA 


AAGCGGTGAT 


TGATTTGAAA 


GTGAAGGTTT 


660 


TTAGAGGCAA 


AAAAGGTAAG 


TTAGCTTTGT 


ATGACAGTGA 


ATTAAAAATT 


TGGAGGATAC 


720 


TAATATGAGC 


AATACTGACA 


AATACCTTAG 


AGACATAGCA 


AGAGAGTTAA 


AAGGTATACG 


780 


TAAAGAGTTA 


CAAAAGCGAA 


ACGAAACAGT 


TATTATTGAT 


GCAAACTTAG 


ACAGCGTAAG 


840 


GTCGGCAGTA 


TTANCCAATA 


AAGANAAACC 


GAAATATAAC 


GAACCACTCT 


TTTAATAGTC 


900 


GTAGCACTTA 


ATTGTGTTGG 


CTATTTTTTA 


TGTCCAACAC 


GTGCTGATGA 


CATAAAAAGC 


960 


A 












961 
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(2) INFORMATION FOR SEQ ID NO: 827: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827: 



ACGATATTGT 


TCTGTTTGTA 


TCGGTTCTAT 


CACGATTTCA 


AATTGTTTCA 


TTCATTTCAA 


60 


TCCTCTATGT 


TAATTGTGTA 


TCACAATCAT 


TTTTAATTTT 


ATTTAATAAC 


CATTGCTCAA 


120 


ATGCTGTAAA 


ATCTTCATGC 


TCCCTTACAT 


TAATGCTATA 


ACAAACATTC 


GACAATTGTT 


180 


GTAATACTTG 


CAACTCTTCT 


TCATCTCGAT 


AGACTACGAC 


TTTTTCAAAA 


TCAGCATTTT 


240 


TAAAGCCTTC 


AACTAATACG 


ATATTGGTGT 


CAATTGTAAC 


AGATTTCTCA 


ATAATTTGAG 


300 


TAAGATTGTT 


GGATTATCTA 


CACGTGTTAC 


AGTTTGCTGA 


TATTGAAAAC 


CTTGTACAAT 


360 


ACTTTGGTCT 


GCCCCCGCTT 


CAAAATGCTT 


CATGTGATCC 


GACGGTCTTG 


AATCCTGTAA 


420 


TTGAATATCT 


TCCTTAGCAT 


GCCCATGATG 


TTTAATAGTA 


GCAACTGTAT 


TACCATGTTG 


480 


TCTTTAAGAA 


AGAGAAATAT 


GGCCTCATCA 


ATGTNGNCTT 


ACCAGACTTT 


GTGGTNACCT 


540 


ACAATTGGGA 


AATCATAAAG 


AAGTCCCCTT 


CAAGAGGGTC 


AGTCAAGTCA 


ATTTATTCTC 


600 


T 












601 



(2) INFORMATION FOR SEQ ID NO: 828: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 8: 



TTGAACTCGT CAAGCGTTTG AATCAATTTA NTTGAANTTT NACAAGAACC CAATACTTTG 60 

TCCCGANCCG ATTGGATTAA CTATTGTTGC CCCGTGACCC ACCACCTAGC ATTGATAGAT 120 

CCCNACATTA ACNTTAAAAA TATTGAATCA CNTCAGTGAA AATTGCCCAC CGATAACTAA 180 

CAATTGCNTG TATCNTTGGG CACAATATAC CAAACCAATA TTCAATAAGT AAGTAGGGAA 240 

AGTTATGACA GTATTTGTAA TGCAATTACA GAGTAACTTG AATAGTATTG AAGAATTAAT 300 

ATCACAAAGT CGTTGGTCAT ATAGGNAAAA NCCGCGTACA GTCAACTATN GATACAATCA 360 

AGATAAACTC ATGCACAGNT TGGGAGATAT TTTNGCGCAA TATGGAATNC AACATGACAC 420 
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AGGTTTATNA 


CCACATGAAT 








LnuninJ J. Vj\_ 


%o\J 


NCAACACAAT 


CGTGATGGAC 


Aij*~v_UA L\- 1 A 


If V^OV^ A ^^J' *J 1 M 11% 

lo luAuC X 1 A 


IvJAIAXAIjI 1 






CGTGTGTGTT GTCGATAAAG 


ANCCAGGTGG 


TATAGATATC 


GAAAAGATAT 


CACANCGTTT 


600 


AGNCTGGCGN 


AATCGTTAGT 


GACGTGTCTC 


TCTACAANCG 


AAGCACATCA 


NATATGTAGG 


660 


CTTACGTGAT 


TTNTCTCAAN 


TCTGGACACA 


NAAAGACGTT 


TTACAACCTT 


GATTGGGCGA 


720 


ANGGTTTACT 


TGGCAAGCGA 


ATCATACTTC 


TTTTAATAAG 


GTGCTTGNTA 


GAAAAGGTGA 


780 


CCTNGATGAC 


ATTACATTAC 


AGCTTTATAC 


AGAAGCTGAT 


TATGNATCAN 


TTTG 


. 834 



(2) INFORMATION FOR SEQ ID NO: 829: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1825 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829: 



TTCCATATCT 


TGTATTTCTT 


ACCATTACGT 


TGATGCTTCA 


TAATTTTCGT 


ATNACAAATN 


60 


CCGCTTCATC 


ACGTTCTGTC 


ATTGCTTCAT 


AGTAATGAAT 


TTTCTCACCA 


TTCGTATTTG 


120 


CAGTCCACAG 


TCCTTTTGGC 


TTACGTTCAG 


AATTATTTTT 


AATCACTTCG 


TTACCCNCAN 


180 


ANAAAATCGT 


TCTCGTCGAN 


CGATAATGTN 


GCTCTAAAAA 


GATTGTATTC 


GCTTCTGGGT 


240 


AGTCTTTTTC 


AAATGATAAG 


ATATTTTGAA 


TATCAGCACC 


ACGCCAACCA 


T AAATTG AC T 


300 


GATCTGAGTC 


ACCTACAACA 


CATAAGTTTT 


T AAACTT AC T 


TGCTAATAAT 


TTAACTAATG 


360 


TATATTGTGC 


TTTATTAGTA 


TCTTGATACT 


CATCTACATG 


AATATATTGG 


AATNTGTTCT 


420 


GATAATATTC 


TAGAACTTCT 


GGTACACGCT 


CAAATAAGTT 


AATCGTTGTC 


ATAATAAGGT 


480 


CATCANAATC 


TAACGCTTCA 


TTACGCGACA 


ATTGGCGTTG 


ATATCCACTA 


TAAACCGTTG 


540 


CTACCATTTG 


CGAGTGATAA 


TCTGTGGCTT 


CTTTTTGAGC 


ATCTGCAGGT 


GTTTTAAGTT 


600 


CANTTTTCAA 


ATTACTGATC 


GCACCGATAA 


ACATACGAGG 


NTCANACTTT 


TTACTATCNA 


660 


TATTTTCATT 


TTTTAAGACG 


TCTTTAATAA 


CAGATTTTTG 


GTCTGTAGGA 


TCAATTATCG 


720 


TAAAATTGCG 


TTCTATACCT 


ATTCGATCTG 


CATCACGACG 


TNAAAATACG 


AACACACATT 


780 


GAGTGGAATG 


TTGACATCCG 


CATAACTTCT 


GCTTGATCAC 


CTACTAATTT 


TTGNACACGT 


840 


TCTTTCATTT 


CTCTTGCAGC 


TTTATTTGTA 


AAAGTAATAG 


CCAAAACATT 


GTATGGTGAG 


900 


ACATCTTTTT 


CGTCTAATAA 


ATAAGCAATT 


CTATGTGTTA 


AAACACGTGT 


CTTCCCTGAA 


960 


CCAGCACCTG 


CCATAANTAN 


CAATGGTCCT 


TCTGTTGTCT 


TTACAGCTTC 


ACNTTGCTCT 


1020 


GTATTCATAT 


GATTTAATAA 


CGCATTCATT 


TACTAGACTC 


CTTTATTTTT 


NCTGNNTNTA 


1080 


NAGCTTTNTT 


AATATCTTAA 


TAAATAATAT 


CNCCGCCGAT 


AATTGTATCT 


CCAATAGCTG 


1140 


CCATCTCTGT 


CGCTTGTTGT 


TCTGAGGAAA 


TACCGCCACC 


ATAAAAAAGT 


TGCGTTTCTG 


1200 


TTAGATGGTC 


TGAGACAGCT 


TGAACCTTTG 


AGACGTCGCC 


ATAAATGCCA 


CTATACTCTA 


1260 


TATACATAAC 


CGGTAATCGA 


CACATATGAT 


TGACCAGTTG 


GGCATATGCT 


TCTAAATCTN 


1320 


CTGTTGTTAA 


ATCTGTATTT 


GCTTTGGTAT 


GTNTNGNCAC 


TGNGCTATCA 


GCATTGCACA 


1380 
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CGACATACCC NACANATATT ACTTCTACAA AATCTATACT ATGTCCATAN GTTTTAAGCG 1440 

CTTCTAATAA AGTCTCGTNC CANTGTGAAA TGCAACATCT GTACTGCACA GNACTGNAGG 1500 

TACAAAATAA AANTCAANAC CAGGCATTAC ACTNGCGATA TGTGATATNA CAAGTACTAA 1560 

AGGCAATGGG TATCTTCTTA TTCTGCTCAT TAAATGAATG ACATTATCTT CAGTAACGNC 1620 

ATCAGNTCCA CCANTCATAA TTGNATCTGT GTGGAGACAT ACAAACTCGC ATCTAAATCA 1680 

ACATCTTGAA ATATGTNTAG CTGGGGTCTT AATTTAAAAA TATGGCGCCA TTTTTTAATG 1740 

TCATACATTN NCCGAAACTC CTTTTNTAAC ATACATATGA TTATAGCATT TNNAAGTGTC 1800 

GAGTTCTAAA GACTTAATAT GAATT 1825 

(2) INFORMATION FOR SEQ ID NO: 830: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 410 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830: 



GAATTCGCAT 


CATTTTTATA 


TTGTCTTACG 


TATAATTTGT 


TTTTAATTTT 


AACCAAAGAT 


60 


AGAAAGAGGG 


TTGTTTATGA 


AAATAGCAAT 


TGTAGGATCA 


GGAAATGGCG 


CAGTTACGGC 


120 


AGCAGTAGAT 


ATGGTGAGCA 


AAGGCCACGA 


TGTTAAATTA 


TATTGTCGTA 


ATCAATCTAT 


180 


AAGTAAGTTT 


CAAAACGCAA 


TCGAAAAGGG 


CGGATTTGTA 


GCGCGTAAAA 


TGACATTACT 


240 


TAAAGATTGC 


GGAATATCCT 


TACGTACATC 


TGTTGTCACA 


TTTGGCACAG 


AATCCTGAAT 


300 


ATGTTTAATC 


GCAATGCTAA 


CTGCAGNTTC 


TCCATTAAAG 


GGTGGTTCAC 


CAACAAGCAT 


360 


TTCATATAAC 


ACAAATACCN 


ATAGAATAAA 


TATCTTGTAC 


ATTCAACCCG 




410 



(2) INFORMATION FOR SEQ ID NO: 831: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831: 

GCATCATTTT TATATTGTCT TACGTATAAT TTGTTTTTAA TTTTAACCAA AGATAGAAAG 60 
AGGGTTGTTT ATGAAAATAG CAATTGTAGG ATCAGGAAAT GGCGCAGTTA CGGCAGCAGT 120 
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AGATATGGTG AGCAAAGGCC ACGATGTTAA ATTATATTGT CGTAATCAAT CTATAAG r CAA 180 

GTTTCAAAAC GCAATCGAAA AGGGCGGATT TGATTTTAAT AATGAAGGTG ATGAACGTTT 240 

CGTAAAATTC ACTGATATTA GTGATGATAT GGAATATGTT TTAAAAGATG CTGAAATTGN 300 

TCAAGTGATT ATTCCATCTT CATACATAGA GTATTATGCT GATGGTAATG GCAGAGCATG 360 

TAACTGATAA TCAGTTGATA TTCCTTCAAC ATGGNTGCAG CAATGGGGTC AATTCCGNTT 420 

TATGAATTGT TTTAGAAGAT AGACATATTG GAACAACACC ACAACTAGCG GGAAGCTAAT 480 

ACGGTGACGT ATGGTACGCT CGTGCCGATT GTGAAATGCA GCAAGGTGAT TATCTCTTAA 540 

ATTGTACGCC G 551 

(2) INFORMATION FOR SEQ ID NO; 832: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832: 



AAAANCCGCC 


TCCTTCTCCT 


TCCTTCTCGG 


CTTCATGTTG 


TTTGCCAAGT 


GCACCCACCC 


60 


TGTCCNCTTG 


AGCACACGAA 


AGAGGGNGCC 


AATAATAGGA 


TANCTAACNC 


TGTTCTAGCA 


120 


CATTGCCCCT 


TCTTTTTAGG 


TANAAATGCA 


CAAGCTTTAA 


ATCCTTTTTT 


ACGTATAGGT 


180 


GTTTCAATAA 


AGCGATACGA 


AATTTCAGCC 


ATTAACGCTG 


TTAACACAAT 


TTCTATAATA 


240 


TAAACGTATA 


CCGGTATTTG 


TCCTTGTNCG 


TAATAACTGT 


TCACAAAAAC 


AATGATAGGA 


300 


TAATGCCATA 


AATATAAGCT 


ATATGATCGT 


TTACCTATAA 


TTAGTAAAGG 


GTTCATACTT 


360 


AAAAATTTAG 


CANATAAACT 


AGAAGGATGT 


NCCGCAATTG 


CANTAATGAA 


TAAAGTTGCA 


420 


AATGATATAA 


TNTAAAATCC 


TCCATTATAG 


ATCCATTGAT 


CTTGGNCGNC 


AACTATAAAC 


480 


GNNCAAAGGT 


CANNAGANCC 


CGNAAAACCC 


AGATATCCCC 


NNNCCCCCCT 


CTNATCNATA 


540 


CCGCCAANCC 


CCCNC 










555 



(2) INFORMATION FOR SEQ ID NO: 833: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833: 
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GGCACGAGAC 


TTTGAAAGCT 


ACTGCAACAA 


TTATTCCAAG 


CTCAATTAAC 


ACTAAAAACG 


60 


AAGCACGTGA 


TAACCACTTA 


AAATCTGGTG 


ATTTCTNTGG 


GACGAGACTG 


ATGAATGTGA 


120 


TAAAATCACG 


TGTGAAACAA 


AATCAGTTAC 


TGAAAACAAA 


GNTGTTGGTG 


ATTTAACAAT 


180 


TAAAGGTATC 


ACTAACGAAG 


AAACATTCGA 


CGNTGGAATT 


CAACGGAGTA 


AGTAAGAATC 


240 


CTATGGATGG 


NTCTCAAGTA 


ACAGGTGTTA 


TTGTTACTGG 


NACANTCAAT 


AGAGAAAACT 


300 


ACGGCATTAN 


CTTTACCCAA 


GCACTTGAAA 


CTGGTGGCGT 


NATGCTAGGC 


AAAGATGGTT 


360 


AANTTCCGAN 


GCATCAGCTG 


AATTCTCAAT 


CTCAGAATAA 


TCTCATTACA 


CAATCCTCAT 


420 


TGTTTTAATA 


TATATATACG 


AAATGCCAAC 


TATATCATCC 


CTAGGTATAG 


TTGGCATTTT 


480 


TCGTTTAACT 


CATGTGTAAC 


AAGCACTAAC 


TGTTCACCGT 


AATTTAAGAC 


GCCTTTCATT 


540 


CAGGTAGGTC 


TTATTGAAAC 


TGAAAACTGT 


GATGACCTTG 


TGCAAAGCCA 


TTAACTGTAT 


600 


AAAGCATAGA 


TCCTCCGCCC 


ATTTCTATAT 


CATTGGAACA 


AATGATGAGT 


TGATGTGTTC 


660 


CAGGTATAAA 


TTGCGGATGA 


GTAGAACGTA 


ACATATGCCC 


TTCATCTCGG 


NCTGGTATCA 


720 


ATATNTGTCC 


TATTGGATAA 


CCCCTCTTAT 


TAACAAACTA 


ACACTCGACC 


CTGACCANAC 


780 


ATTGGTACGT 


ATAAATTATC 


GNCACTATCA 


ATACAACATG 


AGTCTGGTCC 


TTCATGAACC 


840 


TGTAAAATAG 


NACGGNATAG 


TAGCTCCAAA 


TGGGTGTGTC 


GCACACCATC 


ATCTTCAAGT 


900 


GCAATGCGAN 


GTNATCGACN 


GG 








922 



(2) INFORMATION FOR SEQ ID NO: 834 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834: 



AATACCCATC 


GTGATTACCT 


ATCATTAAAA 


CCNTCCTNGG 


ATANACNTTG 


ATATGCTACT 


60 


TGCTTCTTTC 


CCACCCATTG 


CTTAATTCAT 


TAGAAAACCC 


CTTTGCTATC 


GCTGTGTTGA 


120 


TTTTCTTATT 


NTTAATTCTT 


GTATCTATAA 


TGACGAGTTC 


AATAACATCC 


TGTGCTTTGT 


180 


TTTTCAATAT 


ATNTAAAATT 


GCTGAACCAG 


CCTGTTGACC 


TAACATTCGA 


GGCTTGATGT 


240 


CAATACAGGN 


TTGAGGTGGT 


GACGCAATTT 


CGGNTAAATA 


AGAATCATTG 


AACGTTGCTG 


300 


TCATTACATC 


TTTCGGAATT 


TCAATATTAA 


GTTCATATAG 


GACACTTAAA 


ATCGCTAAAT 


360 


GTAACATAGC 


ATCTAACGAA 


ATGATTGCCT 


GTTTAATATT 


TGGGTCCTTC 


AAACGCGTAT 


420 


GTAGATTTTG 


CATGTAATTT 


AAAATAACTT 


CTCTTTCATT 


ACTAGTCTCA 


ATAATTTGAT 


480 


AATCCAGATT 


AAATGGGGAG 


GCAACCGTTT 


CAAATCCTGG 


AATTCTATCT 


TTTGAAACTT 


540 


CAAAATTTCC 


TTTTTCTGTA 


ATAAATATTA 


ATTCATCTAC 


ACCTTGTTCA 


ATAACATGTC 


600 


GTGTCAAATT 


TNCAGAAGCT 


AATATATTAT 


CATTATCTAT 


GTGTGTAAAT 


TGATGATCTA 


660 


TATCCGATGT 


AGGCTTACCA 


ATCACAATAA 


ATGGCATGCT 


TTCATCAATT 


AACATTTGGT 


720 


TAATTGGATN 


CANTTTCTTT 


TGAATAAAGC 


AGGATAAACG 


CATCAACCAT 


NCGTTGTTTA 


780 
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ATCAATTTTA TAANCTTCCA ACCCAT .806 
(2) INFORMATION FOR SEQ ID NO: 83 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835 : 



GAATTCTGGA 


TCAACCANAT 


TCCATTCATC 


TTGNATTGCA 


TCAACAAATT 


CCCATGTTGA 


60 


TTNTAANTCT 


CCCCNGTGCG 


TGAAGTTAGT 


GGNATCANCN 


NCACGACATT 


CGANTNNNNG 


120 


ATTTTCATNT 


GCATCTNCAG 


NTATTCANTC 


ATATCTTGAG 


CGGCTCANTG 


ACTTTGACAA 


180 


TCGATCAGGT 


NCAGCCATCG 


NNNGCCNAGC 


GTATTNNTCT 


TNAGTAANTA 


GATGTAAAGA 


240 


TACACCATCA 


TNAGGTNGGN 


TGATTGTTTG 


ACTAANTAGG 


TGTGGTNTGC 


TNTTTGTTTN 


300 


ATCAGNTANC 


ATTGATATTC 


NTTCATTGNT 


AGCTCCTTCT 


ATTTCAACCC 


ATAATTAGTT 


360 


TNGGGTNCTG 


CGNCTTGTCN 


NTCCNTTTNT 


CAGTCTGGAC 


TGTTGANTNN 


NTTCAATNNA 


420 


CCCTCCTTAA 


CGTNACAATC 


CGTTNNCTTA 


ACCCGCNATA 


AA 




462 



(2) INFORMATION FOR SEQ ID NO: 836 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 : 



GAACAANTGG 


TAAAACCACG 


ACTTCNAACT 


TAATTGGACN 


TACTTTAAAA 


GCAAATAATA 


60 


TNCNAATTAT 


ACNCAATAAT 


GAAGGTGCTA 


ATATGGCTGC 


AGGTATAACT 


TCTGCATNCA 


120 


TCATGCAATC 


AACACCTAAG 


ACTAAAATTG 


CGGTAATCGA 


AATTGATGAA 


GGTTCGANTC 


180 


CACGTGTGTT 


AANAGAAGTT 


ACACCTNCAN 


TGATGGTATT 


TACTAATTTC 


TTNAGAGATC 


240 


AAATGGATCG 


CTTCGGTGAA 


ATTGATATTA 


TGGTTAATNA 


CATNGNAGAG 


ACAATTAGTA 


300 


ATANAGGCAT 


CAANTTATTG 


CTAAATGCTG 


ATGATCCATG 


TGNGAGTCGG 


GTGAAAATCG 


360 


CAAGTGANAC 


GATTGTGTAC 


TATGGTATGA 


AAGCACATGC 


CCATGAATTT 


GNACAAATGT 


420 


ACGATGAATG 


AAAGTAGATA 


TTGNCCAANC 


TGTGGTCGCT 


TATTGCAATA 


CGATTATATT 


480 
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CATTATAATC 


AAATNGGTCA 


TTATCACTGT 


CAGNGTGGTT 


TCANACGAGA GCAAGCAA&A 


540 


TATGAAATAT 


CANGTTTTGA 


TGTGGCACCG 


TTTCTATATC 


TANATATCAA 


TGATGAAAAN 


600 


TNTGATATGA 


AAATTGCAGG 


TGACTTGAAC 


GCTTATAACG 


CGTTAGNAGC 


ATATACTGTT 


660 


TNAAGAGAGC 


TAGGGTTAAA 


TGAACAANCA 


AATTAANAAT 


GGCTTTGAAT 


ACGTATACAT 


720 


CAGACAATGG 


TCGCTATGCA 


GTACTTTTAA 


NANAGAACGA 


AAAGAAGCGA 


TGACTCAANT 


780 


TTAGCTAAAA 


ATCCTGCAGG 


AATGAATGCA 


NAGTCTATCA 


AGTGGGTGAA 


CAATTAGAAG 


840 


GCGAAAAAGT 


GTATGTTATT 


TCGCTAAATG 


ATAACGGTGC 


AGATGGTCGA 


GATACTTCAT 


900 


GGATTTATGA 


TGCAGATTTT 


GGAAAAATTA 


TCTAAGCAAC 


AAATTGAAGC 


TATCATCGTG 


960 


ACAGGTACAC 


GAGCAGAAGA 


ACTTCAATTG 


CGATTGAAGT 


TAGCAGAGGT 


TGAAGTACCA 


1020 


ATTATTGTTG 


AGCGTGATAT 


TTATAAAGCA 


ACGGCAAAGA 


CTATGGATTA 


TAAGGTTTCA 


1080 


CAGTTGCAAT 


ACCAAACTAT 


ACATCAATTA 


GCGCCTATGC 


TTGACAATTA 


AACCGTCCGT 


1140 


TTGAAGGAGG 


GCAATCATAA 


TATGCATGAA 


TTGACTATTT 


ATCATTTNAT 


GTCAGATAAA 


1200 


TTGAATTTAT 


NCAGTGATAT 


AGGAAATATT 


ATTGCTTTAA 


GACAAC 




1246 



(2) INFORMATION FOR SEQ ID NO: 837: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837: 



AAAGATAAAA 


GGAGGAAATA 


GACATGACAG 


ATGGTCCAAT 


TAAAGTAAAT 


AGCGAAATTG 


60 


GAGCTTTAAA 


AACTGTGTTA 


CTTAAGCGTC 


CTGGAAAAGA 


ATTAGAAAAT 


TTAGTACCTG 


120 


ATTATTTAGA 


TGGATTACTA 


TTTGATGATA 


TTCCATATTT 


AGAAGTAGCT 


CAAAAAGAGC 


180 


ATGACCATTT 


TGCGCAGGTG 


CTAAGAGAAG 


AGGGTGTTGA 


AGTACTTTAC 


CTTGAGAAGT 


240 


TAGCAGCTGA 


AAGTATTGAA 


AATCCTCAAG 


TAAGAAGTGA 


ATTTATTGAT 


GATGTATTAG 


300 


CAGAGTCTAA 


AAAAACAATA 


TTAGGTCATG 


AAGAAGAAAT 


TAAGACATTA 


TTTGCGACAC 


360 


TTTCTTAATC 


AAAGAACTTG 


TAGGATAAAA 


TAATGTCAGG 


GGGTACCGTA 


AGGAAGAAAT 


420 


TAATCCCGAA 


ATGTACACAT 


CTNGTGGGAG 


TACAATGATG 


GATAAGTTTC 


CATTCTTATC 


480 


TAGGATCCCA 


ATGNCAAAAC 


CNTTTATTTT 


ACTTAGAGGA 


ATCCACAAAG 


CCTCAATTAG 


540 


GNNACCGGAT 


TTACCAATCA 


ATCCNGNTTG 


TTCTTGGAGA 


AGCACG 




586 



(2) INFORMATION FOR SEQ ID NO: 838: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

748 



WO 97/30070 



PCT/US97/02318 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838: 



CAAGAAGTCC 


ATTTTCAAGT 


CCNNACCATT 


AAGCGCAAAG 


ACNAGTTCNA 


AGATAGTTGC 


60 


CCAATCCCAA 


TNATNGANNA 


AGTCAAGAAA 


CCCNACAATA 


ATTTATTANC 


CAAGNAGGGT 


120 


AAAAGNAAAA 


GATTACCACC 


NNNAAAATAC 


GGTTGAAGAA 


GCTTTTTGAA 


GTCGTAATTG 


180 


GAAGCAATTA 


NAGGGTGACC 


ACAAAAGCAT 


TTGTAAGTGA 


AGTAGCAGAT 


GAACTTTATC 


240 


ATTTATTTGT 


CTTGATGCAT 


GCGCTAGNCG 


TCGATTTTTC 


AAGAAATTGA 


GGCGGAATTA 


300 


GCGCGTAGAC 


ATCATAAGCG 


CAATAACTTT 


AAAGGTGNAC 


GACACAATAT 


CGANCAGTGG 


360 


TAAANCAAGT 


ATGGACTAAG 


ATATAAGGAA 


AAGGATCATG 


GCTTATACAC 


TTACANATAT 


420 


TGTGGNTGAC 


GTGACATTTT 


CAAGNTTAAA 


ATACGACACC 


AACATATTTT 


AACTATGNAT 


480 


GCTGTGATGG 


TACTAAAGTT 


GCGAACTCGT 


TATAGATAAG 


TAGTGGATAA 


TCACAATACG 


540 


AACCCACAAA 


TAATTATGAA 


AAGTAAATTG 


AGCAACTCAG 


GAATAGATGT 


CACTGTTAAA 


600 


GATGTCGAAA 


AGTATATGAA 


TCGATATAAT 


GAAGTTATGA 


TGGGAAAAAN 


TGGCGAAAAA 


660 


GCTAAAGAGT 


TATGTTTGTC 


GTTACTACCN 


TATTAATNTC 


ATAGCTGTCT 


TTACATTCTT 


720 


TGNATGTATA 


CTATAANTAC 


AAATATATCT 


AGCCTGAAAT 


AGAAATGTCA 


NAGCCTATTT 


780 


AAAAGACAAT 


CTCCATCAGG 


AACTAGATCG 


TGCATCCCGA 


ANGTNAGACT 


ANANAACTNA 


840 


CTNNATGGGG 


NGGTNTTCNT 


ATGCTTAATC 


ANCATNAATT 


CGAGGATTAC 


CGNTNANAGG 


900 


T 












901 



(2) INFORMATION FOR SEQ ID NO: 839: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 9: 



CTAGCATATA 


TGCAACATGA 


TGATCATGAT 


CCACATGCAT 


TTCAAGAGCT 


ATTACCTGTA 


60 


TTGANAGAAC 


ACCAAGCAAG 


AGTGATGANT 


CACAGAATCC 


CNGGGCGACA 


TAATGATGAT 


120 


TCACCTACCA 


TTGNAAGTNG 


GTTTGTTAAT 


TNCNACAACA 


TTATTTAGGA 


AGACCAAGAT 


180 


TGGGAGGAGG 


TCCAACATNC 


TGNAAANCCN 


AAACAATTTA 


AAGTGACTTG 


GAATTAATAT 


240 


TTCACCANCC 


TACATTTATG 


TATGGCCCTA 


AAGTATCATT 


TTCCANATGG 


TGAAGTGACT 


300 


TTTATAAATC 


CTTTGATGCC 


CTCTGGATTA 


CCGATTCATG 


AGTGGTTGAT 


GTTAAAGCAA 


360 


TTTTCAAAAT 


ATAAAAGTGC 


GCCAAGTTTA 


CCAATTTTGC 


GTAGAGGACA 


GCATTATAAA 


420 


TTGCATTTTG 


ATTTTGATGC 


AACGCCTGCT 


GGTTCAGTGT 


ATTTTATCAT 


CATTTTTTAT 


480 
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AATAAAAATG 


GCACAAACTA AGTACTGAGA 


TTGTAAAGTC 


AAATAGTATT 


ACGATTCAAT 


540 


ATCCTGATGA 


AGCCTATGCT 


TATAAAATAA 


AAATGATGAA 


TGCAGCATCG 


ACGAGTCTAA 


600 


TATTTAGATG 


TTTAACAATT 


ACAGAGATGA 


CGCATCAAGA 


TGATTTAGAG 


TATAAAAGCA 


660 


TGCGGGTGAC 


GAAGATAGAC 


GATAACCAAT 


ATGGAAATGA 


TAGGATCAAT 


GTCATTATTG 


720 


CAGAACCATC 


TGACACCTAT 


CCAACTATTT 


CGAATGATTT 


TTTGAAACTG 


TTTGGTCATG 


780 


TTTGGCTCGT 


TGAACGTTGG 


ATGGACGACG 


ATATCAAGGA 


AAATATTAAG 


CAACTAAAGG 


840 


ATGATTTGCA AAGTCAAGAT 


ACCCTAACGG 


CTATTAATTT 


GATTAGTTAT 


GGGTCAAAGT 


900 


CAAATGTTTC 


TGCAACTTAT 


GTCGCTCAGC 


ATTTAGATTG 


CAAAGTATAT 


CGTACAAGCC 


960 


ATGAAGATGA 


TGATTTAAAA 


GAATGGCTTA 


CAGAACATGT 


CCCCAGGAAA 


TAATGAACTA 


1020 


AAAGATCAAA 


TTGTCGAGTT 


TCTATTTTAA 


AGAAGAGCAA 


GACAACCATC 


TAAATTATAT 


1080 


GTCACGTTTA 


ATGAATCCAG 


TCTCGTGCC 
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(2) INFORMATION FOR SEQ ID NO: 840: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840: 



CTGAAGCAAA 


ATCAGCATGA 


ATATGTGTTT 


CTGTAGCTTG 


TGTAATTGTT 


AAACCTTCAG 


60 


AATCTGCAAC 


TTCTATATAT 


TTCGATAAAT 


CACGAACAGG 


GTCTATTATT 


ATTGCCTCTC 


120 


CTGTACGTTG 


ACAACCCACT 


AAATATGATG 


CTTGAGATAA 


ATGATTATCG 


TAAAACTGTT 


180 


TAAAAAACAT 


GATAACAACT 


CCTTTTATTA 


ATAGATTTAA 


TTAGATAAAT 


AAATTATGAT 


240 


TCGCATTTTC 


AGTAGCACCA 


ATATAAGTGC 


CTACACCACC 


GTACTCAACT 


TCATCTCTAA 


300 


GTTCTTCTCT 


CTGAATACCC 


ATGACATCCA 


TACTCATCGN 


ACAAGCGATT 


AATTTGATAT 


360 


GTGGCTCAAT 


AGCTTGATTG 


ATAAGTGTTG 


GTAATGAATC 


AACATTTTTC 


TTTTTCATTA 


420 


CGTAGCGCAT 


CATCATATTN 


CCTAAACCAA 


ACATATTCAT 


TNGGGGAAAG 


AGGCATTCGT 


480 


ATTTTCTTTN 


GGGGGGNAAC 


ATTAAATCAA 


ACATTTGNGG 


NAATACCTTG 


CTTTTNTACA 


540 


ATAACTGGNT 


GGACTTTTTT 


TAATGCATTA 


AGCCCCCAAA 


AGGTAAGGAA 


GATAGTTACA 


600 


TCTTTTCCAG 


GAGGTCTTAG 


CGCCATTGGG 


AATACCAATG 


CTGTACAGGT 


TTGGCTTTAT 


660 


TCTCCACTAA 


TTAATACAAA 


TGGTCGGTCC 


C 
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(2) INFORMATION FOR SEQ ID NO: 841: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841: 



GAAGATACGT 


TGTCTAATGT 


TCTTGCACTC 


ANAATACGAG 


AAGAAAGAAA 


TTATTATCAT 


60 


T AATGATGG A 


AGTTCAGATA 


ATACAGCAGA 


NCTCATCTAT 


AAAATCANAG 


AAAATNATGA 


120 


CTTTATTTTC 


GTCGATTNAC 


AAGAAAACAG 


AGTGTAAAGC 


CAACGCACTC 


AATCAAGGCA 


180 


TTAAACAGGC 


TTCATATGAT 


AATGTAATGT 


GCTTGGATGC 


AGATACTATC 


GTTGATGCAA 


240 


GATGCACCAT 


ATTATATGAT 


TGAGAATTTC 


AAACATGAGC 


CCAAAACTTG 


GTGCAGTTAC 


300 


AGGTAATCCT 


AGAATTCGAA 


ATAAGAGTTC 


TATTTTAGGT 


AAAATTCAAA 


CGATAGAATA 


360 


TGCAAGTTTA 


ATTGGCTGTA 


TTAAGCGAAG 


TCAGACACTT 


GCTGGCGCAG 


TCAATACTAT 


420 


TTCGGGTGTC 


TTCACTCTAT 


TTAAAAAAAG 


TGCAGTTGTC 


GACGTTGGCT 


ACTGGGATAC 


480 


TGATATGATT 


ACCGAAGATA 


TTGCAGTTTC 


TTGGAAATTG 


CATTTACGTG 


GATATCGTAT 


540 


TAAGTATGAA 


CCGCTTGCCA 


TGTGTTGGAT 


GTTGGNTCCA 


GAAACATTGG 


GAGGTCTTTG 


600 


GAAGCAACGC 


GTGAGATGGG 


CTCAAAGGGG 


GGACACGAAG 


TATTACTACG 


AGACTTTTNT 


660 


TAGCACAANG 


ACGAAAACGA 


AAGGGTTTCC 


TTTATATATT 


GTGATTGTGT 


GAGCAAATCA 


720 


TCTCAATTTT 


ATGGGTATTA 


TATTAGGTGC 


TTCTATATTT 


AGGCTATTTG 


GCTCCAATAC 


780 


AAGAAACTTC 


CTTAGACTTA 


TACAATTATG 


ACAAT 
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(2) INFORMATION FOR SEQ ID NO: 842: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 880 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842: 



CCAAATCCTA 


AAATCGATGG 


AGAANCGGCT 


GAGAAAGATT 


GGGATTTCTC 


CAAGTTGAAG 


60 


AGGTTGAAGG 


TGAGNCACTT 


TTCCCTTTAA 


GTTGATAATA 


AAAAGTCAGT 


ACCGTAAGTA 


120 


ATATCTTTGA 


TTCAGCTAAC 


ATGACAAATC 


ATGATGGAGA 


CGGTGAAAAA 


GGCGAAGAGN 


180 


CTTTCTTAAA 


GAAAATTTTA 


GGCGAAGAAT 


ATACTGGAAA 


CGTGNCAGAG 


GGTAACGGAA 


240 


GAAACTTTGT 


AACACAACCG 


GCTTCATCGG 


ANACTGCGGT 


AAAGTCGGTT 


AATATACCAG 


300 


ATAGCATTAA 


AACACTTAAA 


GTTGGCGACA 


CAATACGATT 


TAAATGTTGT 


AGTAGAGCCA 


360 


TCTAATCANA 


GTAAGTTATT 


GNAATACACA 


ACAAGATCAN 


ACGAATATTG 


TATCAATCAA 


420 


TAGTGATGGT 


CAAGTTACTG 


CGGAAGCACA 


AGGCATTGCT 


ACGGTTAAAG 


CAACAGTTGG 


480 


TAATATGAGT 


GACACTATAA 


CAATAAATGT 


AGAAGCATAA 


GAGGGGGCAA 


CCCCTCTATT 


540 


TTATTTGTAA 


ATAAGGAGAG 


TATTATAAAA TGGCANAATT 


AAAACGTAAC 


ATTATTCAAT 


600 
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TAGTAGAAGA CCCGAAAGCA AATGAAACTA AATCACATAC GTACTTAACA CCACACTTCA 660 

TTTCATTTGA AATTGTATAC GAGGCAATGG ATTTAATCGA TGATATTGAG GACGNAAATA 720 

GCACGATGAA ACCAAGAGAA ATCGCTGACA GATTGATGGA TATGGTTGTA AAAATTTACG 780 

ATAACCAATT CACAGTTAAA GACCTAAAAG AACGTATGCA TGCACCTGAT GGAATGAATG 840 

CACTTCGTGA ACAAGTGATT TTCATTACTC AAGGTCAGCG 880 



(2) INFORMATION FOR SEQ ID NO: 84 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843: 



GGCACGAGCT 


AACGTTAAAT 


GAATTTATGA 


TTGGCGAAGG 


ACATTATGAT 


GTCAGACTTA 


60 


AATTACATCA 


CGAAACAAGA 


AGCACACAAT 


GTATGTACCT TTAAGTGTCA ATGCGAATAA 


120 


ACAATATCGT 


TTTAACATTA 


TGTTAGAAGA 


TATTAAAGCG 


TATTTACCTA 


AAGAAAAAAT 


180 


NTGGGATGTT 


TTCTTAGAAG 


TCCAAATAGG 


TACGGAAGTA 


TTTGANGTGC 


GTGTTGGGTA 


240 


ATCAACGTAA 


TAAATATGCA 


TANTNCTGCA 


GGAAACAAGT 


GCATTAANTC 


ANNNACATAA 


300 


TGATTTNTAT 


TAGATTAACA 


ACCGTATTTT 


CACAAAAAAG 


ACTTNNANTA 


ACATCTCCGG 


360 


TATTCCTTNN 


CAGGTTATTT 


ACAATNAACG 


GGTTCANTCT 


CA 




402 



(2) INFORMATION FOR SEQ ID NO: 844: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844: 



CGGCTCTTCG TAATATTGAT AATGTGCAAT ATTTNAAGAA TAATCAATTT ATTGAAGAAG 60 

AAACCGTAGT GACCGTGAGC GAATATCGAA NCGGCTATTG ATAGAATACG TACTGAAATG 120 

GACCCGAATG AATATCGAAG NCGATATAAA TGGTAGACCT AAACATATTT ACAGTATTTA 180 

TCGGNAAATG ATGAAGCAGA AAAAACAATT TGATCAAATT TTTGATTTGT TGGCGATACG 240 

TGTTATTGTC AATTCTATTA ATGATTGTTA TGCGATACTT GGGTTGGTGC ATACGTTATG 300 
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GAAACCGATG CCAGGACGTT TTAAAGATTA TATTGCAATG CCTAAACAAA ATTTGTATCA 360 

GTCATTGCAT ACTACAGTAG TAGGTCCAAA TGGAGACCCG CTCGAAATCC AAATACGAAC 420 

GTTTGATATG CACGAAATTG CTGAGCATGG TGTTGCAGCA CACTGGGCTT ACAAAGAAGG 480 

TAAAAAAGTA AGTGAAAAAG ATCAAACTTA TCAAAATAAG TTAAATTGGT TAAAAGAATT 540 

AGCTGAAGCG GATCATACAT CGTCTGACGC TCAAGAATTT ATGGAAACCT TATAATATGA 600 

CTTACAGAGT GACAAAGTAT ACGCATTTAC CCCAGGGAGT GATGTTATTG AGTNGGCATA 660 

TGGTGCTGTG CCGATTGGAT TTTGGCTTAT GCGAATCACA GGGAANGTAG GTAATAAGAT 720 

GATTGGCGCC CAGGTGGAAT GGCAAAATTG TACCANATTG ACTTATNTTT TCACAAAACA 780 

GGCGGATATT GTTGGAAATA CCGTTCTAG 80 9 

(2) INFORMATION FOR SEQ ID NO: 845 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845: 



ATTGGGATTG 


TTTTCCTAAA 


GAAGCCCCAA 


ATAGGTNNGG 


GAAGTAATTT 


GGAAGTGCGT 


60 


TGTTGGTCAA 


TCCCCNCGTA 


ATAAAATATG 


CATAATCCCT 


GCAGAAACCA 


AGTGCATTAA 


120 


ATTCAATTTA 


AAATNATGAT 


TTTTATAGAT 


TAACCCCCGT 


ATTTCACCAN 


AAGACTTTAA 


180 


TAACATTTCG 


TAATACTTTA 


CAGCTATTAC 


ATTANCGGAT 


TCAATCTCAA 


TGAAGTTAAA 


240 


AGGTAAAAAC 


ANAATCATTT 


TAACTGGTCT 


GGATCGTGGT 


TATGTATTTG 


AAGAAGGTAT 


300 


GGCTAGTGTC 


GTGCTAAAAG 


ACGACATGAT 


TATGGGAATG 


TTAAGCCCAN 


ACGTCAGAAA 


360 


ACGAAGTGGA 


AATCGTACTT 


AGTAAAGATA 


TTAAGAAGCG 


AGACTTCANA 


AATATTGTTA 


420 


AGTTAAACAC 


TGCACATATG 


ACTTACTCGC 


GAAAATAAAT 


AATAAATGCC 


CTCAAATCTA 


480 


TGTGAACTAA 


CATAATTTGA 


GGGTTTTATT 


TTGCTATTTA 


TGAAATGATT 


ATGACATTTC 


540 


CCTGATTTTC 


ATTTTCATAT 


ACATTAAATT 


GTATACACTG 


AAAATGAGGG 




590 



(2) INFORMATION FOR SEQ ID NO: 846: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846: 



ATAAGTGANT 


ANNCCGNCTT 


AANACAATCT 


GATAGTAGTA 


TCCANATCTC 


CACCTATAAT 


60 


AAATGTTAAT 


GGAATCTTGA 


CTAAATAATT 


CTTACTCACA 


TAGCGTTTGN 


CNATATACGG 


120 


NNCGATTAAN 


CCACCANATC 


CAATTGGTCC 


TCCCNCTGAC 


ACCGACATAC 


CTGTAAGAAT 


180 


AATGNCTAAT 


AGNCCAATGA 


TAATTCTAAC 


TTTATTTATA 


TTTTGACCCA 


ATCCCTAANC 


240 


AATGTCATCN 


CCTAGTTCCA 


TAATCGTTAG 


TTGACGTCCA 


ATAAAAATCG 


TGACAATTAA 


300 


AGCNCCCAAT 


AATCCATGGT 


AAAATGGGTA 


AAATTTCATC 


CCACTTCNTA 


CTGGNTAANG 


360 


ATGCCGACAA 


GCCCCNANAA 


CATCNCTGGT 


NCGTGTGAAT 


TCTCACTCAA 


AATAATAATA 


420 


CCTTCAGTCA 


TACTACTAAA 


AAACANATGG 


NTGGCCATAC 


CAGCCAATGC 


TAATTTAATG 


480 


GGTGTCATAC 


CTTTCGTAGC 


ACCTGTNAGT 


GTATAAACAG 


NTAAACCACC 


TATAAATGCA 


540 


CCTATCACAC 


CTAAATATAA 


TGCATAATAT 


TCTAATGATG 


GANTGATAAT 


CGTCACAAAT 


600 


ACAATGTGAA 


ACGGATGCAC 


CCAAGAGCTT 


GACACCAAAA 


ATTTTAGGTG 


ATGCTAGCGG 


660 


GATTGCGTGT 


CATTGCTTGC 


ATTAACCCAA 


ACCAGATACC 


GCTAATGCAC 


CGCCAATAAA 


720 


TAACCCTGCA 


ATCATCCTTG 


GCATACGTAC 


ATTATGCAAT 


AAAAACGTTG 


CTTTCGTATC 


780 


TGTNTGACCA 


GTAACATAGT 


GANTGATATC 


GCTAAAATTA 


ATTTTCGAAG 


ANCCAATCGC 


840 


CATATTTAAA 


TATATACAAA 


TAAAAAGAAA 


GCACANACTC 


ACTATATACG 


TGAGTGTTGT 


900 


GCGTCTTTTT 


CTTTTATGAT 


CTATGGCAGA 


TTGGCTACTT 


ATTTCTTTTA 


TAGCCATTTA 


960 


CTTCCACCTT 


ACTTTTGTTC 


TTTTTTTGAT 


AATTCAACAA 


GTTCTTTAGC 


CATTTCTTCA 


1020 


GAAGAAATTA 


AGCCACGAGA 


TCTTGCCCAA 


ACATCACGGT 


CAACAATATC 


CACGCTCGTG 


1080 


CCTTTAACTG 


CATTCAACTT 


TTTCCATGTT 


GCATCTTCTT 


GTAACTTCTT 
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(2) INFORMATION FOR SEQ ID NO: 847: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847: 



TCATTGGGAA 


GATTAATTAA 


TCCTCAATTG 


GACTANCCAT 


TGGAGGTCAT 


TCAACCCAAT 


60 


TGGGCCTNCT 


TGAAAATAAT 


TCGCGCCAGC 


AACACGCNCG 


ATTGACNAAA 


GTATCATGAN 


120 


ACGGCACAAG 


TATTTACGCA 


TGAAAATCAA 


TATTGGCGTG 


AAGGTGATTG 


GATTGTACAA 


180 


CCCGAATTAG 


GTAAGACATT TCAAATATTA 


AGAGAACAAG 


GGTTTAATGC 


ATTTTATAAA 


240 


GGTGACATTG 


CGAAACAATT 


AGTCAATGTT 


GTTAAAGCAT 


GTGGTGGGAC 


AATCATTTTA 


300 


GAGGATCTAG 


CCAATTATGA 


CATTCAGATT 


AAAGCGCCAA 


TCAGTGCGAC 


ATTTAAAGAC 


360 


TATGACATTT 


ATTCAATGGG 


ACCATCTAGT 


TCTGGCGGTA 


TCACGGTCAT 


TCAAATATTG 


420 


AAGTTATTAG 


AACATGTCGA 


TTTACCATCT 


ATGGGTCCAA 


GATCTGTCGA 


TTACTTGCAT 


480 


CATTTGATAC 


AAGCGATGCA 


TTTAGCATAT 


AGTGATCGCG 


CGCAATACTT 


GGCGGATGAT 


540 
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AATTTTCATG AGGAATGCCG GTACAGTCAT TAATTGATGA CGATTATTTA AAAGCACGCA 600 
GTACGCTCAT TAATAGCAAT AAAGCAAATA TTGATAG 637 

(2) INFORMATION FOR SEQ ID NO: 848: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848: 



CAGCGTGATG 


TNTTGGANNT 


TTCATTGTGC 


GGANTCCTCC 


AAAGTCAATT 


GGATTTGATT 


60 


CCTCAGGGGA 


CTCAGTGTAT 


CCCCCCAACG 


TGANCNGNTT 


CACAANCCAC 


AGCAGTATGG 


120 


TAATCGATAT 


CAATAGCTGA 


TGAATCCGTT 


TCTTCTATTG 


TTTCAATATA 


CCCATCAACA 


180 


TAACCACCCT 


CACCAGCTAC 


GAGCTGTGTG 


GTAATCAATG 


TCAAGCGGTG 


TATTGTCTTG 


240 


ATTTTCTTCT 


GTTTCAATAA 


TTTGCTTGGC 


ATCGTACTGC 


CCGCTCATAG 


TTCCTGNTGC 


300 


AGTATCTTCT 


TTATATTCAN 


AATCATTATC 


TTGAATAATT 


TGTCCATTTT 


TACCATTGCC 


360 


GTCAGCTTTA 


TTGCTATAGA 


AAACTAAACC 


ATTATCCCNA 


GGNAAGCGAN 


ATCCCCCATA 


420 


ACATAGTAAG 


ATTTATATGC 


TCGTTCTGGA 


TACCCATATA 


ATTCAGTTCT 


AAAATTAACC 


480 


TGANCTGACC 


CTTGCAAATA 


TTCACCTGTA 


TAATGAATGA 


CATACGTTTT 


ATCCAACTTA 


540 


TCTAAATTCA 


ATGAGTAACT 


ACCATTGTCT 


TGCACACTCA 


ATTTTCCATT 


CATTTCCTTT 


600 


GTTACATCTT 


TGAATTTGTT 


AGTATCTGAT 


GTATTTGCAT 


AAACACTTTG 


TGGCAATTCA 


660 


TCTTTTTTCC 


CTAGATATTC 


ATATACTTTA 


ACAGTAGGTT 


GTCCACCAGC 


TAAATTACCA 


720 


CCTTCAGTCA 


ACGTCCCTGT 


TACTGATACA 


GTGTTTGACT 


GGTTTCCATT 


CATTGGCTTA 


780 


ATATAAGCTA 


TATGTGTAAA 


TTTG 








804 



(2) INFORMATION FOR SEQ ID NO: 849: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849: 
ACATATGCGA AGTCGTTAGC AGGTATAAAG TCCCAAGAAT GTATGTACCC AGTGACACAA 60 
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ATGCCGTTAT 


ACAGTACGCA 


GAGACCAGAT 


AAACTTGCGA 


TTCGACAATG 


ATGAATGCCG 


120 


AAGTCTCACA 


AACACTTTAA 


AGNTGAGGGA 


GATTTTTGAC 


ATTTAGTGTA 


AGCGTTGACA 


180 


AATAAAGCGT 


GTTGTTTTTG 


AATTAGATGC 


ATTTCACATT 


AGTATTCATA 


TTATTTTTAG 


240 


GAGGAATTTA 


TATGACATTT 


GAAAAAGAAK 


CGGTCTTAGA 


AACATTATTT 


CCTGAAGATG 


300 


TACTTAGTAT 


TGCTAAAGGT 


TTAACAGACG 


GTGAAGTCGA 


ATTTTTACAA 


CAAGTAGATT 


360 


CATTGCTAGA 


AAGTAAGTAC 


CGTGAAAATA 


TTAATCANCA 


TTGGATAGAC 


GCTACTGTAC 


420 


CCGAGGACTA 


TTTTAAAGAT 


CTGGGAGAAT 


TAAATTATTT 


TAACAATCCA 


TTACTTTACA 


480 


AGGATCGTCC 


AAACGCCAAA 


ATGCCTAGTC 


AACTATTTCA 


GTTTTTCATG 


TCTTACCTAC 


540 


TCGCGCGATT 


TGATATTTCC 


TTAGCTACCC 


TACTCGGTGT 


TCACCAAGGT 


TTAGGGCATA 


600 


ACACTTTCTA 


TTTCGGAGGT 


AGCAAAGAAC 


AAATTGCGAA ATATGTACCT 


AAATTACAGT 


660 


CACATGAACT 


GCGTACATGC 


TTTGCTTTAA 


CTGAACCAGA 


ACACGGTTCG 


GACGTTGCGG 


720 


GAGGTCTTGA 


AACAGTCNCT 


GAACGCCAAG 


GCGATACTTG 


GGTTATCAAT 


GGTGAAAAGA 


780 


AATGGATTGG 


TGGTGCACAT 


GTATCTGATG 


TCATTCCAGT 


ATTCGCAGTA 


AATAAAAAAA 


840 


CTGGCAAACC 


CCATTGCTTT 


GTAGTCAGAC 


CAGAACAAGA 


TGGCGTCGAT 


ATTGAAGTCA 


900 


ATTGATAATA 


AAATCGCACT 


TCGCATTGTT 


CCTAACGCAC 


TAATTAATTA 


CTAATGTCAA 


960 


GTAGATGAGC 


GGTCCGCTTA 


CAAACATACA 


GCTTTAAGAT 


ATTGCCAAAT 


TCCTTATCAA 


1020 


CGAGAGCAGG 


CGTTGCTATA 


TGGTACAGGT 


GGG 






1053 



(2) INFORMATION FOR SEQ ID NO: 850: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850: 



ATATTAGCAG 


CTGTGCTTGC 


TTGTATTTTA 


GCTGGTGTAC 


CCTATCAAAG 


CAATTATTGA 


60 


TAGTTTAACT 


ACATTTTCAG 


GAATAGAGCA 


TAGATTGCAA 


TATGTTGGTA 


CTAATAGAAC 


120 


TAATAAATAT 


TATAATGATT 


CCNAAGCAAC 


AAACACGCTA 


GCAACACAGT 


TTGCCTTAAA 


180 


TTCATTTAAT 


CAACCAATCA 


TTTGGTTATT 


GTGGTGGTAT 


TGGATCGAGG 


GAGATGAATT 


240 


TGACGAACTC 


ATTCCTTATA 


TGGAAAATGT 


TNCGCACGAT 


GGTTGTATTC 


GGACAAACGA 


300 


AAGCTNAGTT 


TGCTAAACTA 


GGTAATAGTC 


AAGGGAAATC 


GGTCANTGAA 


GCGAACAATG 


360 


TCGAAGACGC 


TGTTGATAAA 


GTACAAGATA 


TTATAGANCC 


AAATGATGTT 


GTATTATTGT 


420 


CACCTGCTTG 


TGCGAGTTGG 


GATCAATATA 


GTACTTTTGA 


AGAGCGTGGA 


GAGAAATTTA 


480 


TTGAAAGATT 


CCGTNCCCAT 


TTACCATCTT 


ATTAAAGGGT 


GTGAGTATTG 


ATGGATGATG 


540 


AAACGAAGAN 


CGATC AACAA 


GAATCAAATG 


AAGATAAAGA 


TGAATTAGAA 


TTATTTACGA 


600 


GGAATACATC 


TAAGAAAAGN 


CGGCAAAGGA 


GAAAGGTCAA 


AGGCTACACA 


TTTTTCTACN 


660 


TCAAAATAAA 


GATGATACAT 


CTCAACAAGC 


TGATTTTGAT 


GAAGAAATTT 


ACTTGAG 


717 
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(2) INFORMATION FOR SEQ ID NO: 851: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1063 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

£ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851: 



ACTTTACCCC 


TTCCCAACTC 


TGCTCAAGTT 


TACCCTAAAT 


GTCTTAATGT 


CGGCGTTAAT 


60 


GCTGCGGNAG 


AAGAATCCTT 


CTTTTAACTT 


GCGAACTTGT 


GNCGAATATG 


GTGTGTACGC 


120 


TCTTTGAACC 


CACCATTGCC 


GACAGAAGCT 


TGATTTACAT 


CTAACAATAA 


ATAGCCAATG 


180 


TCCTTCTTCA 


ATGATATTAA 


TTCACTTAAA 


TGTGAAGTTA 


AGTTACCGTC 


ATCTAATATT 


240 


GCATAATATT 


CCTCTCTAAG 


TTGNCGATAA 


CGTTTTTTAA 


AAGCTNCAAC 


ATCATACTCT 


300 


AAATATTGAT 


TTATTCCGTT 


TTGCGCATTT 


GAAAATGCAT 


CAATTAACTC 


TAAAGAACTT 


360 


NTAATTGAGC 


CTAATCCTAT 


TGGAATTTGG 


TAAGCAATAA 


ATTGCGGGCG 


AATAAGTTGC 


420 


CGCAAACTGG 


CTTCGCATCA 


TGCGTGATGA 


TTTGTGATTG 


TGGCACTACG 


ACTTGATTCA 


480 


ATGTGATNTG 


ATACGTTGCT 


GACCCGTTGA 


CCCCTAAAAA 


ATTGGTTTTT 


TCAACGAGAG . 


540 


TGATACCATC 


TTGATTGGCA 


CGTAGAATGA 


ACATGACAAA 


TTCATCTGAT 


GAATCATGTT 


600 


TCGAAATCGC 


ACCAAAATAA 


TGGTCTTCTT 


GAATATTAGG 


CTTACGAGCT 


GGCATACGTC 


660 


CACTGACAAC 


CAATTGTCCA 


TCAACATAAG 


TGTGTTCAAG 


GTTCANCTTT 


TCTAAATCAG 


720 


TAAATGACTT 


CATCGGATTA 


GACAATCCGG 


TAGCACCTAA 


TATTTCTCCA 


GATAACAATT 


780 


GCTGTTGTAA 


GTCATTATTT 


AAATGTGGCT 


GCGTGGCATT 


TTCTAAATAC 


GTTGAAAAAG 


840 


CTAATTGGCA 


CCATAAACAA 


AATCCTGTTG 


TCAAACAAGA 


CTGCGATACA 


GCTTCGATTA 


900 


CTTCAGCATT 


TTTTCTCAAT 


GCCGCCTCAC 


CGAAATAACC 


ATCTACAAAT AATTGTTGAA 


960 


TAAATGTTCT 


CGGATAATAC 


GTCCCTTCAT 


CAACTTCTAC 


TAAATGAGGA 


TCTTATTGGT 


1020 


CTTTGATAAG 


CGTTCTTAAG 


TCATCATCAT 


CTCTCCNTAT 


TCA 




1063 



(2) INFORMATION FOR SEQ ID NO: 852: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852: 
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AACCCCTATA 


ACTATTGCAG 


TTAAGCACCA 


TAGCTGTAAC 


GAAGGGATCC 


ACAAACAATT 


60 


GTTTCCACCA 


GTTATTAATG 


GTTCTGATCC 


TACTAGAGCC 


ATACGTTCCC 


ATACTCATAA 


120 


TAATGAGATT 


ACCAGCATGA 


GTATATATTG 


AAATTCCGAT 


TACAATACGA 


ATTAAATTGA 


180 


TTGATAAAAT 


CATATATGTT 


CCTATAAACA 


CTATAAATCC 


TATAACTAGT 


AATAATATTA 


240 


AATTCATGAT 


CGACCTCCGC 


TAAGCGACAA 


CATCACTGTG 


ACAATAACAC 


CAACAACTGA 


300 


GAATAAAATA 


CCTAATTCAC 


AAAGTGTTAT 


TGTACTTACN 


TGGAATTTCC 


CCTAAANTTG 


360 


GANNAACCCA 


ANTTNTTCAA 


TATTGAGCCC 


CAAATGGTTT 


TCCAAGAAAC 


ATAGGTATTA 


420 


TCGCAGTAAT 


AGATGATACC 


AATGCTCCAA 


TAATCACTAA 


AATTCTAAAA 


TCAATCGGTA 


480 


AACTTTCTAA 


AACCTCTTCA 


ACATTAAAAG 


CCAGAAACAT 


TAAAATAAAC 


GCTGAACTAA 


540 


ATATTAAACC 


ACCAATAAAG 


CCACCACCAG 


GATTATTATG 


ACCTGCGAAG 


AAGACATAGA 


600 


ATCCGAAAGT 


CAATAAAATA 


AATACAACAA 


GTTTCGTGAC 


CGNTCTTAAC 


ACGACATCAT 


660 


TCTCTTTCAT 


CTTGTCCCCT 


CCGATCTTGA 


TAATTTAATA 


ATGTGTAAAT 


ACCCTAGCCC 


720 


AGGTAATAAT 


TAACACTAAA 


TCCTNCAAAA 


TTAATGTATC 


TAATGCTCTA 


AAGGTCACCA 


780 


ANGTATCGGN 


ATTTACAATA 


TTTTTACCAC 


CT 






812 



(2) INFORMATION FOR SEQ ID NO: 853: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853: 



TCTACAATAA 


CGGAAGTTTT 


TCTTTTAATA 


TTGAAATTTC 


TTAAGGATAG 


GTCTATACTT 


60 


TATAAATCGT 


AATTATTACG 


ATTTATAGTC 


AAAAACAATA 


ACTTGAAATA 


GATCATTGAG 


120 


GGAGTGTTAA 


TATGCAACAT 


CATAAAGTGG 


CTATTATCGG 


TGCCGGTGCT 


GCAGGTATAG 


180 


GTATGGCCAT 


TACCTTAAAA 


GATTTCGGTA 


TAACAGATGT 


CATTATTTTA 


GAAAAAGGAA 


240 


CAGTAGGACA 


TTCATTTAAA 


CATTGGCCGA 


AATCGACCGT 


ACGATCACGC 


CATCATTTAC 


300 


GTCTAATGGA 


TTTGGCATGC 


CTGATATGAA 


TGCAATTTCC 


ATGGATACTT 


CACCAGCATT 


360 


TACATTTAAT 


GAAGAACATA 


TTTCCGGAGA 


AACATATTGC 


TGAATATTTA 


CAAGTGGGTG 


420 


CCAACCATTA 


CGAGCTGAAT 


ATCTTTGAAA 


ATACAGTTGT 


CACAAATATA 


TCTGCAGATG 


480 


ATGCATATTA 


TACGATTGCA 


ACGACAACAG 


AGACATATCA 


CGCTCGTGCC 




530 



(2) INFORMATION FOR SEQ ID NO:854: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 : 



CCCATTTTCA 


TTTTTTAATT 


CATAAACGAA 


TCAATAACCA 


CCTAATAACA 


AATCATATTA 


60 


TACACCTTTG 


TTCGCTATTT 


TTCTAAGGTT 


TAAAAAATAT 


TTTTAGGTAA 


GCCTAAAAAT 


120 


AGATGTAATA 


AAAACGCCTC 


CTCAGATATT 


TATATATCTA 


TGAAGACGTT 


TAAATACATT 


180 


ATAGATGGTC 


TGGTTCTGGG 


TGAACGTATA 


CTGAGGAAAT 


ACCTTTNTTG 


TGCAAATGAT 


240 


GTTCGACATT 


GTCACAAATT 


TGATGCGCTT 


CTACTAAGGA 


AAGGTTAGCA 


TCTACAACAA 


300 


TTGTGACATC 


AATAAACACA 


CTACTTCCAT 


GNGTAACGCC 


CTTTAATACT 


TTTAACTTCT 


360 


TGTACTTCAT 


CAACTTCTTA 


AAATATCATT 


GCGATACGCT 


TCTAATTCAA 


GTTTCATTGA 


420 


AACCCATCAC 


TCAACATAAA 


AATTGATTCC 


TTTAAAAATA 


CCAAAACCAG 


TATTAAACGA 


480 


TTAGTAAGCC 


TAGGTAATNG 


TTGCTAAAAT 


AATATCGGAC 


AAATTGGG 




528 



(2) INFORMATION FOR SEQ ID NO: 855: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855: 



CGAACATCAA 


TACCGCAAAA 


ACAATCAATG 


CAATAATTCT 


AAATAACCAT 


CGCACCTCTT 


60 


TAACATCGGT 


CAAATGACTA 


AAGGAAAATA 


GCGTTTCACT 


TGCAATAATA 


AGTATTAAAC 


120 


ATGCAGCTAA 


TACGATTAGA 


AAACTTTNCC 


AACTNTGTTT 


TAAACGTTCT 


GTTACATACC 


180 


TTTTAGTATG 


CGGTAAGTTT 


TTAAAATATA 


AAATAATATT 


ATCTAGCATC 


TTTACACCTA 


240 


CAATTCAATA 


TAAATCCATN 


TGTCTTCTCT 


CAAATTGGCA 


ATACANTATC 


CATTCCCAAC 


300 


TTTAGTAATC 


CAAATGTTAT 


TGTTAATATT 


AANCTTATCT 


NCNAATTCTT 


NNNNNATCTA 


360 


CAANTGGATA 


AACAANACCG 


NATATACTGG 


ATTATTNTCC 


ANCGANANGN 


NANTAATTGN 


420 


ATCGGNTGTT 


GTGTANACGG 


ATNCAACCGA 


ANCCGGTTAT 


CATCATCAGG 


AACTAAAGTT 


480 


TTATAACTGA 


ACCTATGCTT 


TTTATGTAAT 


TGACTGTTAA 


AGTATGCATA 


GTAATTACTA 


540 


TAATTTGATT 


TCATATATGG 


CGCTAATTTT 


TTAAAACTAT 


GCGTTTGATA 


TAAACCTGCC 


600 


GGGTTAAAAT 


ATTCTAATTT 


ATCTTTACTA 


ACATAGTAAG 


TCTTATTATC 


GGCGGTAACA 


660 


CGATAATTTT 


TACTATTTTC 


GCCAGTAATT 


GTAACTACAC 


TATATTTTGG 


AATTGTGACT 


720 


TGTTTATCCC 


CTTGCATATC 


CACACTGCCA 


TTAAAAGCAA 


TTCCATAAGC 


TAACTTTTCG 


780 


CCATATGGAT 


CTTTATTTTC 


ATTTTGCATC 


GTAGCACGAT 


TCGTAAAATC 


TGTTATCGAA 


840 


ATAATTCCTA 


TATTATTGAA 


AATGAGCAAC 


ATCATCAATG 


CTAAGCCTAA 


AATAGTACCT 


900 
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CGAGCCTAAA GTCCAAAAAC GCACTAAAAA CTTAGGCGGT TTTGCATGAT TATAGCGCTC 960 
AACGCGGCAC GAGGTGCC 978 

(2) INFORMATION FOR SEQ ID NO:856: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856: 



GAATTCCTAC 


CTAAAACAGT 


TGTTATAATT 


ACTGTAATAA 


GTGCACTAAC 


CCCGAAAGAT 


60 


TGTATTGATT 


TATNCCATAA 


AACGATACCT 


GGTATGGTTG 


CAAATAATGC 


AAATATTAGT 


120 


CCCAAACCAG 


GCACTAGATA 


TAAAGCTAAN 


GCGACAAATA 


CCAACGTAAT 


CACTGCAATT 


180 


ATTGTTGCTT 


TAGGTTGTAT 


TTGTGAAAAC 


ACATAAGCCA 


CTCCCATATT 


TTTAGCTATA 


240 


GCTATTATGT 


TANCCTCTTT 


AATGGAAATT 


AACACTTTAT 


AGATTGTATG 


CTTCTATTTC 


300 


ATTTAATTGA 


ATAATAACTT 


NCATGTTTTA 


TAAGTAATTA 


ACATACTCAT 


TTGAATCGCT 


360 


TGTG 












364 



(2) INFORMATION FOR SEQ ID NO: 857; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857: 

CGGTCGCTTG TCTAAACCCT TCTTCTAATA TTTGGGTCAT ATTCGCATCG GCACCGCTCA 60 

TCGTAATCGC TTCATGTAAT GTTTGNTTGA AAGGTTTTAC TATGCTCGCT GATGGTGGAA 120 

TCCTTCCGAA TGTTTGNAGT AACATAAAAG CGATTAATGA AATTAAGCTC ATCGCTACTG 180 

TTGTTACGTA TAACATTCTT TCTTTCGACG TTCTTTCTTT GAGCAATTGA CCAATAATTA 240 

AACTTGCAAT TAAGACTAAT ATGATGGCAC TTAAAACGAA AGTATTACCT AAAACAGTTG 300 

TTATAATTAC TGTAATAAGT GCACTAACCC CGAAAGATTG TATTGATTTA TTCCATAAAA 360 

CGATACCTGG TATGGTTGCA AATAATGCAA ATATTAGTCC CAAACCAGGC ACTAGATATA 420 

AAGCTAAAGC GACAAATACC AACGTAATCG GTGCAATTAT TGTTGGTTTA GGGTTGGGTT 480 
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TTTGGAAACA CATAAGNCAC TCCCCATATT TTTAACTTAT TGGTTATTAN TTTAACCCT£ 540 

TTTTAATGGG AAAATTAACA AATTTATTAG GATTGGNTTG CTTCCTNTTT TCAAANTAAA 600 

TTTGAATTAA TTAACTTTCC AATGTTTTTA TTTAGGTAAA TTAACAATAC TCCATTGGAT 660 

TCGCTTTTGG GGTGCTTTCA ATTTTCAACA ATGAATTATT TAATCCCCAC TACATAGCAA 720 

ATCAAGCTTG GATTTAGATT TACAATTCAA TTCCCACTCT CAATGTACTC TGGATGTCTT 780 

TGGGATATGA TAAC 794 

(2) INFORMATION FOR SEQ ID NO: 858: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858: 



ANCGTATAAA 


CGAGACACAC 


TTCGCGAAAT 


GAAGTGGTGT 


AGACAGATGG 


TTATCACCTG 


60 


AGTACCAGTG 


TGACTAGTGC 


ACGTGATGAG 


TACGATGGAA 


CAGAACATGG 


CTTATAGAAA 


120 


TATCACTACT 


AGTTTAGCTC 


TCCTAGATGA 


TGGAGAGCTT 


TTTTCATGAA 


AAGAACACTT 


180 


AAAATTAACG 


CCTTGTCTTG 


ATATAATGAC 


ACTGCCTTGT 


TTTAAAATAG 


TAAGCGGATG 


240 


CGTTAATGTA 


TCAGCGATTA 


AATGTGTTGG 


AAATGTATAA 


AAAACACAAG 


CTAAGAATAA 


300 


AATACCTGTA 


TAAAAGGAGA 


ATCATATATG 


TCTCAATTAC 


TTGCAGTGTG 


CCCGATGGGA 


360 


TTAGAAGCTG 


GTGTTGCTAG 


GGAAATTCAA 


GAATTAGGCT 


ATGAAACAAA 


TTGTTGGAAA 


420 


ATGGGTCGTT 


TNTTTTGTGG 


AAGGAGGACG 


CAANGTGCAA 


TTGGTNAAGG 


CAAATTTATG 


480 


GGTTGGGCAC 


AGGAGGACCC 


ATCAAAATTG 


TGGGGTGGGC 


CGGTTTTAAC 


GCAACAAACG 


540 


GTGGACGGAT 


TTATCCGGAC 


AAACCCAAGG 


GCCCCCCTGG 


GAATCTTTTA 


TTGGTAAAGG 


600 


AGGTNCTCCC 


C 










611 



(2) INFORMATION FOR SEQ ID NO: 859: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859: 

761 



WO 97/30070 



PCT/US97/02318 



CGTTCTGTTT 


TTGAAAATCC 


GAGTATACAT 


ACGATAGGAG 


TTAAAAAAAT 


GATTGAACGC 


60 


TATTCTTGAG 


AAGAAATGTC 


TTATATTTGG 


ACTGATCAAA 


ATCGNTATGA 


AGGATGGTTA 


120 


GAAGTGGAAA 


TTTTAGCATG 


TGAAGGATGG 


AGTGAACTAG 


GACATATACC 


GAAAGCTGAC 


180 


GTACAAAAAA 


TTCGTCAAAA 


CGCAAAGGTA 


AATGTCGAAC 


GTGCACAAGA 


AATTGAACAA 


240 


GAAACGCGCC 


ATGATGTTGT 


AGCCTTTACT 


AGACAAGTNT 


CTGAAACGCT 


AGGTGAAGAA 


300 


CGTAAGTGGG 


TACATTATGG 


TTTAACTTCT 


ACTGATGTTG 


TAGATACAGC 


TTTAAGTTTC 


360 


GTTATTAAAC 


AAGCAAATGA 


CATTATTGAA 


AAAGATTTAG 


AAAGATTTAT 


CGACGTTTTA 


420 


GCTGAAAAAG 


CAAAAAATTA 


TAAATATACA 


TTGATGATGG 


GACGTACACA 


TGGAGTGCAT 


480 


GCAGAACCCA 


ACTACATTTG 


GGTGTCAAAA 


ATGGCATTGT 


GGTACACGGA 


AATGCAACNG 


540 


TAACTTACAA 


CCGTTTCCAA 


CAAGTAAAGA 


GAAGAAATTG 


ANGTAGGAAA 


ATGAGTGGGT 


600 


GCAAGTAGGG 


TACTTTTGCT 


AACAATCC 








628 



(2) INFORMATION FOR SEQ ID NO: 860: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860: 



ATTTAAATTC 


ATAAAACCNG 


GACCACCTTA 


TTTGTCATTA 


ACCCAAATAA 


TAACCAAAGA 


60 


CGTTATTATT 


TCCAATCTTT 


TTTAATANAA 


TAAATAGGAA 


TCATAGTATT 


GTCAATTTAA 


120 


AAGATAAAGT 


AAGGACNGTT 


TTTATTTTTC 


AAGATTTTTT 


AAATTATTAT 


GAATATCTAG 


180 


TTTTAGGAAG 


GAAATTACCA 


TTGTAAAAAC 


AAGTTATTAT 


TTCGGGCCTC 


ATGTTATTTT 


240 


CACTATTTTT 


TGGAGCCGGA 


AATTTAATAT 


TCCCGCCCAT 




ACAGCGGGTC 


300 


ACAATATGTG 


GANTGGTATG 


CTAGGCTTTG 


CCCTTACAGG 


CATATTACTC 


CCCTTTATTA 


360 


CTGTTATTGT 


TGTTGCATTT 


TATGATGAAG 


GTGTTGANAG 


TGTAGGCAAT 


CGTATACATC 


420 


CATGGTTCGG 


GTTTATTTTT 


GNTGTCGTGA 


TTTACATGTC 


TATCGGAGCA 


TTTTACGGTA 


480 


TTCCACGTGC 


TGCAAATGTC 


GCTCGTACCG 


AAATTCGGCA 


CAAGACACAC 


TTNACCTGTG 


540 


CANAACCAAT 


GGACTTTAAT 


TATANTCGCA 


GCAATCTTTT 


NNGGCCATCG 


TTTACTGGAT 


600 


TAGTTTAAAT 


CCATCGAAAA 


TCGGTGATAA 


TTTAGGTAAA 


TTATTAACAC 


CATNATTACT 


660 


ATTAATGGNC 


GCTCTATTAA 


GTATTGNTGN 


CATTNACACC 


CCTGAATCTG 


CACTAAGTGC 


720 


ACCTAAGGAT 


AAATATATAA 


CACATCCTNN 


CATTNCAGGA 


AGTGTGGAAG 


GCTATTTNAC 


780 


AATGGATCTT 


GNTGNTGCGT 


TAGCTTTCNC 


CGGAGNCATT 


GACAATGGTT 


ATAAGTTTAA 


840 


AGGCCTCACA 


GGATCGNATG 


AAAATTTNAA 


AATATTGTCT 


GCTTCACAGG 


TCTTTATTGG 


900 


AGGCCATATT 


ACTTGGGGAT 


TGATTTACTT 


TTGGNACTTG 


CAATACGGTN 


GGGGGCATCA 


960 


ACAGCTCCCA 


GGGAACTTTT 


AAAGGTGGNA 


CAAGATATAT 


TGACGTACAA 


CTCAATAACG 


1020 


AGTTTTTGGG 


CTCAAA 










1036 
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(2) INFORMATION FOR SEQ ID NO: 861: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:861: 



ATCAGTCCAA 


TTTGACATAG 


CAGCCATTTT 


TATTAATTTT 


CTTCTTAATA 


TTCTTCAACT 


60 


TGATAAAATC 


TATTTTTCAG 


TATATATAAC 


ACTAATGGTA 


TAAATAACAC 


AGCACTGATC 


120 


GTAAGTGCAT 


AAAATCCACC 


TTTATTTTCA 


AGTAATATAC 


CGCCTAAAAA 


AAGGGCTATA 


180 


GGTCTGAATG 


CAAAAACCAT 


TGTTTTAAAA 


ACACTATTAA 


CACGTCCTAA 


CATATTTTCT 


240 


GAACATTTCT 


TTTGGCTCAG 


TGTGATTATC 


GCTATATTTA 


AAATGGATTG 


TGTCATATAT 


300 


GTCAACCCTA 


AAAGTGGGAT 


GAGTATNAAA 


ATATAAGTAC 


TAAATGGGTA 


ACATTAATCT 


360 


TAAAAATTGN 


NTCTTAAAAA 


TAATATTAAT 


TACAATTAGG 


NNTCCCGAAA 


TTAATAATTT 


420 


CAAAGAGAAA 


ACTTGGAGGA 


TGATGAACCG 


NTTAGCGCAC 


TAATGATGAG 


ATACCTAAAA 


480 


CTAACCCCAA 


TGCTTGAGAT 


GATTGGATCA 


AGGTCTCCCT 


TACAAACATA 


ACTATTTTAA 


540 


TTTTATAGGG 


GGAGATC 










557 



(2) INFORMATION FOR SEQ ID NO: 862: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 457 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862: 



GGCACGAGAC CTTTAGATAA AGTTGAAGAT GAAACTGTCC AATGGTGTAA AGAGATTATT 60 

GAAACACTCA CCAACCACCT TTNCCGATCC CTNAAACGCA GCTATGNATG CTGACACAGA 120 

TGGTTTAGCT GGTTTACAAC ACATGNCTGG GGATCCAACA TTGCTTTATT ACACAACTGA 180 

TGAAGCGAAA GAAGCCCGTG ATGCGTTTAA AGAAAAACGT GATCCTGACT TCGATCAATT 240 

CCCTAAATTC CCATAAGTTA TTTTGAATAT GATATTAAGT CACTTGCGTC GTTTATTAGC 300 

GACGATAGGT GGCTTTTTTA TTTTATAAGA ATTTAAAAGA GAATTTTAAT TAATTATCAT 360 

AAAGATAAGA TGGNTTGAGT AAATTAATAG AGAAAGAATA AATGTTAAAG ATT ATT ATT A 420 

CAATGAAAAA ACGAGCTGGT GAGAAANTGA TTCTCGC 457 
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(2) INFORMATION FOR SEQ ID NO: 863: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863: 



ATTATTTTCA 


ATATGCACTA 


GCATTTCACG 


TTTAATTCCA 


GTACGTTGCT 


CTAATTCTGT 


60 


TAAAGTCATT 


CCTAACCTTT 


CACGTCTACC 


TTTTAGCGCT 


TCACCGACCG 


TTTTCAATTC 


120 


ATAGCCTCCT 


TACACTTACT 


CGAAGAAACC 


AAAACCACCA 


CCAAACGGAT 


CTCCAAAATC 


180 


TAAATTATTC 


TTACTTTGGA 


CTGTCTGTTT 


CTTCATTTCT 


TCGTAATGTA 


TTTCTTGATT 


240 


TTCATTTTCT 


CTCAATTCGA 


TAATATAGTC 


AAAATCTTCC 


ATTGTACAAG 


TACTTGTTTC 


300 


AACAAATAAA 


TCTGGATGGT 


TCAACGACTT 


TNGGATTGAT 


GGGTAACGGT 


CAATAACTTC 


360 


CGNTTAACAA 


GCTCTTGATG 


TTTGGGATTT 


AAGATTCTCT 


TGGCAAGGTA 


ACTTGCACCA 


420 


TCAAATAATG 


T 










431 



(2) INFORMATION FOR SEQ ID NO: 864: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864: 



CTTNATAAAN 


ANCNATCNTA 


TGCCAATTTG 


ACCATTGTTT 


GAATCCATAT 


ANAAACCGGC 


60 


NACGGTTCTT 


TTCAAATATA 


ATAGTAAGTG 


TATAATGAAA 


ATGTAAATAT 


TATTAAANAT 


120 


GGGGGTTCAC 


TCAATGAAAA 


TGAAACGTTT 


TATAGCTATT 


GTAATGGCAT 


TATTTTTAGT 


180 


ATTAGNTGGT 


TGCTCTAATT 


CTAACGATAA 


TAATGAAAGT 


AAAAAAGATG 


ACGCAGACAA 


240 


TGGTAAGAAA 


CAAGAGATTC 


AAGTTGCAGC 


GGCAGCAAGT 


TTAACAGATG 


TAACCAAGAA 


300 


ACGAGCTTCA 


GAATTTAAAA 


AAGAGCATAA 


AAATGCTGAT 


ATTAAATTTA 


ACTATGGTGG 


360 


ATCAGGGGCA 


TNAAGAAAAC 


AAATTGANTC 


AGGCGCACCN 


CTTGTTGACG 


TATTNATGNC 


420 


TNCCNAANTN 


CTAAAGATGT 


AGATGCATTN 


NNAAGACAAG 


GAATNNAGCG 


CATTTGATAT 


480 


CATATNAATA 


TGCGNNNNAT 


AGTCTAGTAT 


TAATTGGTGA 


TAAAAGATTC 


AAATTACACT 


540 
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TCAGTAAAAA GACTTAAAAG NCAATGATAA 
AGCAGGAAAA TATGCGAAAC AGTATTTAGA 
TAAAATCGTT TATGCTAAAG ATGTAAAACA 
GCGAAACAAG GTTTTGTGTA TAAAACTGAC 
GTAAAAGTAA TTAAAGAAGT AGAACTTAAG 
TCAGATAGTA AATTANCAAA AGAGTGGATG 
AAAGATACAC TTTGCAGCAT AAGA 

(2) INFORMATION FOR SEQ 



PCT/US97/02318 

ATTAGCATTA GGTGAAGTGA AAACTGTACC 600 

TAACAATAAC TTATTTAAAG AAGTCGAAAG 660 

AGTATTAAAT TATGTTTGAA AAGGGTTAAT 720 

TTATATAANC AANNCNNAAA AATTGATACT 780 

AAACCAATCA CATACGAAGC TGGTGCTACA 840 

GATTCTTAAA TCAGATAAGC TAAGAATATT 900 

924 

ID NO: 865 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 662 base pairs 
<B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 : 



CTATTCTTCT 


ATGTCTTTGA 


CAAGCGCGAA 


TATTTTTCGT 


CGTTGCTTGC 


CGGTAATTTG 


60 


AAACGGATCT 


ATGACGCTTA 


CATCGACTTC 


CACATCAAAT 


CCGTTATCAA 


GTAGTAATGT 


120 


TTCTTTATTG 


CCTAATTCAA 


CACCCGAGAT 


GACAACTGTT 


GNTGTACCGN 


CATCTTGAGT 


180 


GATATAACTA 


GTAATTATTG 


GCATCTAATC 


ATTCCAATCA 


GAACGGGAGG 


TCTGAAAAAT 


240 


CTTCTTCACT 


ATTGTCAAAC 


GGATTATTGC 


CAGTTTGAGC 


TTGTCCTTGT 


TGTTGATAAT 


300 


TGTTGTTTTG 


NTGTTGGTTG 


TTATTCTTCG 


GTTCTAAGAA 


TTGAACGCTG 


TCCGCTACTA 


360 


CTTCTGTCAC 


AAATACACGT 


CGCCCTTCTT 


TGTTATCGTA 


ACTGCGTGAT 


TGTAAACGTC 


420 


CATCAACGCC 


AGCCAATGAC 


CCTTTGGATA 


AATAATTATT 


T AC ATTTTC T 




480 


TAAAAGTTAC 


ACAGTTAATA 


AAGTCTGCCT 


CACGTTCTCC 


TTGAGCGTTA 


GTAAATGTTC 


540 


TGTTAACTGC 


GATAGTGAAA 


GTGGTNACAC 


TCACACCATN 


TGGGCGCTGT 


TCTATATTCT 


600 


GGATCTTTTT 


GGGTAAGCGT 


CCCACTTAAT 


ACTGTTCTCG 


TNNTAACATT 


ATTTGNTTTC 


660 


CC 












662 



(2) INFORMATION FOR SEQ ID NO: 866: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866: 



AAGAAGTGGT 


GAAATTTAGT 


CCCCANAACA 


TTAAGCCATT 


TGTATAATAA 


GTTAGTTAAT 


60 


TGCTTTTGAA 


GAAGAAGACA 


ATTTGATGAT 


GNTGTTGGTT 


GATTTTAAGA 


CTTTTAAGCC 


120 


AAAGAGCTGT 


TAAATCCATT 


TGCATCCACT 


TGAAGAATAT 


AATCGATTTT 


CCACAAGGGT 


180 


TATTTGAATG 


GATAGGCTAT 


AATACTAAAG 


TGTTTACGTA 


TCACAATGTT 


GAGAGACACA 


240 


AAAGGGAATC 


TAAGTGGTCC 


TTTAAAAAGT 


TATTTAATTA 


TGGTATTGAT 


GGATTGATTT 


300 


CCTTTAATAG 


TAAACCTTTG AGAATGATGA 


TTTATCTTGG 


CTTGTTTATC 


TTTTCAATAA 


360 


GCGTGCAATA 


TATTATCTAT 


TTATTCATCA 


ATATTATGAT 


ATCTGGTGTT 


AATATTCCAG 


420 


GATATTTTTC 


AACGATTGCA GCTATTTTAT 


TATTAGGCGG 


CATACAGTTA 


ATTTCAATTG 


480 


GTGTTGTAGG 


TGAATATATT 


GGCAGGATAT 


ATTATGAAGT 


TAAGGCACGT 


CCTAAATATA 


540 


TTATTCAAGC 


TACAAATCTT 


TCAAGTATTG 


AAAATGATGA 


GAAGGATACC 


CATAAAGTTT 


600 


ATTCTAAATA 


AACAAAAAAA 


GAAGCCCTCA 


TTAATGGGAG 


CTTCTTTTTA 


GTCTTTGCAT 


660 


TTTATTTTAT 


AAATAAATCG 


GATTNTGACG 


TAATGTCTAA 


TTTGTGTAAT 


GTTACAGTCA 


720 


TCGTAGTTCC 


TACATCTATA 


TCACTGCTTA 


CACTGATTTT 


TGCGTTATTT 


TGNTGCGCGG 


780 


CACGAGCTCG 


TGCC 










794 



(2) INFORMATION FOR SEQ ID NO: 867: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867 : 

AAGATATTGG AANCATTAAA AAATTTANTT ACCGGGTCCG AAAATGTCTG GTAAATCGAC 60 

ATATACTGAG ACAAGTTGCC ATAATTAGTA TAATGGCCCA AATGGGAGCT TATGTCCCTT 120 

GTAAAGAGGC AGTGTTACCT ATATTTGATC AAATATTCAC TAGAATAGGT GCGGCAGATG 180 

ATTTGGTTTC AGGTAAGAGT ACGTTTATGG TAGAAATGCT AGAAGCACAA AAGGCATTAA 240 

CTTATGCAAC AGAGGATAGT TTGATTATTT TCGATGAAAT TGGACGTGGT ACTTCAACGT 300 

ATGACGGTTT AGCTTTAGCG CAGGCAATGA TAGAGTATTG TAGCTGGAAA CATCGCATGC 360 

TAAAACGTTA TTTTCAACAC ATTATCATGA ATTGACAACA TTAGGATCAA GCATTACCAA 420 

GTCTTAAAAA ATTGTTCACG GTCGCTGGTA ATGAATATAA GGTGGAACTT ATTTTCTTGC 480 

ATAANGTCAA AGATGGTGCA AGGNGGACGA TAG 513 

(2) INFORMATION FOR SEQ ID NO:868: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 762 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868: 



CTGNAAATTC 


GATTTCATGT 


TTATGCTTCA 


GAAAACTTGT 


CACTAAGGCA 


GAAATCCCAA 


60 


ATATGACTGT 


CAATTCTAAA 


AGAAGATGAT 


CTTTAACCTC 


GTCAATATAC 


ATGTTAAACC 


120 


AATTATCAAA 


TGTTCCCCTA 


GGTGCTAAGT 


CATAAGTATT 


CTCACAAATG 


GCGTCAGATG 


180 


GAGATTTATT 


ATCAAATTCC 


GNAGTAGTAT 


AGATTTCACT 


TAACGATACA 


ATAGGACCAA 


240 


ACGGGGTTTC 


CAGTATACCT 


ACCCCTTCAT 


ATAAGTAGGA 


AATGGGTAAT 


TGATTGCGCA 


300 


TTTGTTGCAA 


CGCATAACCT 


AAATCTTNTG 


NGTNTTTTTC 


ATTAATACTA 


AATCCCATAC 


360 


TTCANTAAAG 


ATGGNAGGTT 


TTGGGGTTGG 


TTAAAATATT 


CACTAGGANT 


CAACAAATTA 


420 


CTTTNTGAGG 


CCCTCGNCTG 


TAATAATTAC 


TTTTTCAGTG 


TTAGTTTTAG 


GGTCAATAAA 


480 


TTTATTTTCG 


ATAACGATAG 


GACCTGCAAT 


TTCAACTTCA 


ATAGGCATTC 


CTCCTTTTTC 


540 


TTTGGGAGGC 


TTGTCTTTAT 


ACCAACCTTT 


TTTTGATTTG 


TATCGTGGTG 


AAGGATTAAA 


600 


TGAAGGGTTA 


GTTTGAGTCA 


TTAGCGAACA 


CCTCCTTTCG 


AAGGGTTGCT 


GTTATTATAT 


660 


GGATTTGGCC 


CTGTTTTAAG 


ATAAACCAAG 


TGTCCATGAG 


TATTTTTACC 


AATAATAACA 


720 


AATGGAACAC 


GTGGCGCATG 


TTTTACAAAG 


TACGCTCGTG 


CC 




762 



(2) INFORMATION FOR SEQ ID NO: 869: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869 : 



AAAAAACGGT 


TCGGATGTGC 


TTTTGTTTTA 


CCCTTCTTTT 


GTCTAAATCA 


ATNNCCCNAC 


60 


ACAAGATAAN 


ACACTTCTAC 


CTTCAGTTGG 


AACAACATGT 


CTTTGTGNCA 


ANCTAGTGAT 


120 


NANNCGCTCA 


ATTACCTCTG 


TTTNATTTAT 


TCCTAAAATG 


CCGTCATNNN 


AACCTGTCAT 


180 


ACCAACATCC 


GTAATATACC 


CTGTCCCCTT 


TGGTAAAATA 


CGTTCACTTC 


TGTTTGAATG 


240 


TGTGTATGCG 


TTCCAACAAC 


AGCGCTAGCT 


CTACCATCTA 


AATGCCATCC 


CATTGCATAC 


300 


TTTTCAGAAG 


TTGTTTCTGC 


ATGAAAATCA 


ACAAATATAA 


ACGGAGTTTG 


TTCTTGTGCT 


360 


TCCTTGACTA 


ATTGATGCTG 


CCTTTTTAAA 


AGGATCATCA 


ATATCTGGCA 


TAAACGCTCT 


420 


TCCTTGCAGA 


TTAATAACTG 


CAAGTTTAAT 


ATCATTAATT 


TGTATAAATC 


TCATACCAAT 


480 


TCCCGGCGCT 


TCATCCGGAA 


AATTCGCTGG 


GCTTACTAGT 


CGTTTTGCTT 


CATCTATAAA 


540 
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ATCATAAATT TCACGTNGAC CAGATGTGTG ATTACCCATA GTCATGAAAT CTACACCATT 600 
TCTTAGTAAT TGTTTATATA TTTTTTCAGT CACACCTTTA CCATGTGCTG CATTTTCAGC 660 
ATTTACAATT GTAACTGTTG GTTTATACTT TTGCTTCAGT TGAGGTATGT CTCGTGCC 718 

(2) INFORMATION FOR SEQ ID NO: 870: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870: 



CTGGTAAAAC 


AACATTAACA 


GCAGCAATCG 


CTACTGTATT 


AGCAAAAAAT 


GGTGACTCAG 


60 


TTGCACAATC 


ATATGACATG 


ATTGACAACG 


CTCCAGAAGA 


AAAAGAACGT 


GGTATCACAA 


120 


TCAATACTTC 


TCACATTGAG 


TACCANACTG 


ACAAACGTCA 


CTANGCTCAC 


GTCGNCTGCC 


180 


CAGNATCCCG 


CTGACTACGT 


TAAAAACATG 


ATCACTGGTG 


CTGCTCAAAT 


GGACGGCGGT 


240 


ATCTTAGTAG 


TATCTGCTGC 


TGACGGTCCA 


ATGCCACAAA 


CTCGTGAACA 


CATTCTTTTA 


300 


TCACGTAACG 


TTGGTGTACC 


AGCATTAGTA 


GTATTCTTAA 


ACAAAGTTGA 


CATGGTTGAC 


360 


GATGAAGAAT 


TATTAGAATT 


AGTAGAAATG 


GAAGTTCGTG 


ACTTATTAAG 


CGAATATGAC 


420 


TTCCCAGGTG 


ACGATGTACC 


TGTAATCGCT 


GGTTCAGCAT 


TAAAAGCTTT 


AGAAGGCGAT 


480 


GCTCAATACG 


AAGAAAAAAT 


CTTAGAATTA 


ATGGAAGCTG 


TAGATACTTA 


CATTCCACTC 


540 


CAGACGTGAT 


CCTGACAAAC 


CATCAAGATG 


CCAGTGAGAC 


GTATCTCAAC 


ACNGTCGGGG 


600 


TCTGTTGGTA 


CAGGCCCGTT 


GACGTGGGCA 


ATCAAGTGGT 


GAGGAGGTGG 


GATCAC 


656 



(2) INFORMATION FOR SEQ ID NO: 871: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871: 

AC C TT AAGTT TGAGTTCCCN AATAAAATAA AAAATAATAA ATTTGNAGAA TAAATATGAC 60 
ACAATTTACA AGCCTGATAA ATCTGTGTAA CGTCAATTAC CTGTAACATT TTATTTCATC 120 
AATTTTTACC GTTTGGCGTA CTTGAAAATT TTGATAAAAG GGGTTCCATT ATGATTAATG 180 
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AGCAAAGATT 


ACTTAATACG 


TTTTTAGAAC 


TTGTACAAAT 


CGATTCTGAG 


ACAGGGAATC 


240 


AATCAACAAT 


CCAACCTATT 


TTAAAAGAAA 


AATTCATAGC 


ATTAGGGTTA 


AGATGTTAAA 


300 


GAAGATGAAG 


CGCCTAAACA 


TCCTAAATTA 


GGTGCCAATA 


ATCTAGTGTG 


TACAATGAAT 


360 


AGTACTATCG 


AAGAAGGCGA 


AGTACCTAAA 


TTATATTTGA 


CGAGCCATAT 


GGATACTGTT 


420 


GTACCCGAAT 


TCGCAATTAA 


TGTAAAGCCA 


ATTGTAAAAG 


ATGNCGGCTA 


CATATACTCT 


480 


GATGGTACGA 


CTATTCTAGG 


TGCAGATGAT 


AAAGCAGGAT 


TAGCANCGAT 


GCTTGAAGTT 


540 


TTGCAAGTGA 


TAAAGGANCA 


ACAAATCCCA 


CATGGACAAA 


TTCAATTTGT 


GATTACTGTT 


600 


GGGGAAGAAT 


CAGGTTTAAT 


TGGTGCTAAA 


GAATTAAATT 


CAGAGTTGTT 


GGACGCTCGT 


660 


GACTTTGGTT 


ATGCTATTGA 


TGCAAGTGCT 


GATGTCGGTA 


CTACGGTTGT 


AGGTGCACCG 


720 


ACGCAAATGT 


TAATTTCAGC 


TAAAATTATT 


GGCAAAACAG 


CTCGTGCC 




768 



(2) INFORMATION FOR SEQ ID NO: 872: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872: 



CAAGGTATAA 


ATGAATCAGT 


TGTTGGTAAG 


GCACTTCTTA 


AATAGGCAAC 


AACGCGATGA 


60 


TATTTTCATT 


GGTACAAAAG 


TAGGCAATCG 


TTTAACAAAA 


GATGGCAGTA 


CAACATGGGA 


120 


TCCGAGTAAA 


TCCTATATTA 


AAGAGGCAGT 


TAAAGGTTCA 


CTAAAGCGTT 


TAGGTATCGA 


180 


TCATATCGAT 


TTATATCAAC 


TTCATGGCGG 


AACCATTGAT 


GACCCATTAG 


ACGAAACAAT 


240 


AAGCGCATGT 


GATGAATTGA 


AACAAGAAGG 


AATTATACGT 


GCTTACGGTA 


TTTCTTCTAT 


300 


TCGCCCAAAT 


GTAATTGATT 


ATTATTTAAA 


ACATAGTCAA 


ATCGAAACGA 


TAATGTCTCA 


360 


ATTCAATTTG 


ATTGATAATC 


GTCCAGAATC 


ATTATTAGAT 


GCAATTCACA 


ACAATGATGT 


420 


TAAAGTATTG 


GCAAGAGGAC 


CTGTGTCTAA 


AGGATTATTA 


ACTTCAAACA 


GTGGTTAATG 


480 


TGCTCGACAA 


TAAATTTAAA 


GATGGGATTT 


TTGG 






514 



(2) INFORMATION FOR SEQ ID NO: 873: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873: 



GGCACGAGCT 


CGTGCCGCGA 


ATTATACGTA 


AAGAGCAACC 


TGACGCTTTA 


CTTCCAACTT 


60 


TAGGGTGGTC 


AAACAG6TTT 


AAACATGGCG 


ATTCAACTAC 


ACGAAAGTGG 


TGTGCTTCAA 


120 


GATAATAACG 


TCCAATTATT 


AGGAACTGAG 


CTAACATCAA 


TTCAACAAGC 


AGAAGACCGT 


180 


GAAATGTTTA 


GAACATTAAT 


GAATGATTTA 


AACGTTCCTG 


TACCAGAGAG 


TGACATTGTA 


240 


AATACAGTAG 


AGCAAGCCTT 


TAAATTCAAA 


GAGCAAGTGG 


GATACCCGCT 


AATTGTTAGA 


300 


CCGGCATTTA 


CGATGGGTGG 


GTACCGGAGG 


CGGTATTTGG 


TCATAATGAT 


GAAGAATTAC 


360 


ATGAAATCCG 


TCTCAAATGG 


TCTTCCATTA 


TAGNTCCAGN 


AACGCAATGT 


TTATTAGAAA 


420 


ATCTTATCGC 


AGGGTTTTTA 


AAGAAATCGG 


AATTACGAAN 


GTAATGCGGT 


TGATAAAAAC 


480 


GATAATGCCA 


CCGTTGTATG 


TAACATGGAA 


ATATGGATCC 


AGTGGTATTC 


ATACTGCGAT 


540 


CAATGTGGTG 


GTCCTAGTCA 


AACATTATCA 


GATGTGATAT 


CAA 




583 



(2) INFORMATION FOR SEQ ID NO: 874: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874: 



CTGATAAGCA TGAGGATATA AGTGCACGAG 


NACNAAATAT 


GAAGATGAAT 


TTTTAAGTCA 


60 


AGATGAATTA 


AAATGGTTTA 


CTAAATCTAA 


TAGAACGCTA AAATCAAAAG 


AAGTTCAGAA 


120 


AATTGNGNCC 


CATAGAGCAA 


AAGGAATTAA 


AATGTATATT 


TGTGNACAGA 


AAAAAGATGA 


180 


TGATGGTATA 


TATTTNTATT 


ATTTAGGAAC 


TGCTGGATAT 


ATAGAAGGNT 


CAGAGAAGCA 


240 


AGATAAAATG 


CCGAATGGCT 


CAAACGTAGT 


AACGATGGAT 


CTAGCACTCG 


ATAAAGCTGT 


300 


AAGAGATGAT 


ATATATCGCT 


ATATAACAAA 


TTAATGGNAA 


CCCCGAAAAA 


ATGAATTGTT 


360 


AGTAAAGATG 


TATTTAAACA 


TCAATTGTAA 


TAAGTAAAGA 


CATTGAAGTT 


TCAGCCGTTA 


420 


TTATTTTAAA 


TATAAGCGCA 


ATTCTCTTAA 


TTAATAGTTG 


GCTAG 




465 



(2) INFORMATION FOR SEQ ID NO: 875: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875: 



ATGGGTGGAA 


CCACTAAGAT 


TCATTTCCTG 


GCGAATTTGC 


GGAAGCAAGA 


AGGTTTCAAA 


60 


ATAATACTAT 


GACAAAGATA 


GTGGTAANAA 


GCNTTGATCC 


TAGGTGTTAC 


CTGTCGGTTG 


120 


TTGATGGAAA 


AAAGAACACA 


TCGTATTTTC 


ATGATTGGGC 


AAAGAGGTAT 


TTTAGAAATA 


180 


CCTCACTCAA 


GAGGCGTTCC 


TTTTATCATG 


AGTGACACAG 


CTGGTAGAGA 


TAAACCTTTA 


240 


CCAATGAGGC 


CTGATAAACT 


TAAGAATCTT 


GGGATGTTAA 


CAGAGCCAGG 


TCTTTACTAT 


300 


TTATACACTG 


ATCATACAGC 


TCAAATCGAT 


GATTTCCCAT 


TNCCAAGAGA 


ATGGCGTGAT 


360 


GCAGCTTGGT 


TCTTGGAAGT 


TAAGCCACCA 


CAAACTGGCG 


GTGATGTAAT 


TCAGATATTG 


420 


ACGCGTAATA 


GTTAGGCAAG 


GAATATGATG 


ACTTTTGAAA 


GGGTCCTCTC 


TGGAAGAACT 


480 


GGAGACATTT 


CGGACTGGAA 


TTATGTGCCT 


ATAAATAGTG 


GTAAATGGGA 


GAGAGTCTCG 


540 


TGCC 












544 



(2> INFORMATION FOR SEQ ID NO: 876: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876: 



GGAAGCAAGG 


ANCCNAATCT 


TGATGCTGCC 


AAAATACCAA 


ACAAGTGATG 


TNNCCAACAA 


60 


GCTAAAAGAC 


AATGGTNTTT 


GCTGCAATTA 


ATCCAAGTGC 


AAGCTGCCAC 


CNACTAAGAA 


120 


ATCGGATGCT 


AAAGCGGAAA 


TCGCTCCAAA 


AANCAAGTGA 


ACGTAAAACT 


GCAATTGAAG 


180 


CAATGAATGA 


TTCGACTACT 


GAAGAACAAC 


AAGCTGCAAA 


AGATAAAGTT 


GATCAAGCAG 


240 


TAGTTACTGC 


AAACGCTGAT 


ATAGATAATG 


CTACAGCAAA 


TACAGATGTA 


GATAATGCAA 


300 


AAACTACTAA 


TGAAGCGACA 


AGAGCTGCAA 


TTACGCCAGA 


TGCAAATGTT 


AAACCAGCAG 


360 


CAAAACAAGC 


ANTTGCAGAT 


AAAGTACAAG 


CTCAAGAAAC 


AGCAATTGAT 


GCTAATAACG 


420 


GCTCAACAAC 


AGAAGAAAAA 


GAAGCTGCTA 


AACAACAAGT 


TCAATCTGAA 


AAAACGGCTG 


480 


CTGATGCAGC 


AATAGATGCA 


GCTCATTCAA 


ATGTGGAAGT 


TGAAGCGGCT 


AAAAATGCAG 


540 


AAATTGCTAA 


AATTGAAGCG 


ATTCAGCCAG 


CAACAACATC 


TAAAGATAAT 


GCGACACAAG 


600 


CAATTGCTAC 


GAAAGCGAAT 


GAACGTAAAA 


CAGCACTCGC 


TCAAACGCAA 


GATATTACTG 


660 


CTGAAGAAAT 


TGCAGCGGCT 


AATGCGGACG 


TAGATAATGC 


TGTGACACAA 


AGCAAATAGC 


720 


AACATTGAGG 


CTGCTAATAG 


TCAAAATGAT 


GTCGGACCAC 


GCGAAAACGA 


CAGGTGAAAC 


780 


TAGTATTGAT 


CAAGGTACAC 


CACAG 








805 



(2) INFORMATION FOR SEQ ID NO: 877: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877: 



TTGGTGCACC 


ACCTCAAAAC 


TTTGGTACCA 


TTGATGCATT 


GTACATTAAT 


TGCGCCAAGT 


60 


TTTTGTTAAT 


GCCACCAATG 


ATTCAGGTAA 


TAATTGTCCC 


AAAAGCAGCA GTTACAATTA 


120 


AATCTAC ATC 


TAATTGAANC 


AATTGTTCTA 


ATTCTTCTGA 


TCCACTTAAT 


TTTTCAGGCT 


180 


GATATACAGG 


TAAATCATAT 


TTCATTNCAA 


CTTTTTTAAC 


TGGTGGTGGT 


GTCATAACAC 


240 


GTTTACGTCC 


AACAGGTCGA 


TCTGGTTGCG 


TTACGACTGC 


AATGACATCA 


TGTTCTGCAA 


300 


TAAGCATTTC 


TAAAACAGTT 


GTTGAAAAGG 


CACGAGTACC 


CATAAATATT 


ATTTTAGTCA 


360 


TTTATAAAAT 


ATGCCTCCAC 


TTCTTTATCT 


GTTAAAATAC 


GGTCCGCACG 


TTCNGTAAAA 


420 


GGGAGACCGT 


TCATTTGATC 


TATAATATGC 


AAAATCATTC 


TTGCTACATC 


TTCATGTGCA 


480 


GTTAGTTCAA 


CTTTGTTCCC 


ATTGACGTCA 


TAACTTTCGA 


CAACTATCAT 


TTTACTTCTT 


540 


GTCACTTCGC 


CGTAAACATC 


TGGCAATGTA 


ATTGAACCTT 


CTAAGTCTGT 


TATTGTTTCA 


600 


TTTGATTGAC 


TAATAATTTT 


CGGATTAACA 


AGTTGGTAAT 


AATCCTTCCA 


TTTCCATATT 


660 


CAATAATTGG 


CACTTGGCAA 


TGACTTGATT 


AATTTGAGGG 


TGCACATAAG 


CCAGCAGCTT 


720 


CTTGGTGCAT 


ACATTGGTAT 


CTTCTTAAAT 


CTTGGTAATA 


ATCTTTTTTA 


CGAATCACAA 


780 


TTGTTTTAAC 


TTGCTTGCGC 


TTTTTCCGGT 


TAAATAGGAT 


GCGATGCTGG 


GTACTAACTT 


840 


TTTTAATCGC 


CAATACTCTT 


ACTCCTCAAT 


AAATCAATCA 


ACTATATACC 


G 


891 



(2) INFORMATION FOR SEQ ID NO: 878: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878: 



ACCAANTTGG GCTAGGTGTN CCTGNTAATC CCGATAAATC TATTAATGAG TGGTANTTTC 60 

NTTTTCAATA GAGNTNANCC NTTGACTCTT AGGACTTTTA AATGTAGACA GTTCATCAAT 120 

TACAACCATA TCAGNATGGC CATTCTTTTT TATATTGATC GCATAACCAT TTAGTANTTT 180 

CTTTATTGGT TCCATAGATA TCAGCCTCTG TGTTTAATGC ATCATTTCTT TCTTTAGGTG 240 

TTCCTAAGAC TAAAGACACT TTCAGATGAT TTAAATGGTT CCACTTATCA ACTTCATCAA 300 
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CCCATGTATC TTTAGCAACT TGTTTAGGTG CTATGACTAA CATTTTTTTA GTGTCTAACA 360 

ACTGCAATTC ACTAAATGCT GTAAGTGTTG ATACTGTTTT CCCTAGCCCC ATATCTAAAA 420 

ACAAACCGNA TTTCTCATTA TCAATAACTT TATCTATTGC ATACTTTTGA TAGCTATGTG 480 

GTTTGAAGTC AATCGNCAAA TGTTCCACCC TACCATTCTG GATAAAAGTA TTTACTTGTT 540 

CTTTTATTCC ATAACACATA TTACTTNATG GATCTCTTAT TTNCAAATTG GCTTATGCAC 600 

ATATTTTTGG G 611 

(2) INFORMATION FOR SEQ ID NO: 879: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879: 



GGCACGAGCG 


ACACATAATT 


TTAACACTTT 


TCTAATTAAA 


AAATCAATAC 


TCTTTTTTAT 


60 


CTTGNGGTTA 


GTTTAATAAC 


TTTAATTGTT 


AATTAAAGCT 


TTTCTATGTG 


ATCGAATACA 


120 


NCTTCTACTG 


GTGTTTCTCG 


GNCAAACATA 


TCTACTAATA 


CTGTTAGCTT 


AAACTTATCT 


180 


GTNACAATTT 


CTTGAACTNC 


ACCAACTTGA 


TTCGCAAATG 


GACCTGATTT 


AATACGAACT 


240 


TGCTCGCCAA 


CTTCGAGTAC 


AACATCGATA 


GTCTTTTCTT 


TAAGANCCAT 


NTGGTTTNAG 


300 


ATGAAGCGAA 


CTTCNTCTGG 


GAACAATGGA 


TGTGGNTTAG 


ACCCTGCANC 


TGGAGAACCC 


360 


TACAAAACCA 


GTAACGCTGG 


TGTATTCTAC 


ACATACCATG 


ATTCATCTGT 


CATGATACTC 


420 


CACTAAAACA 


TTCAGGATGT 


TT 








442 



(2) INFORMATION FOR SEQ ID NO: 880: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880; 

GGCACGAGCT TTCAAATTCT TCCACATGTT TGTTGATTCA TATCCTGGTG TAGGTTTACC 60 
TTCACTTGTA CTCACAATGT AATCACCAGC ATATTTTACA ATTTCTTCTT GTTTCACTTC 120 
AGCCCAACCT GCTTTTGGAG TTAACTTTTG GTGCTCTGGN TGTATTTACA AACCAAATGC 180 
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TTGATATAAT ACTTCTCCAC CACGACCCCA GTTATCGCCG TAAGTGTATA ATTTTTTATC 240 

AAATTCATCA AACAATGACA CTGGTGTATC TTGTCCAATT GCTTTTTTAA TTTCTTTACC 300 

GTCTTTAGGA GTTGTTTCTT CCCAATCTNT CTTCCAAGCT TTTACTTTAT CTTCTTNACC 360 

AACAATTTTC CCTAACATCT CTGGTTGCTC TAAATATTTA TGCTTATTAT AGACAACAAC 420 

TACTGTTGGG TGGTACTTTG TGATATTTTT TATATCTTGG CAGGAGAGTA TACAATAATT 480 

AATCTTGCTT CTCTTTAGCA ACTT 504 

(2) INFORMATION FOR SEQ ID NO:881: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881: 



GGCACGAGCT 


AGTTTGGTAT 


CGGCAATTTC 


TCAAGGATAC 


AGACCAGGTG 


ATTTTGAATC 


60 


AATAACTGTA 


ACCGTAGATG 


CAGATAAACC 


GTCATCACCT 


TGTGGTGCAT 


GTCGTCAAGT 


120 


TTTAAAGGAA 


TTATGTGATG 


ATGATATGCC 


TGTGTATATG 


ACAAATCATA 


AAGGAGATAT 


180 


GGTTATGATG 


ACAGNCGCAG 


AGTTACTACC 


ATTTGGATTT 


TCAGGAAAGG 


ATTTAGAATA 


240 


AATGACAGAA 


CATAAATCAG 


GATTTGTTTC 


AATTATAGGT 


AGACCAAATG 


TAGGGAAGTC 


300 


AACATTTGTT 


AATAGAGTGA 


TTGGCCATAA 


AATAGCAATC 


ATGTCCGATA 


AAGCTCAAAC 


360 


AACTAGAAAT 


AAAATTCAAG 


GTGTTATGAC 


AAGAGATGAC 


GCGCAAATTA 


TATTCAATGA 


420 


TACGCCAGGT 


ATTCATAAAC 


CTAAACACAA 


ATTAGGTGAT 


TATACTGATG 


AAAGTCGCTT 


480 


AAAATACATA 


TCTGAAATAG 


ATGCAATCAT 


GGTTTATGGC 


TAATTGCAAT 


GAGAAATTGG 


540 



(2) INFORMATION FOR SEQ ID NO: 882: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882: 

GGCACGAGCG TAAAATAAAT TGAGTGGATG CGTTTATATG GCGAAACAAA AAAAGTTTAT 60 
GAAGATTTAT GAGGCGTTGA AAGAAGATAT ATTAAACGGG CAGATTCAAT ATGGTGAACA 120 
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AATTCCGTCT GAACATGATT TGGTGCAATT GTACCAGTCA TCTCGAGAGA CCGNGCGTAA 180 

GGCATTAGAT TNGTNGGCAT TAGAGGGCAT GATTCAAAAG ATTCATGGTA AAGGGTCACT 240 

TGTCATTTAT CAGGAGGTTA CAGAGTTTCC ATTTTCTGAA CTTGTTAGTT TTAANGAAAT 300 

GCAAGAAGAA ATGGGCGTCG CATATTTAAC TGAAGTTGTT GTGAATGAGG TTGTTGAAGC 360 

GCATGAAGTT CCAGAAGTTC AACATGCTTT AAACATCAAT TCTAGGTGAN TCGCTCATTC 420 

ATATTGGNTA GGAACTCGGG CGACTTAACC AACAT 455 

(2) INFORMATION FOR SEQ ID NO: 883: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 : 

GGCACGAGCT TCACTCATGG CAGGGCTCGC ACCAATACTT AATAAACCAT TCGCTGTAAA 60 

ATTTTTAACT ACATCGTTTG TATAACAAAT TGTTAATGGG TTTTCAATAC GTATTTTATT 120 

TAGATAATTC ATCGTCTAAT CCCTCTTTCT TTAAATATGC AAAATGATTC ACTGGTCCTC 180 

TACCACGGNC GATTTCAGGT GTATATTGTA TACTCATTGA AATAAACTTT TTAGCCTTGT 240 

GTACAGCCTC AAATAATGGT CTACCTTTTG CAAGTTCTGC CGTTATAACT GCTGAAAATG 300 

TACACCCTGT TCCATGCGTA TGTTTTGTTT TAAATCGTTC ATTTTCAAAT GTTTGAACAA 360 

CTTCGNTAGT AAATAAATAA TCTTTTGCTA TATCTAGGAT CATTTGAATG ACCGCCTTTA 420 

ATGATGATAC CTTTACTACC AATCTCATTA ATAAAGATGC GGGCAGGCTG CATAATTT 478 

(2) INFORMATION FOR SEQ ID NO: 884: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884: 

CCACTTCATC GCACAATAAT ACACGGCACG AGTCGCCTAC GAGCGTTGAC GAGATCTTTT 60 
CAATATCAAT TTCGGATAAA AGTTTTCCAT CGTCCATTGT CCTTTACAAT CACAAACAAC 120 
GGACGGACGA AGTTGTTTAA TAAATCATTA CCATATTCTG TATGACGGTA CTTCTGGATG 180 
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GAATTGCACA CCATAAATGC GACGTTTCTT ATCTTCAATT GCTGCATAGT CTGTGCTTGG 240 

GCTATCAGCG ATAACTTCAA ANCCTTCTGG AATTTCAATA ACTTTATCAG AATGACTCAT 300 

CCAAACAGTT TGTTCTGCTG GTAAGCCAGC GAATAACTCA TCTGACTTCG CATTAATGAT 360 

TGCTTTACCG TATTCACGTT CATTGGCACG NTCAACTTTA CCACCTAATA ATTTAGTAGT 420 

TAATTGCATG CCGTAACAAA TACCAAGTAC TGGAATTCCT AAATTATATA TTTCCGGATC 480 

AATGTAAATG AACCTCTTCA TAAACTGATT TGCACACTGA TAAGATGATA CCTGTGGGTT 540 

CATTTCTTAT CTCTCAATTG AATTCAGATC GG 572 

(2) INFORMATION FOR SEQ ID NO: 885: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885: 



GGCNCGAGCG 


ATAATTTTAA 


AAATCCATGT 


ATAACCCCCC 


TCTTATAACC 


ATTTTAAGGC 


60 


NGGTGATGAA 


ATGGAGATTA 


TAGTTNATGA 


AAACTTAGTG 


CTTAAAGAAA 


AAGAAAGGCT 


120 


GCAAGTATTA 


TATAAAGACA 


TACCTAGCAA 


TAAATTAAAA 


GTAGTTGATG 


GTTTAATTAT 


180 


TCAAGCAGCA 


AGGCTACGTG 


TAATGCTTGA 


TTACATGTGG 


GAAGACATAA 


AAGAAAAAGG 


240 


TGACTATGAT 


TTATTTACTC 


AATCTGAAAA 


GGCGCCACCA 


TATGAAAGGG 


AAAGACCAGT 


300 


AGGCAAACTA 


TTTANTGCTA 


GAGATGCTGC 


ATATCAAAAA 


ATAATCAAAC 


AATTATCGGA 


360 


TTTATTGGCC 


GNNGAGAAAG 


AAGACACAGA 


AACGCCATCT 


GATGATTACC 


TATGATTAGT 


420 


AATAAATACG 


TGGTGAATAT 


ATTAATTGTG 


G 






451 



(2) INFORMATION FOR SEQ ID NO: 886: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886: 

GGCACGAGCG TAAACCGATG TGCATACCAC CAGCCGTGAA GCCTTGAGGT GTACTGATAT 60 
CGCCATGTTT AATAATTGAA AAGTTATATT GTTGTGATGT CGTTTCTTGA TGTTTCATTC 120 

776 



WO 97/30070 



PCI7US97/02318 



TAACACCCCT TATGGATAAA CTGGTGATTG ATTTAGGCCA GTCGTTACTT CAAAATCATA 180 

TAATATATTT AAATTTTGAA TGGCTTGACC ACTTGCGCCT TTGACAAGGT TATCAATCAC 240 

TGATACTAAA ATTGCTGTTT GTGTTGTTTC ATCTACATAG ATACCGATAT CGCAGTAGTT 300 

ACTACCGAGT ACTTCTTTTG TGGTTGGAAA AGTCCCAATA TCTCTAATTC TGACAAATGG 360 

CTGATTAGCA TAATAAGAGG TCATTAATTT ATGTAATGAT TCAGTCGTTT TTCAGATGAA 420 

ATTTGACATA AATTGTTGAT AAATACCTCG TGTCATGGTA CGAGATGTGG GNGTAATATG 480 

ACTGATACAT CTTGACCGCA ATGATAGATA ATATTGTTCG A 521 

(2) INFORMATION FOR SEQ ID NO: 887: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:887: 



GGGACGAGCG 


GACGAACATT 


AGAAGATGCA 


TTTGTAATTC 


TTGATGAGGC 


GCAGAATACG 


60 


ACACATGCGC 


AAATGAAAAT 


GTTTTTACAA 


GACTAGGTTT 


TGGCTCAAAA 


ATGGTAGTTA 


120 


CTGGTGACCA 


AACTCAAATC 


GATTTACCTA 


AAGGTGTTAA 


AAGTGGACTT 


AAGGAAGCGG 


180 


NCAGTAGGTT 


ACACAACGTT 


AAAGGTATAA 


GTATATTGAA 


ATTAGCTCAG 


AGCGATGTAG 


240 


NAAGACATCC 


ATTGGTAAGT 


AAGNTCATTG 


AACATTATGA 


AGGAGAGAAT 


TAAATGTCTA 


300 


CGATAGATTT 


TAGCGATCAC 


ACAGGCTTAG 


TTAAAGATGC 


GTGGTATAAA 


CAAATTGAAG 


360 


ATTTATTAGA 


ATTTGCTAAA 


AAAGAAGAGC 


ATATAGAAGA 


CGATGCTGAG 


CTCTCTGTTA 


420 


CATGTGTGGA 


TTCACAAGAA 


ATACAAGANC 


TTAATCGAAC 


ATATTAGAGA 


TAAAGATAAA 


480 


GTCACAGATC 


GTCTCTCATT 


TGCTTTAGAA 


GAAGATGAGC 


CAGAGATTGA 


T 


531 



(2) INFORMATION FOR SEQ ID NO: 888: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 1149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888 : 
ATCTACATTT CGGAAGTTGG TATGTGTCTT TTTTATTAAA TTTAAGGATG CTACGGTTAG 60 
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TTCAAATGAT 


TATGAATGAC 


ATTTAATGGT 


ATCATCCAGC 


CTCAGTACTT 


GGAGATAACA 


120 


TTTTGGCGGT 


GAGACATAAA 


TCATATTTTT 


ACAAAACTTA 


TTTCGTCGTC 


TCCCCAATAA 


180 


AGATAAAGTA 


TTTCTTTATT 


GTGGTGTAAC 


TTCTCCACGT 


TGTATTTCAC 


TATCCAAATT 


240 


TTGTTTCATA 


TTTTCAAACA 


ACTCTTGAAA 


TCTAACCTCA 


TCTTGATCAG 


AATAACCTTT 


300 


AAAAATAACT 


GAATCTATAT 


TTTCTTCCGN 


TAAAATAATA 


AATTTCATCT 


TCCACAACAA 


360 


CAATCGGATA 


TTTACATGCA 


GAATAGGCAA 


ACATATTCTT 


TGGATTTTCA 


ATTTCTACAA 


420 


TTGGGCCTCT 


ATTAATAATC 


AATAACTTTN 


GCGAACCCTC 


TTTTAAATAA 


ATAATGCTCC 


480 


CTATTGTTTC 


CATTTATTTT 


CCTCCTTTGT 


CACCGTCTCA 


ATTTCAACAT 


TCCTTCGCAT 


540 


TTCCAAGTCA 


ATCATCTCTT 


CTTCCGCTTT 


ACGAATGNCA 


TGTTGAATGT 


GCTGTTGCTG 


600 


CTCGTATAAC 


GCTTCCGNTG 


AACGTCGTTT 


CGCAAAATGA 


AATNCATCGC 


GGAACTGTTC 


660 


CATACGACTA 


GCCATATTGC 


TACTATGATA 


ACGGNCACCT 


AAATGTTGAT 


ATAATTGTTC 


720 


TATCGAATGA 


TTAAGCTCAT 


CACTTTCGTC 


TATCAAAGCT 


TTAACCTCTC 


GCTTNACATC 


780 


ATATAAGTCA 


TCCATNTGCT 


TATGTTTCGC 


TGTGATCATT 


TTATTGATTC 


CATCAAGTTT 


840 


ATTCGACATC 


ATTTCANCAC 


GGTTTAATTT 


TTACCAATTT 


CATTACCAAT 


CATTACGTCT 


900 


TGTGTGATTA 


ATTCTTCGAC 


AACTTNGTGT 


ATACTATCGC 


TCGCATTTCT 


TATTGCTTCA 


960 


AGCAACAGAT 


TGATCCTTAC 


TAATCAAATC 


GTCTTATCAC 


ACTATGAGNA 


TTATTATTTC 


1020 


CACTTACTGG 


TGTACTTNCA 


ACTTTATCCN 


GGTTCGACAC 


ATTATAAAAT 


GTTTCAACGG 


1080 


CAATTTTATA 


GATGATGATG 


ACTATAGTCT 


TGAAATATCA 


CTTTGACTGT 


ACTCAATATT 


1140 


TACACCCTC 












1149 



(2) INFORMATION FOR SEQ ID NO: 889: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889: 



AAATTTTAAA 


TAAACTTTCC 


TGCTGGGAAT 


TTTAGGGGAT 


TTAATCATTG 


GCNGGATTGC 


60 


AGGGAATTTA 


AAAATTTTAA 


GCCCCCACTT 


ATGGAATTTA 


TTATTGCATA 


TTTTATCAAG 


120 


TGGCAAGTTG 


GAAGCATTAA 


GTACAATGCA 


CCACCTACTT 


CCACTTGTAA 


GTATTTTAAG 


180 


AAGAGCCTGC 


CAAAAATTTG 


TATTTTTAAA 


CCAATGCGAT 


TAACCCATGG 


TGTATTCACA 


240 


CCGCTTGGTG 


CAAGATCCAA 


GCTGCAAAAG 


CTGGTCAATC 


AATTTTATAC 


ACAATTGAAT 


300 


CTAACCCTGG 


ACCAGGTTTA 


GGTATCTTAC 


TTGCTTATAT 


GATTTTTGGA 


AAAGGCACTG 


360 


CAAAAGCAAC 


GTCATATGGT 


GCTGGAATTA 


TCCACTTCTT 


AGGTGGTATT 


CATGAAATTT 


420 


ATTTCCCATA 


TGTATTAATG 


CGTCCTTTAT 


TATTTATCGC 


TGTTATTTTA 


GGTGGAATGA 


480 


CTGGTGTTGC 


AACTTATCAA 


GCAACAGGAT 


TTGGATTTAA 


AAGTCCCGCG 


TCACCAGGTT 


540 


CATTTATAGT 


TTACTGTTTA 


AATGCACCTA 


GAGGTGAATT 


CTTGCACATG 


TTGCTCGGTG 


600 


TCTTCCTTGC 


TGCACTTGTA TCATTCGTTG 


TAGCTGCTTT 


AATTATGAAG 


TTCACTAGAG 


660 
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AACCAAAGCA 


GGATTTAGAA 


GCTGCGACAG 


CTCAAATGGA 


AAATACTAAA 


GGGAAAAAAIT 


to n 
/ZU 


CAAGCGTTGC 


TTCTAAGTTA 


GTATCTTCTG 


ATAAAAATGT 


TAATACAGAA 


GAAAATGCTA 


Ton 


GTGGTAATGT 


TAGTGAAACA 


TCTTCATCAG 


ATGATGATCC 


TGAAGCGCTA 


TTGGATAATT 


840 


ACAACACTGA 


AGATGTTGAT 


GCACACAATT 


ACAATAATAT 


AAATCATGTT 


ATTTTTGGCT 


y uu 


GCGATGCGGG 


TATGGCJTTCT 


AGTGCAATGG 


GTGCAAGCAT 


GTTACGTAAT 


AAATTTAAAA 


you 


AGGCGGGCAT 


TAATGATATT 


ACAGGTTACA 


AATACTGCGA 


TTAATCAATT 


GGCAAAAGAT 




GCTCAATTAG 


GTATCACTCA 


AGAAAAAACT 


AACTGATCCG 


TGCTATTAAA 


CAAACACAAA 


i n o n 


ATGCCATCCA 


TATTTCAAGT 


GGATTAATTT 


CCTTAATTCA 


CCAAGATATG 


AAGGAACTTT 


1140 


TAAATAATCT 


TAAAAAAAGA 


TGATCAAGCA 


TAATAATTAA 


ATAAATTAAA 


AAATGGAGGA 


1200 


TACCGCCATG 


TTATTGAGTA 


CACGTGAAAA 


AGAAATGATA 


GCCCTATTGA 


TTAAGTACCA 


1260 


CAGTCAATAT 


ATCACTATAC 


ACGACATTGC 


TCAGCAACTT 


GCGGTGTCCT 


CTCGTACTAT 


1320 


TCACCGTGAA 


TTAAAAGGTG 


TTGAAGCATA 


TTTAACTTCA 


TTTTCATTAA 


CTTTAGAACG 


1380 


CGCAAACAAA 


AAAGGGCTAC 


GCATTGCTGG 


CGCAGATTCT 


GATTTAAACG 


ATTTGAAGCA 


1440 


ATCGATTGCA 


CAACATCAAA 


CCATTGACTT 


ATCTGTTGAA 


GAGCAGAAAG 


TAATTATTAT 


1500 


ATACGCTTTG 


ATACAAGCCA 


AGGAGCCAGT 


TAAACAATAT 


AGTTTAGCGC 


AAGAAATTGG 


1560 


CGTTTCTGTC 


CAGACTTTAG 


CAAAGATGTT 


AGATGATTTA 


GAGCTTGATT 


TAAATAAGTA 


1620 


CCAACTATCT 


TTATCTCGAA 


AGCGTGGCGA 


AGGCATTTAC 


TTGGTAGGTA 


CTGAATCAAA 


1680 


GAAACGTGAA 


TTTTTAAGTC 


AATTAATGGT 


GAATAACTTA 


AATAGTACTA 


GCGTTTATTC 


1740 


AAGTAATTGA 


AAATCATTTN 


GTCTTTCATT 


CATTAAATCA 


AATCCACAAA 


GACTTTGTTT 


1800 


GACTTAGAGC 


GCATTTTTAA 


TGTTGAAAGA 


CTATTAATGG 


ACTACCTAAG 


TGCCTTACCC 


1860 


TACCAACTTA 


CCGAATCAAG 


TTATTTAACT 


TTAACTGTCC 


CATATCGTGC 


TATCCCATTC 


1920 


CACGTATAAA 


AAATGGAGAG 


TATGTCGCAT 


TAAACGATGA 


TATTTATGAT 


TCTGTACAAA 


1980 


ACACATTTGA 


ACACAAAGTA 


GCAAGCGAAC 


TTGCTGATAA 


ACTTGGTCAA 


ATATATGACG 


2040 


TCACGTTTAA 


TCAGGCAGAA 


ATTGCTTTCA 


TTACTATCCC 


ATTTACGTGG 


AGCTAAACGA 


2100 


AAAAATCTTA 


ATGATACATC 


ATTAAATAAT 


CGTTGTGAAG 


AAAACAAAAT 


TAAAGCGTTT 


2160 


GTTAACAAAG 


TAGAAATGAT 


TTCCGGTATG 


ACATTTGCAG 


ATTTGGATAC 


TTTAGTAGAT 


2220 


GGACTGACGC 


TACACCTTAA 


TCCTGCAATC 


AATCGTTTGC 


AAGCTAATAT 


CGAGACCTAT 


2280 


AATCCGTTAA 


CAGACATGAT 


TAAGTTCAAA 


TATCCAAGAC 


TATTTGAAAA 


TGTAAGATTA 


2340 


GCTTTAAATG 


ATTGTTGGCC 


TGATTTGATT 


TTTCCAGAGA 


GTGAAATTGC 


TTTTATAGTT 


2400 


TTACACTTTG 


GTGGCTCGAT 


TAAAAACCAA 


GGTAATCGAT 


TTTTAAACAT 


ATTAGTCGNT 


2460 


TGNAGCAGTG 


GTATGGGAAC 


TAGTCGTCTA 


TTATCAACTC 


GTCTAGAGCA 


AGTTTTTAGT 


2520 


GAGATTGAGC 


GTATTACACA 


AGCATCAGTC 


AGCGATTTGA 


AGTCACTAGA 


TTTAAGTCAA 


2580 


TATGATGGCA 


TTATTTCTAC 


TGTGAATTTA 


GACATCGACT 


CCCCCTATTT 


AACGG 


2635 



(2) INFORMATION FOR SEQ ID NO: 890: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:890: 



CTTAAGGAAC 


GTACAGACGG 


CTTAAAAGCC 


TTTAAAAACG 


TTTTTAAGGG 


GTTTGTAGAC 


60 


AAGGTAAAGG 


ATAAAACAGC 


ACAGTATTCC 


GAAGTAAGGT 


GTGTTTAGTG 


CTAAGTGGTT 


120 


AAGCACGAGC 


TTGTGCCTCC 


TCTATCGTTG 


ATCTTTGTTG 


TGACTTTGTC 


ACCTTTAGAT 


180 


CTCGAAGTAA 


CTTTCATAAT 


AATTTGTGTG 


ACATAGCCAG 


TACAAATAAG 


TAATAGTATT 


240 


GTTGAGACGA 


TTATTAGTCC 


AATGATTAAA 


AATGGTGCTT 


GGCTAATGAC 


ACCTAAAGAG 


300 


TTAACAACTG 


AGATACCGGC 


TGGTACGAAG 


AGTAAGCCAA 


TGTTATTTGT 


TAGTGTCGCT 


360 


CCTACTTTTT 


CGACTTCGCC 


TAACTTAACA 


GCACCAGTAC 


ATAATAATAC 


AAATAATAAT 


420 


ACTAAACCGA 


TTACTGATGC 


AGGCATAGGA 


ATTGGCATAA 


ATGANTCAAT 


TATTTGCGAT 


480 


ACAAAGAGTA 


CTAAAGCAAT 


TACAATGACT 


TGGNTGAAAA 


AAGTGTGCTG 


GTTGTGGATG 


540 


CGTCTTTGTG 


GTGTTTCACG 


ACCATTGNCT 


CCTACGTGTG 


GATTTANCTA AAGTATAGAT 


600 


GGCTCACTTC 


GATGTGCGTG 


ATTTTTAGTC 


CGAAATACAA 


AATATCATAG 


GTAAAATGCA 


660 


TAAAAAAAAG 


GATTACTGTT 


AAAGTAATCC 


TATCGACGCT 


TTAAAATCTT 


TCATAAATGA 


720 


AC 












722 



(2) INFORMATION FOR SEQ ID NO: 891: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891: 



CCTCCNACAA 


AANCAACTTA 


ACTGTTGTAT 


TGATGAAGGG 


AGATTTTTGC 


GTAAAATAAT 


60 


TTGTTAATTT 


TCGAACCACC 


AACGACCAAT 


CAATTGCTAT 


GGTCATTGCC 


NAAACCCAAA 


120 


TCAATATTAA 


CCTTGTAAGA 


CTTGGTTACN 


TAGTAAGGGA 


ATATTTCCCT 


ATGAAATAAC 


180 


TAGATGTCAC 


AATCTTGGAT 


AAATTTTAAT 


TCTTCAAGNT 


TGTTGGTCTT 


TTCTTAAGTG 


240 


AATCTTCTAA 


TTCAAGAATG 


CCATACCTGC 


ACCTAGAGCT 


AATTCAGCAT 


ATGGTAAATC 


300 


GTCATTATTG 


TGACATACCA 


GTATCTGGTA 


AAGTTTTAGC 


TTGTTGTTTA 


GGCTTATTAA 


360 


CTTTTCCTTG 


TTGGTCTGAT 


TTTGCATTTG 


CTTTGTTTTC 


TTCAGGCTTA 


GTTACATTAA 


420 


GCATATCTTG 


ATGCATACTA 


TGACTACCAT 


TTGAAACTGT 


TGCTGGAGAT 


GCATTGGCAC 


480 


CGTCGTTTTG 


CGTAGCTTTA 


TTGTTTGCAG 


CTGAACCAAC 


TGATTTTTGC 


GTATCATTAG 


540 


TTTGCATATC 


TTTAGCTTTA 


TTGTTTNCAA 


CTGGGCCAAC 


TGATTTTTGC 


GTATCATTAG 


600 


TATCTGCTGT 


TGCCCGNATC 


ATCTNTTTGC 


CTAACATNAG 


NTGAAGTCAT 


TTTATCTTTT 


660 


GCNTCAAGAA 


GATGCAGATG 


TTGATGGTTT 


ATCCGAAACT 


TCAGTATCAG 


CTTTGCTTGG 


720 


CGANTTANCT 


GCTTCGTTAG 


ATGCAACGTT 


AGTTTCAGAC 


TTAAGTTGTC 


CTGCATCAGN 


780 


TTGATTTGTC 


GTACTTTCTT 


CTTTATCTTT 


TGATGTATTA 


GAAGGTACAN 


TTGGTTCTGT 


840 
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TATGTCTGCT 


GAAGGCAATG 


TTTCAGTTGT 


TGANTCAACC 


ATACNTTGAA 


TTGTrTGAATf. 


900 


TCTACCATCT 


TTTTCTGCCT 


TAGCTTTANT 


TTCAGANTTT 


GGTTGTGCAA 


CCTTGCCATT 


960 


AGTTGATTGA 


GANTCAGCAC 


TATTANTTAC 


TTCAGCATTT 


J lXji L i J i*TGAAT 


CATTTACAGA 


1020 


TGCATTATCT 


TTACTATCAG 


CAGATGATGC 


TGCTTCTGTG 


CTCGCAGTTG 


TTGGAGCCGG 


1080 


TGCTGNNGAT 


CCGGGTCGNG 


CATTCTCGNT 


TGTTGCTGTA 


GGTGTACTAT 


TGTTAATTGT 


1140 


TGTGCTTTCT 


GCTGGCGNTG 


CATTATCTGT 


TTCTGTTACA 


GGTTTATCAG 


TTGTGCCGTT 


1200 


ATTAGTTGAT 


TCTACTTCTG 


GTTTACTAGT 


TACAGCGTTA 


TCCATTGTCG 


GACTGTGTGT 


1260 


TGATGCATCT 


GCACTAGAAT 


TGTTATTAGC 


TTGCGGTTTA 


TCCTNTGCAT 


CATCAGTTGC 


1320 


TGATGTTACT 


GTTGTNTCAC 


CTGTTGNCGC 


ATCACTATTA 


TTTGNTGTTG 


GCGGAGAAGC 


1380 


GTCTGCTTTG 


CCATTATCTG 


TCGTCTCAGA 


TACGTTAGGN 


TGGCCAGTGT 


TCTCTGGTGT 


1440 


NGCAATTAGC 


ANTNNGAATT 


TGGCTNGNNG 


CATCAATNAT 


TATCTTGTAC 


CATTATTAGT 


1500 


ATCATTAGCA 


TCTGGGATCA 


TTCTTGAGGC 


ACAATCGCTT 


CAATTGCAGG 


TATCGTTACA 


1560 


TTTTGTAATT 


CAGCAACCTC 


TGTATTTGTT 


TGTGTTTTAT 


CTAATTTATC 


AGCAAATCTG 


1620 


TCAAAATATC 


TACCTAAATC 


TGTACGTGCA 


atttctttcg 


CCGATGCATC 


TGCATCTGCA 


1680 


TTTTTAATTA 


TTTCTATTTG 


CTTGTTANCC 


ACTTCTCTAA 


TTGCTTCCAA 


AGCATTTTTC 


1740 


TTAACTACTG 


GATTAATACG 


TTGTGCTTTA 


AGTTGTTCAA 


GCGCACTATT 


TTTGACAGTA 


1800 


GCGATTTCTG 


CATTTGTAGT 


TTGATCAGAA 


ATATCTTCAG 


TTGCTTTTGA 


GAAAATGTCT 


1860 


TCTAAAGCAG 


TCGTAAACGC 


TTCTTTTTCT 


TCAGTTGTAG 


CATCACCGTC 


GACATTTACA 


1920 


CCTGCTTCAA 


TCTGGGCTAG 


TGCAGTTTCT 


AATTCTTCGA 


TAGCCTTTTG 


TTTTTCTGTT 


1980 


GAGTCAATTT 


GAATGTTATC 


AAATGCCTCA 


AGTCCTTGAG 


CTTTCGCTTT 


NTCAACTTCA 


2040 


GCAGTTGTTG 


TTGCATCAGT 


AATACCTTGT 


TTAGCTTGAT 


CTGTAATTTG 


TTTAATCATT 


2100 


GCTAATGCTT 


CAtfrnrrrc 


TTCAGCAGTT 


AACTGGTCAC 


TATGATCAAT 


TGATTCTTTC 


2160 


GTATCTTCTG 


CTTTAGCTTC 


AATAGCTTGG 


TTCGCTTTAG 


GTTTAACAGT 


AGCATCTACT 


2220 


TGAATAGTAT 


CAATTGCTGC 


TTTACCTTGT 


GTTAATGCAT 


CANCAACGTC 


ACCATTATCC 


2280 


ACACCATTAT 


TAATGTTTTC 


TAATGCAGTT 


TGAACATTTT 


GGTCAACTTG 


CTTAATTGCT 


2340 


TGTTGCTTTT 


CATCTTGTGT 


TGCATTAGTG 


TCAGCTGAAA 


TATTATTTTT 


CTTCTGATCT 


2400 


GCATAAGCAN 


ATAAATCTGN 


TGTAGCTGAT 


TCTTTTTTAC 


CGGTTGGAAN 


TGTGTAATCG 


2460 


TTAATANTAT 


CTAAGTCATT 


ATGNATNTGN 


NCTTCNATGT 


CATCTTTAGN 


AGNAGCNTGA 


2520 


GTAACATTTT 


GATCCGCAGG 


TTGTTTTNAT 


TCAGCAAGTT 


TTTGTTTCGC 


TTCAGCAATT 


2580 


TCACNTGAAG 


TCGATGCGTT 


AGAGTTATCC 


GCTTCGTTTA 


CTTTAGCATT 


AG 


2632 



(2) INFORMATION FOR SEQ ID NO: 892: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892: 
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ACCCCTTTGN 


NCGTATTGNA 


NTCNCNAGCA ATTTTNAATT 


CCCNAGTTGC 


AAGAGATTCC 


60 


AAGCTACCNC 


CAGATGCNGT 


AGTTGAAGNN 


ANACAAGCAG 


CTGATGCAGA 


ACCAAATACT 


120 


GNAAATGGTA AACCAAATCA 


AGCCATTTCA 


GCAGCAACTA 


CTAACGCACA 


AGTTGATGAA 


180 


GCGAAAGCTA ATGCAGAAGC 


AGCAATTAAT 


GCGGTAACAC 


CAAAAGTTGT 


TAAGAAACAA 


240 


GCGGCTAAAG 


ATGAGATTGA 


TCAATTACAA GCAACGCAAA 


CAAATGTTAT 


CAATAATGAT 


300 


CAGAACGCTA 


CAAATGAAGA 


AAAAGAAGCA 


GCTATTCAAC 


AATTAGCAAC 


AGCAGTTACA 


360 


GACGCGAAAA 


ATAATATTAC 


AGCTGCAACT 


GATGATAATG 


GTGTAGATAC 


AGCGAAAGAC 


420 


GCTGGAAAGA ATTCAATTCA AAGCACACAA CCAGCAACAG 


CAGTTAAATC 


GAATGCGAAA 


480 


AATGAAGTTG 


ATCAAGCTGT 


GACAACTCAA AATCAAGCAA 


TTGATAATAC 


AACTGGCGCT 


540 


ACAACTGAAG 


AGAAAAATGC 


AGCAAAAGAT 


TTAGTTTTAA 


AAGCTAAAGA 


AAAAGCGTAT 


600 


CAAGATATCT 


TAAATGCACA 


AACAACTAAT 


GATGTTACGC 


AAATTAAAGA 


TCAAGCAGTT 


660 


GCTGATATTC 


AAGGTATTAC 


TGCAGATACA 


ACAATTAAAG 


ATGTTGCGAA 


AGATGAATTA 


720 


GCAACAAAAG 


CAAACGAACA 


AAAAGCGCTT 


ATTGCACAAA 


CTGCAGATGC 


GACTACTGAA 


780 


GAAAAAGAAC 


AAGCAAATCA 


ACAAGTNGAC 


GCACAATTAA 


CACAAGGTAA 


TCAAAATATT 


840 


GAAAATGCAC 


AGTCAATCGA 


TGATGTAAAC 


ACTGCAAAAG 


GATAATGCAA 


TTCAAGCAAT 


900 


TGACCCAATT 


CAAGCATCAA 


CAGATGTTAA AACGAATGCA 


AGAGCAGAAT 


TGCTAACTGA 


960 


AATGCAAAAT 


AAAATAACTG 


AAATACTTAA 


TAATAATGAG 


ACTACTAATG 


AAGAAAAAGG 


1020 


TAACGATATT 


GGCCCAGTTA 


GAGCAGCATA 


TGAAGAAGGT 


TTNNNTANTA 


TTAATGCAGC 


1080 


ANCTACTACA 


GGTGATGTAA 


CTACTGCTAA 


AGATACAGCA 


GGTACAAAAA 


GTTCAACAAC 


1140 


TTCATGCAAA 


TCCTGTTAAG 


AAACCAGCAG 


GTAAAAAAGA 


TTAGNTCAAG 


CTGCAGCTGA 


1200 


TAAGAAAACA 


CAAATAGAAC 


AAACACCAAA 


TGCATCACAA 


CAAGAAATTA 


ATGATGCAAA 


1260 


ACAAGAAGTT 


GATACTGAAT 


TAAATCAAGC 


GAAAACAAAT 


GTCGATCAAT 


CATCAACAAA 


1320 


TGAATATGTT 


GATAATGCAG 


TTAAAGAAGG 


AAAAGCTAAA 


ATTAATGCAG 


TTAAAACATT 


1380 


TAGTGAGTAC 


AAAAAAGATG 


CTTTAGCTAA 


AATTAAAGAT 


GCATATAATG 


CTAAAGTAAA 


1440 


CGAAGCGGAT 


AACTCTAACG 


CATCGACTTC 


AAGTGAAATT 


GCTGAAGCGA 


AACAAAAACT 


1500 


TGCTGAATTA AAACAAACTG 


CGGATCAAAA 


TGTTAATCAA 


GCTACTTCTA 


AAGATGACAT 


1560 


TGAAGTTCAA 


ATTCATAATG 


ACTTAGATAA 


TATTAACGAT 


TACACAATTC 


CAACAGGTAA 


1620 


AAAAGAATCA 


GCTACAACAG 


ATTTATATGC 


TTATGCAGAT 


CAGAAGAAAA 


ATAATATTTC 


1680 


AGCTGACACT 


AATGCAACAC 


AAGATGAAAA 


GCAACAAGCA 


ATTAAGCAAG 


TTGACCAAAA 


1740 


TGTTCAAACT 


GCATTAGAAA 


ACATTAATAA 


TGGTGTGGGA 


TNAATGGTGA 


CGNGGGAGGG 


1800 


GATGCATTAA 


ACACAAAGGG 


TNAAAGCAGG 


AAATTGATAC 


TATTCAAAGT 


AGGATGCTAC 


1860 


TGGTTAAACC 


TAAAGCGGAA 


CCAAGCTATT 


GGAGGCTTTA 


GGGAGGCGGT 


TCCGGGAGGA 


1920 


TTCCNATTGG 


GCC 










1933 



(2) INFORMATION FOR SEQ ID NO: 893: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
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• <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893: 



AAATTCATAA 


TAAAGNCTTT 


TGNCATTAAA 


GTTNNAAAGA 


AGATAAAGAA 


NCTGCTNNAA 


60 


ATTGACGTAT 


CNNAAAGAAG 


TAAAAGAAGT 


TTAAATANTC 


AAATAATAAA 


TCNACAACAC 


120 


AAGANATTTC 


TGAAGAACAA 


AAAGGTGAAT 


ATCAAAGAAA 


GTCAGAGGCA 


TTAAAAGAAA 


180 


GATTTATAAA 


CAGACAAAAA 


TCTAAAAATG 


AGTCTGTGGT 


TTCACTAATC 


GATGACGAAG 


240 


ACGACAACGN 


AAACGACAGG 


CAACTTGTGG 


TTTCTGCGCC 


ATCAAAGAAA 


CCAACAACAC 


300 


CGACTACATA 


TACTGAAACA 


ACGACTCAGG 


TANCAATGCC 


TACAGTTGAG 


CGTCAAACTC 


360 


AGCAACAAAT 


CGTTTACAAA 


ACACCCAAAA 


CCATTAGCTG 


GATTAAATGG 


TGAAAGTCAT 


420 


GATTTCACAA 


CAACGCATCA 


ATCACCAACA 


ACTTCAAATC 


ATACGCATAA 


TAATGTTGTT 


480 


GAATTTGAAG 


AAACGTCTGC 


TTTACCTGGT 


AGAAAATCAG 


GATCACTGGT 


TGGTATAAGT 


540 


CAAATTGATT 


CTTCTCATCT 


AACTGAACGT 


GAGAAGCGTG 


TAATCAAGCG 


TGNACACGTT 


600 


AAGAGAAGCT 


CCAAAAGTTA 


GTTGATAATT 


TATAAAGATA 


CACATAGTTA 


GAAAAGACCG 


660 


ATTAAATGCA 


CAACAAAAAG 


TAAATACCTT 


AAGTGAAGGT 


CATCAAAAAC 


CGTTTAATAA 


720 


CCCAATCAAT 


AAAGTANCCA 


TGCCAATAAT 


ATTAATGCAT 


GGCTGCAAAG 


CAAATAATGA 


780 


GTTTGTCGTA 


AAAATACCAA 


CATTTAAACT 


AGCAATAAAT 


AATATCCAAG 


TCATCATTTC 


840 


ATTGATGCAA 


TCTAGTATAG 


TCCACATTCT 


AAACAGGTGT 


GGACTATTAC 


TTTTTTCACT 


900 


TTATATTACC 


GAAAAAATTA 


TTATGCTTAA 


CTATCAATAT 


CAATAATTAA 


TTTTAAGCTG 


960 


AAAAACAATA 


AAAATGTTAA 


GACAACGTTT 


ACTTCAAGTT 


AATTATTATA 


CTGAAAATTC 


1020 


TGGTATATAA 


TGCTGTTAGT 


GAATATAACA 


GGGAAATTAT 


ATTGGTTATA 


ATATTGAGTC 


1080 


TATATAAAGG 


AGAAATAACA 


GATGAAAAAG 


AAATTATTAG 


TTTTAACTAT 


GAGCACGCTA 


1140 


TTTGCTACAC 


AACTTATCAA 


TTCAAATCAC 


GCTAAAGCAT 


CAGTGACAGA 


GAGTGTTGAC 


1200 


ACAAAATTTG 


TAGTTCCAGA 


ATCAGGAATT 


AATAAAATTA 


TTCCAGCTTA 


CGATGAATTT 


1260 


AAGAATTCGC 


CAAAAGTAAA 


TGTTAGTAAT 


TTAACTGACA 


ATAAAAACTT 


TGTAGTTTCT 


1320 


GAAGACAAAT 


TGAATAAGAT 


TGTAGATTCA 


TCGGCAGCTA 


GTAAAATTGT 


AGATAAAAAC 


1380 


TTTGCCGTAC 


CAGAATCAAA 


GTTAGGAAAC 


ATTGTACCAG 


AGTACAAAGA 


AATCAATAAT 


1440 


CGCGTGAATG 


TAGCAACAAA 


CAATCCAGCT 


TCACAACAAG 


TTGATAAGCA 


TTTTGTTGCT 


1500 


AAAGGCCCAG 


AAGTAAATAG 


ATTTATTACG 


CAAAACAAAG 


TAAACCACCA 


CTTCATTACT 


1560 


ACGCAAACCC 


ACTACAAGAA 


AGTTATTACT 


TCATNCAAAA 


TCAACACATG 


TNCATTAACA 


1620 


TGTNNATCAT 


GCAAAAGGAT 


TCTTTTAATA 


NACACTTTAT 


TGTTACACAT 


CAGACTCGCC 


1680 


TAGATATACA 


CATCCATCTC 


AATCTTTATT 


ATCAAGCATC 


ATGTGCAGCT 


CCTGGATATC 


1740 


ACGCGCATAA 


TTTGGTACAC 


AGGGCATGCT 


AGCATTAA 






1778 



(2) INFORMATION FOR SEQ ID NO: 894: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894: 



TANAATNTCA 


CAGNAAAGTN 


TAANATATAA 


NGATCATTCA 


CATNANCATA 


CGAATAAGCC 


60 


AGGTACATAT 


GTTTCTGCAA 


TTAATGGTAT 


GGAGAAGGAA 


ACGCACAAGT 


CAAAAANCNC 


120 


TCACATNATA 


TGTATTCTAN 


TAANACAAAT 


CATCGCGCTA 


AAGATNCACC 


TCCAGATTNT 


180 


CACANAGANA 


GTNTCACGGA 


CTTCAGAGCG 


TACCGGCAGN 


TNTTNGGGGN 


ACAATGAAAC 


240 


CTNAANNGTT 


NGAAAATGGG 


CGNTATCCCA 


GTAAGTAANC 


CTGCAGAANA 


TGTTGANTCA 


300 


GATAAACAAA 


ACTATGATAA 


NTATGTAGCT 


AAGACGCAAA 


CGTCTCAAAA 


TAAGCAATTA 


360 


GAACAAGAAA 


AACAATTTGA 


TAGTGTTGGC 


AAACAAGGAA 


CTGCATCTAA 


ATCATCTGAT 


420 


GAAANTGTAT 


CATCAACAAC 


AATNTCAATG 


CCTAATTATT 


CAAANGGTGG 


NTAATACTTT 


480 


CAAAATTGGA 


AAT 










493 



(2) INFORMATION FOR SEQ ID NO: 895: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895: 



GAATTCTNAA 


ATATCACAGC 


AAAGTATAAA 


ATATAAAGAT 


CATTCACATT 


ACCATACGAN 


60 


TANGCCTGGN 


ACATCTGTCN 


CTGCANTCNC 


TGGTATNGAG 


TCGGCATCGC 


ACNCGNCATA 


120 


CCCTCTTTNN 


TATGTATTCT 


AATANCACAN 


NTCATCGCGC 


TATCGTTTCA 


NCTCCAGATN 


180 


ATCACAAAGA 


AAGTTTCAAG 


ACTTCAGAGG 


TACCGCCAGC 


TATCTNCGGC 


ANAATGANAC 


240 


CTAANNAGTT 


AGAAAATGGT 


CGTATCCCTG 


TAAGTAAACC 


TTCAGAAAAA 


GTTGAGTCAG 


300 


ATAAACAAAA 


ATATGATANA 


TATGTTGCTA 


AGACGCAAAC 


GTCTCAAANT 


TAANCAATNN 


360 


GGACAAGGAN 


AANCAANNTG 


ATNGNGTNGG 


CAAACAANGN 


ACNGGATCTA 


AATCATCTGA 


420 


TGAANATGTT 


TCATCAACAA 


CAAAATCAAT 


GCCTANTNTT 


CAANGGCNGG 


TGATNCTATC 


480 


AAANTGGAAA 


TTTNNGGCTC 










500 



(2) INFORMATION FOR SEQ ID NO: 896: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896: 






TTTGAAGCTT GAAGTTAATA CACTTTATCA GTTGTANCGC 


ACTTTACAAA 


r*c*r* A A *P ATV^ A 

L-vjv^AA 1 n- ion 


60 


GCCACAATNC GGCAGGCGTT GATGTTTCTG GGAGATGTTG 


CTTTGTCCGC 




120 


CCCACACATA TTAAAGCAGA TGGTGATGCC GATGTCACTT 


ATGGAGTAGC 


A A TTTiTPfifST 


180 


GATAAAGTTA CACGTAAAGA ACTTACTATT GAAGCAACTG 


TCACACCGCT 


A TfVrTTTCT 
A J.\j\-V-V_ 1 lul 


240 


TCNCAAGANA TAAGTGAGTA TTCAGCGCAC ACCCAACGTG 


GTGTTGTCAC 


L lAAA/tl-/t 


300 


TATATCAATA AAGATCANAA CATAGTTGAT GACTATAAAA 


ACAAAATTTT 


A A rnr«p A A TV* 
ACiA AA Jaj 


JUU 


GAAGCTAATG CAAGTTCAAT TCTATACCCT ATTTTAAAAC 


GTCCTGATGA 


AAAACGTt>I\j 




ACAGAGCGTG CATATGAAAA TCCACGTTTT GTCGAAGACT 


TAATTCGTTT 


AATTGCCGCks 


ARCS 


GACTTAGTTG AATTTGATTG GTTAGACGGN TTCGATATCG 


AATGTCGGAA 




RAO 


ATCCATCAAC ATGATGCTTT CGCTAAATTA AAGTATCGCA 


AATAATATCA 


IAI IajAI 1AA 




TTCAAACGCG TAACATTAGT TGCGCGTNTT TATTATACCT 


TCTAGAGTTG 


UA I A 1 v_ IV 1 1 




TTNATTAAAT CTTCTCANTT GATGTGCCCG AGAGCATAAA 


TCAATATCCT 


AAGCTCTACA 


/zU 


AATGATATAT TGGTCGTAGT TGATTGAATA ACTACTATCA 


TTATAAATAT 


CAAAATTTGT 


780 


TGTTTTATAA ATAAACACAT CATCCAGTTA AAAGAAGTAA 


TTAAATATCA 


TGTGAAGTTA 


840 


ATTATAAAAA ATAGATATAA GTGGTGTGGT TAATGAATTA 


TTTTGTTGGT 


TAATAGTTTA 


900 


GGTGTTTAAT TTACCTGGAA TGGAAAAAGC AATTATTATC 


CGTTTAAATT 




950 



(2) INFORMATION FOR SEQ ID NO: 897: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897: 

GAATTCGGCT AAAAATNACT GAGNTGGTAC GCNNTTNTTN GATCATGCGA NGNAANCAGC 60 

GANACAGCAG TTAAGTAATA TGACGNATNT AACACCTGCA GAAAAGNCNG ATGNATCATA 120 

NCANGTTAAT NGAGGGACTA CTGNTGCTGG NGTTCACACG GCACAATCAT ATGCCACCAC 180 

ATTACATCAA GCGNNGAGTA CGTGNAAGAC ATCNNATNGC TANCAGTGAT NNGNNTAAAG 240 

NANGNGANGA TNNCGTTGAN GCTANTNCTG CTAGNNAACG AGAATCTGGC ACCGTGGTAN 300 

CTGCTNCTGN CACGATTTTT GGTGNGTANC AGAANTCCNG NCTTCGATTC ATCTGCCGCA 360 

TACACANCTC CNAGNGCCGN CGNTTAGTCT TAGTGGGANT TAACCGNGNT GNATTNNCNN 420 

TTCGCANCGC ATNGGCGAGA GNTCCCCCAG NNGATCGTGG GTGTTTNAGT TTCCAACCCN 480 

NCAATTNGGT TGCCANTTNT NTGGNNNANA GNACCCCTCT NTTNTNCGGC C 531 
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(2) INFORMATION FOR SEQ ID NO: 898: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898: 



ACCCCACCTA 


TTGNCCCCCC 


AATTATGTNA 


ATCCTTTGGA 


AANAACCACC 


CCTTAACNCC 


60 


TAAAGTTGAA 


ATACTGGTAA 


TCAAGNAGTT 


GATTATGGAA 


ATATTANACT 


TAGGGATTGG 


120 


TAGTACCATT 


ATTGCCCCAA 


TACAAGAATN 


GAAGTTATNN 


AGTTAACCCT 


AATCACCAAT 


180 


TNCNTCNNAA 


GTAAAAAGAA 


TCCTATGATT 


TTAGTCAAAN 


CCGGAAGATG 


TNACCAAGTC 


240 


AATTTGATAA 


TANNNAATCA 


TTAAGTAATA 


ATGTACCNAC 


ATTGGATTTT 


GGGTGATATT 


300 


AATTCAGCCT 


ATATTATCNA 


AGTTGTTAGT 


AAATATACAC 


CTACATCAGA 


TGGCGAACTA 


360 


GATATTGCTC 


AAGGTACTAG 


TATGAGAACA 


ACTGATAAAT 


ATGGTTATTA 


TAATTATGCA 


420 


GGATATTCAA 


ACTTCATCGT 


AACTTCTAAT 


GACACTGGCG 


GTGGCGACGG 


TACTGTTAAA 


480 


CCTGAAGANA 


AGTTATACAA 


AATTGGTGAC 


TATGTATGGG 


AAGACGTTGA 


TAAAGACGGT 


540 


GTCCAAGGTA 


CAGATTCGAA 


AGAAAAGCCA 


ATGGCAAACG 


TTTTAGTTAC 


ATTAACTTAC 


600 


CCGGACGGTA 


CTACAAAATC 


AGTAAGAACA 


GATGCTAACG 


GTCATTATGA 


ATTCGGTGGT 


660 


TTGAAAGACG 


GAGAAACTTA 


TACAGTTAAA 


TTCGAAACGC 


CAGCTGGATA 


TCTTCCAACA 


720 


AAAGTAAATG 


GAACAACTGA 


TGGTGAAAAA 


GACTCAAATG 


GTAGTTCTGT 


AACTGTTAAA 


780 


ATTAATGGTA 


AAGATGATAT 


GTCTTTAGAC 


ACTGGTTTTT 


ATAAAGAACC 


TAAATATAAT 


840 


CTTGGTGACT 


ATGTATGGGA 


AGATACAAAT 


AAAGATGGTA 


TCCAAGATGC 


TAATGAACCT 


900 


GGTATCAAAG 


ATGTTAAGGT 


TACATTAAAA 


GATAGTACTG 


GAAAAGTTAT 


TGGTACAACT 


960 


ACTACTGATG 


CCTCGGGTAA 


ATATTAAATT 


TACAGATTTA 


GATAATGGTA 


ACTATACAGT 


1020 


AGAATTTGAA 


CAACCAGGAG 


GTTACACGCC 


AACGGGTTAA 


AAATACTACA 


GCTGAAGATA 


1080 


AAGATTCTAA 


TGGTTTACAA 


CAACAGGTNG 


TCATTANAGA 


TGCAGATAAT 


ATGACATTAG 


1140 


ACAGTGGTTT 


CGATAAAACA 


CCAAAATACA 


GTTTAGGTGA 


TTATGTTTGG 


TACGACAGTA 


1200 


ATAAAGACGG 


TAAACAAGAT 


TCAACTGAAA 


AAGGTATCAA 


AGATGTTAAA 


GTTACTTTAT 


1260 


TAAATGAAAA AGGCGAAGTA 


ATTGGAACAA 


CTAAAACAGA 


TGAAAATGGT 


AAATATCGTT 


1320 


TCGATAATTT 


AGATAGCGGT 


AAATACAAAG 


TTATTTTTGA 


AAAGCCTGCT 


GGCTTAACAC 


1380 


AAACAGTTAC 


AAATACAACT 


GAAGATGATA 


AAGATGCCGA 


TGGTGGCGAA 


GCTTAACAAT 


1440 


TACCGTCGAC 


CTCATGATGA 


TTTCACACTT 


GGATAACGGA 


TACCGCGATT 


CAGATTCCGA 


1500 


CAGTGATTCN 


GACTCAGATA 


GNGATTCCGA 


CTCAGATAGT 


GACTCAGATT 


CAGACAGCGA 


1560 


TTCAGATTCA 


GACAGCGATT 


CTGACTCAGA 


CAGTGACTCA 


GATTCCGATA 


GTGATTCAGA 


1620 


TTCAGACAGT 


GATTCAGACT 


CAGATNGTGA 


TTCAGATTCC 


GACAGTGACT 


CAGACTCAGA 


1680 


NAGTGATTCA 


GATTCTGATT 


GCGATTCNAG 


ATTCCCGACA 


GTGACTCAGA 


TTCCCGATAG 


1740 


NGTCTCGGGA 


TTCAAGCGAG 


TGATTCNAGT 


TTTCAAGATT 


AGCGTATTCC 


AAGAATTCAA 


1800 


GATTGG 












1806 
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(2) INFORMATION FOR SEQ ID NO: 899: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899: 



AGAACTATGA 


TACGCAACGA 


AGACAAGTTG 


AAACTGATGT ACACCGAGAG 


TACACATATT 


60 


GATGTTGATG 


AANATTCCTG 


ATCNTGATAN 


ACATGTACNT 


CTTGATGTGC 


TTTGTGAATC 


120 


GCATAATGAT 


GTTGATGTTG 


AATCGGATTC 


ACTTGTACTT 


TCTGATGTTG 


AGCCAGATTC 


180 


TGATGTACAT 


ACCGATGTTG 


ATAAACTTAC 


AATGGTCGAC 


ATGCGGTTTG 


AAGTTGATGT 


240 


ACNTAGCGAA 


TCACTTAATG 


ATGCTGATGT 


GCTTTGTGAA 


TCGGATTCAC 


TGCTTTGCAT 


300 


TGACGCAGAT 


GTTGAATCAC 


TTAAAGATGT 


TGACGACGAA 


TCGGATTCAC 


TTGCGCTTGC 


360 


TGAGTTACTG 


AGCGATGTTG 


ACGTACTTGC 


TGATCCTGAG 


TTACTTGTAC 


TTGTTGAGTC 


420 


ACTGAGTGAT 


GTTGACGCAC 


TTTCTGATGC 


TTGATTCACT 


TGCACTTGTT 


GAGATACTTT 


480 


GTGAATCAGA 


TTCACTCTCT 


GAACCACTTA 


AG 






512 



(2) INFORMATION FOR SEQ ID NO: 900: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900: 



CTTACCCGAC 


CAAGATGTCT 


CGAAGAAGCA 


CNTTGTGATA 


TTGATTCACT 


TGATCCAGTT 


60 


GATGCTGATT 


GCTATCACTC 


GTTGAATCAC 


TAANACGACG 


TTGATATGCT 


CATTGANTCG 


120 


GATTGACTTG 


CACTCATTGA 


ACCAAGACGT 


ACTTTGCGAT 


TCCGAAGTAC 


GTACTGAAGC 


180 


ACTTGCCGAC 


GTTGATGTAC 


TTGTTGAACC 


TGATTGACTA 


GTACTTAGTG 


ATAATGACTT 


240 


ACTATCAGAA 


TCAGATGTAC 


TTTGTGATCA 


CNTAATGATT 


CTGATGTACT 


ACCTGACTGA 


300 


GACGTGCTCA 


TTGAACTACT 


TACAGACATT 


GATTTACTGT 


CTGATGCAGA 


TAATGACCCA 


360 


CTTGTACTGA 


TAGAGTCACT 


TACTATCTCT 


GAAGTACTCA 


TCGAGTCTGA 


TGTACTTGTT 


420 


GAGACACTTT 


GTGATGCTGC 


TATGCTTAGT 


GATCCAGAAA 


CAGAACCACT 


TGTGCTCGTC 


480 
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GAATCGCTCA 


ATGATTCTGA 


TGTACTCATC 


GATTTTGGAA 


TCACTTGTAC 


TTAATGATAT 


540 


TGATGTACTT 


TGTGAATCTG 


ATTTGCTTGT 


TGACGCACTT 


TGAGAGTTGG 


CTATGCTATT 


600 


TGAAATACTG 


ATAGAGTCCG 


AGGTGCTAGC 


TGACTCGCTC 


AATGATGTTG 


ATGTACTAAT 


660 


TGCATTCGAT 


GTACTGTCAC 


TTAATGATGC 


TGATGTACTA 


GACGACCCTG 


ATATACTCGT 


720 


TGATAAGCTT 


TGTGACTTAG 


ACAAGCTTCC 


TGATGTACTC 


ATACTTAATG 


AGTCACTGAG 


780 


TGATGTTGAT 


GTACGCAATG 


AATCAGATGT 


ACTTGTTGAT 


AGACTTTCGG 


ATTTTTCAGT 


840 


ACTGTTAGAG 


TTCGAAATCG 


AATCGCTTAA 


TGATGTTGAT 


TTACTAACTG 


AACCTGACAT 


900 


GCTTGATGAT 


ACACTTTGT 










919 



(2) INFORMATION FOR SEQ ID NO: 901: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901: 



CCCAAATTGG 


CGATAGCTAA 


TATTTGATAA 


TGATGCGCAT 


TTCATACAAT 


TTGTGACTTG 


60 


GCAATTATTG 


AATATTTATA 


TAATTTTTTC 


TCGTAATAAA 


CACAACTTTG 


ATAGCGCTAA 


120 


AACAGCTGTG 


TTAAGTTATG 


AGTTAACGAA 


TTTAACACAT 


TTTACTAGGG 


CGGCATTTAA 


180 


GAATATTTAT 


ACGTTTTTAN 


CGAATATTTA 


TTTATTGTAA 


NACGCTACCA 


AAAAGTTAGA 


240 


CTTCCTTCCC 


ACTAAAGTNC 


CACTTTTTTC 


TTTCAACTTT 


TTTAANAAAC 


GGATATGCAA 


300 


CTTTTAGTAT 


TGGTATCAAA 


ATGATTGTTA 


GGTCATATTC 


TATCAATATA 


TTTTTCTAAA 


360 


GAATTGCTTT 


TATTAACTTT 


CAATTATGTA 


CCTAACCTAA 


AAAGAAGCCA 


AGGCAACGAA 


420 


TGTTACCTTG 


ACTTCTAATA 


CATATTCAAC 


TAACTATATA 


TTCAATCATA 


CGCGCATGCG 


480 


AGAGTGATTG 


TTGTACATCT 


ATAATGCGTT 


GATTTAAAGA 


ACCTTTATAT 


GGTAAATCAG 


540 


GTTTGAATAA 


GTGTTGTATA 


AATAGACCAT 


CGACTAAAAC 


GTCAATGTAT 


GATAATAACT 


600 


CTCGACGTTC 


TGTACAATCA 


TTTGCTAAAT 


ATTCATATAA 


AAATCCAGTC 


CATACCCAAA 


660 


TTGTCTTTGT 


ATTTCCAAAA 


CGTGCTCGAA 


ATGCTTTGAC 


AAGATTTAAT 


GTAATATCCA 


720 


AATTACAAAA 


TGGTTCGCCA 


CCTAATAGAC 


TTAGCCCAGA 


TATATAATCA 


TGATCGCAAT 


780 


CATCTAATAT 


TTCTGCTAAT 


ATTTCATCAG 


TGTATTTCTC 


GCCATATCTG 


AACTTTTGTG 


840 


AGGCTTTGTT 


ATAACATCCA 


ACACAATTAA 


ATGGACATCC 


TGATACATAA 


ACACTGCATC 


900 


TTACTCCTTC 


ACCGTCAACA 


AAGCTATTTG 


AATCCTATTT 


TAGCAATATA 


ACCTTGTCCT 


960 


TGTTTAATGT 


CCTAAAAGTG 


TCATCCTTTA 


GGCGCCTTCA 


TATGTTTTAC 


TCGTGCGCAA 


1020 


ATTTCTTTAT 


GACGGCCTTT 


AATTACTGGA 


CGTTGAACTG 


GATTGCCTAA 


GTAACCCACA 


1080 


TGTTCGTTTA 


ACGACATCAA 


CTGTTTTAGG 


ATTATCATTG 


NCACAAGTTC 


GGGCATTTAA 


1140 




AGNTGCTTCA 


AAATCTCCAT 


CCGNNATCAC 


ATTCAATAAC 


AATGGANCAA 


1200 


ATCGGNATAT 


TTGGNNCCTA 


AGGTTAACCA 


ACTTNGGCAA 


TAAGAGTNGG 


GCCCAATACC 


1260 


CGNTTCTAGG 


GCTTTCAAAT 


TGTGTTGCAA 


TTTCGGATAC 


TCACAATAGT 


GAATGAAACC 


1320 
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ACCACTCGCA 


TAATAAGGAT 


AATCTTTTTC 


» A »TV~**r»7V TV tf"* 


f|V|M|MIW *Jt TV Ti TV/"* 

1 1 X X vAAAAo 


f^*lY2TA A C A TV* 


1380 


TTTACGTACA 


TCATAATGGA 


AAGALT l "i 1 Ivj 


t\ X f\J\. X A X l_V~ 1 


TT ATT^TYZT* & A 


TYSTf >r T w T*l* A AT* 


1440 


ATCTCCAAAT 


CTCTCTTGGT 


CTAAACGACA 


AAA TkrV ATW* 


V7 X 1/UA.Vinnil 


PAriTfin'VST 

X LVjVTV^VJ X 


1500 


ACTCGTAAAT 


ACTGANCCAA 


ATGTCATATA 


AX 1a- XvjIN^l-/* 


X WuCu XAj lun 


TAAr*f3TTJWA 


1560 


TTTCTTTAAG 


AATNACAAGC 


GTAAATGCTT 


GNGCTTCTNG 


AoA X vj 1\. 






CATAGAAAAC 


AGNAGCTGTN 


TCATACAACC 


pTTvivvpa/vv 
LTAI\- I ALjCu 


CATTGAAATC 


GTTGCACGTT 


1DOU 


TATTTTTAAA 


TAACTCAGCA 


ACATCATCTG 


TTTCTTTTAA 


TTTATAGTTA 


AAAGCGCCAC 


± / **U 


TTTTATATAA 


AATTGGTGCG 


TTATTCGGTA 


CAGNATCTTT 


CAAACGATTT 


ATACGATAAA 


1 nnn 

lOUU 


GTAATGCATC 


ATGTAACACA 


TCGATACGTT 


PAmjk A TV lA/" , 7n* 
L.A 1 AAiiAuiA I 


TTCCCAGAAT 


TTCGTCATAT 




TACCGGCAGA 


TTCTAATGCC 


ATTCTAGGTA 


AATTAAGTGT 


NACAACACCA AGATTACAAC 


1920 


GACCATTATT 


TTCAAAATGA 


CCTTCCGCAT 


CTTTCCAACT 


TGGGTAAAAA 


TGAACGACAA 


1980 


CCCATTGGNG 


CTTTGAAATC 


ACCTAATATT 


TCTACGAGTT 


TGGTCAATAA 


TTTAAAATAT 


2040 


CTGGATACAT 


ACGTTTCGGN 


GGAACAACTT 


TAATGCTAGT 


TGTTTAATGT 


CATAGGTCGG 


2100 


ATCTTGGGGG 


ACTAAAGTTG 


GGGTCCCNTC 


TCTATTGGAA 


ATCAAA 




2146 



(2) INFORMATION FOR SEQ ID NO: 902: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1015 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902: 



ATTGTCCCAC 


ACCATTTACT 


ACAAGANACG 


AGCGTCCTCC 


ACATTGTTAC 


NTTGTGCNAT 


60 


CAATNCTTCT 


NGTTTGTTTT 


GAANNNAAGA 


CCGAAGCATT 


TGTANTCTGC 


TGTGTTCGCA 


120 


GCCTTGTTTC 


GCACGTTCTA 


AGGTATTGAA 


TACCGTTTAA 


TGCAGTATTG 


GCTTGTGTAA 


180 


CAGCTTGCAT 


TGCTCTATCG 


ACATCTGCNT 


TAGGTGTATT 


TCCTCCAGCT 


GTTTTATNTA 


240 


AAATAGTTGC 


TGCTGCATTT 


ACTGCTTGAG 


AATAAGCCGT 


TCGTTTAGCA 


TCATCAGCAT 


300 


CTTGATAATT 


TTGACTTTGT 


AACGTCGTGT 


CTTTATCACG 


AATTGATGTT 


TCTAATTGAC 


360 


CCATAGCACC 


ATCTAATTGT 


TGCGCTTTGG 


CTTTAACTGT 


ATTAACACCT 


CAACATTTGT 


420 


TGCTNGTGTA 


ATTTCATTAT 


CTAACGCATT 


ACGTTGTGCA 


TTATTAATGT 


GTGTTAATGT 


480 


ACCTAACGTT 


TGTTTCGCAG 


CAGCTTTAGC 


TTCATTTAAT 


TTCGCATCAC 


CGCNCAACGC 


540 


CGTCTTCGTA 


CTGTNCACAT 


TGTGTAATGC 


TTGNNCAACT 


GCTGCTTTGT 


CTACATTGTG 


600 


ACCACTAGCT 


TTTGTTAAAA 


TTGCTTTTGC 


TGCATTTACT 


GCTTGATCAT 


AAGCTGATTT 


660 


CTTACTTGGC 


TCAGCATCTA 


GGTATTTCTG 


AGTTTGTTTT 


GGTTGTGTCT 


CATCATTGAT 


720 


ACCATTTTGT 


AAACTTGTGC 


ATTGCGTTAT 


TTAATTCTTG 


TTGCTTTNGC 


AGGTTCTTGG 


780 


ATTTACACCA GCTACTGTAG 


GTGGNACCAT 


CAATGTTAAG 


CGTTAACGCA 


TCTTTTTGTG 


840 


CATTGTTAAT 


TGATGTTAAG 


TTATTCAAGT 


TTGTTTTTCG 


CTGTTGTCTT 


AGCTTGAGCT 


900 


AAGTTTTGGC 


GCACCATTTA 


ATGCATGTTC 


TTTAGTNGGT 


CACTTGTGAT 


GTTGCTTGGC 


960 



789 



WO 97/30070 



PCT/US97/02318 



GTAATAGTAT TNGGGNTCCA TCCNNTGGNT TCCACGTTTG GATTAATGAT TGCTT _ 1015 
(2) INFORMATION FOR SEQ ID NO: 903: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : S ingl e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903: 



ATAAATAATN 


CCGCTACAAT 


TTAATACATA 


TGCTTGANTG 


NCTGGNNAAT 


CGTGTTCCAA 


60 


GTANTNGCTT 


NTAAACTTAA 


TTGACCTAGT 


CCAGNCCATG 


CANAGATATA 


CTCGATAACA 


120 


ACTAGTCCAC 


CCATTATCAT 


TGGTATAGAC 


ATACACAAGN 


TTGATACCGC 


AACTTGTATA 


180 


GCATTACGCA 


ACACATGCAA 


CATTAATGTG 


ATAGATTTCA 


CACCGCTTGC 


TCTTAAATAA 


240 


AGTACATAAT 


CTTCATTTAA 


TTGTTCCACC 


ATCGAGCGTC 


TAACATTTNC 


TAAAGTAAAT 


300 


ACCAGCATAG 


GCAATCGTAA 


TAACGATCAC 


TGGCAATATG 


TAACTTTCTG 


GACCTGTTAA 


360 


TCCAGAAGTC 


GGCAATATGT 


TTAACTTCAC 


TGAAACGTAA 


ATAATAAGTA 


TTGAAGCTAG 


420 


TGCCGCTCGT 


GCCGAATTCG 


GCACGATGCT 


AATGCAGTTA 


GAAAGAAAGC 


CACTGAACGT 


480 


ATCGCACGAT 


CAGTGAACTT 


TCCTCTTTTT 


AATGCACTAA 


CTACACCTAA 


AATAATTGAT 


540 


GTAATCATCA 


CCATAACACT 


TGAAATTATT 


GTTAATTTCA 


ATGTGTTCAT 


AAATGCTGGG 


600 


CCAATGCGTT 


CAGCAACAGG 


ATCCCCTGTA 


ATGTAACTTG 


TACCAAAATT 


AAATTGCATC 


660 


GCTTCAAGTA 


ACCAGTTTG 










679 



(2) INFORMATION FOR SEQ ID NO: 904: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904: 

GGCACCAGCT AGAGCGAGTT TTTATATATA CAACACGAAA GAAGACATTG ATCAGTTAAT 60 

AAATGCCTTG AAACAAACGA AGGAGTTTTT CTCTTATGAA TTTTAATAAT CTAGATCAAT 120 

TATATAGATC TGTCATTATG GATCATTATA AAAATCCTAC AAATAAAGGT GTATTANATA 180 

ACGGGTCTAT GACAGTAGAT ATGAATAACC CGACATGCGG TGACCGTATA CGACTAACAT 240 
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TTGATATAGA 


AGACGGCATT 


ATAAAAGATG 


/-^m x x prpmwy< X 


Tv/-v^rn/-»» TAr^T^T 
Avjvj 1 V»AAvi*J 1 






CAATGGCAAG 


TGCATCGATG 


ATGACACAALr 


/"•HY^TTA A Afm 


flU TATTT'TAXPTT' 


tT2f5 Ail A A(V A A 


360 


TCCAAATNGA 


CCCAAGGAAT 


mm x r^f^r^tkXlk X x 

TTACCGNAAA 








■» ^ V/ 


AGGGAAGAAT 


GGGGAGATAT 


TGAAGCA l It- 






UV. IVvr 1AI in 


AftO 


AATTTGTCCA 


CATACCTTGG 


AAACATTGGA 


AAAAvHjVj 1\-L. 




Afv* ara^fT a a 

AuijAuvJU 1 Art 




AACAGGAGGG 


TCCGGCTTGA 


m rf"* X % *"»1V X X X /** 

AGAAGAAAAG 


AI\.CTCj1 aaa 


i\.Al AAKzA l\3 




DUU 


TAAGACATAT 


NNAAGTATNN 


NAATTTTTAA 


mx 7v 7v /■> tv nwiv* 

TAAAGATGTC 


AIvJTL A 1 1 J 


AA I AAA 1 A i\j 


DbU 


GTTTACATCA 


TTGAATTAAA 


AACTTACGCA 


ccgccgi rc r 


AAATATATTT 


1 1 AAVjCj AC* I Aj 


Ton 


ATTGAAATGG 


CTAAAAAAGC 


ACCTGATGTT 


TV MUM TV fTV TV 

GGGGAl l ATA 


AATATUvjA 1 J 




/ol) 


GATGTATCCA 


TTTTCAGATC 


AGAACGTGGT 


TTAACTGAGA 


ATATCGTTAG 


AGAAATTTCT 


O /I A 


AACATGAAAA 


ATGAGCCGGA 


ATGGATGTTA 


GATTTCCGTC 


TTAAATCATT 


AAAAU"i\iTTT 


ft Art 

900 


TATAAAATGC 


CAATGCCTCA 


ATGGGGTGGC 


GACTTATCAG 


aattgaattt 


^*^« x m/^i iv r^* iv mm 

CGATGACATT 


960 


ACTTACTATG 


TAAAGCCTTC 


AGAACAAGCT 


GAACGTTCAT 


GGGATGAAGT 


GCCAGAAGAA 


1020 


ATTAAAAGAA 


CTTTCGATAA 


ATTAGGAATT 


CCTGAAGCTG 


IV Ik TV TV Th TV X m TV 

AACAAAAATA 


1 i 1 AGCTGGT 


i r\ a r\ 
1080 


GTTTCTGCTC 


AATATGAATC 


TGAAGTTGTT 


m m » m *it *it m It 

TACCATAATA 


rtWl}\ XXX TV /" , TV 

TGGAAAAAGA 


7\ / Wflliy* X X X X 

ACTTGAAGAA 


1140 


AAAGGTATTA 


TCTTTAAAGA 


TACAGATAGT 


GCTTTACAAG 


TV X X TV ITV , » TV TV /"» X 

AAAATGAAGA 


x riwrr x n»n/*« x x x 

ATTATTCAAA 


l^OU 


AAATACTTTG 


CTTCTGTAGT 


ACCTGCAGCA 


GATAACAAAT 


TTGCGGCGTT 


AAACTCAGCA 


i o c n 


GTATGGTCAG 


GTGGNTCGCT 


CATTTATGTA 


CCTAAAAATA 


TCAAACTAGA 


TACGCCACTA 


1320 


CAAGCTTATT 


TCCGTATTAA 


CTCTGAGAAC 


ATGGGTCAAT 


TTGAACGTAC 


ATTAATCATT 


1380 


GCTGATGAAG 


GTGCTTCTGT 


ACATTACGTA 


GAAGGTTGGT 


ACTGCACCAG 


TTTATACAAC 


1440 


TAGNTCTTTA 


CACTCTGCTG 


TTGGTGGNAA 


TCATTGGGCA 


TAAAGATGCG 


CACNGTCCGC 


1500 


NTTATTCTTA 


CGAANCAAAA 


CTGCGGGACA 


ATGTTTTCAA 


CTNNAGGTAC 




1550 



(2) INFORMATION FOR SEQ ID NO: 905 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:905: 



GAATTCCTAA 


TGTAGCATTA 


ACTACAGATA 


TTATTGNAGG 


GTATCCAAAT 


GAATCAGAGG 


60 


AACAATTTGA 


NGAAACTTTA 


ACTCTGTATG 


ATGAAGTTGG 


NTTNGNACAT 


GCATATACGT 


120 


ACTTGTATTC 


ACAACGTGAT 


GGNACGCCTG 


CTGCTAAANT 


GAAAGATANT 


GTNCCCCCCA 


180 


NNTNNCANTT 


AGGAACGATT 


GCANCGTTTG 


NNTAAAAAAG 


TTGGTCATTA 


TCCACAAATA 


240 


GCTATGAGTA 


NGTNCGNCGG 


ACANACTGTA 


ACAGCCACNN 


CGGNGAAGGT 


AGNAGTAAAA 


300 


CCNGATGATC 


AGGTTCTTGC 


TGGCTACACT 


GATNNCAATA 


AGCTAGTTAA 


TTTCAAAGCA 


360 


CCTAAAGAAA 


TGATTGGGTA 


ANTAGTGGNA 


GTNCGATNGA 


TGAAGCTAAC 


AGCCTTCNAT 


420 


TAANTNNCAG 


NTCGGNACGA 


AGTTNGAGCC 


GAATGGTGAT 


CCACNAATNT 


TTT 


473 
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(2) INFORMATION FOR SEQ ID NO: 906: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906: 



TGATAATACT 


GCTTAATACA 


ACATTTATTG 


CAATTAAAAG 


TGCAGTAATA 


GCCAGTTTTT 


60 


CNTTGANTTC 


AAAATGANTG 


TCCTCCNTTT 


TTGTTTGTAA 


NTAATCACTA 


TGCTTGGCTT 


120 


TATTATGGTC 


ANNTAAACGT 


GTTTCCATTT 


GTTGATACTA 


ACATTTTCAA 


TAATTGATTC 


160 


GCTTCATATT 


GTGAAGTTTG 


AAACTGTTCA 


NCTATGGGCA 


ATGTATTTAT 


TTCTGCTTCT 


240 


ATACTTTGAN 


TGGTATGTTC 


CGACTGTTCT 


AGCGCATTTT 


GTTTCCCGTA 


ATTTTGAAAG 


300 


TTTACTGCTT 


GTTTTTGATG 


CTTTTTTAAC 


ATATCCATTT 


TAGTCTTTAT 


CGTTTGGTTC 


360 


TGATGAATCT 


GTGCTTCAAT 


TTGTTGATAT 


GTTTTGATAG 


AANCCCAAAT 


TTTAATTTTA 


420 


NTTGCAATAT 


AACCCGCTTG 


TTCCAAANCG 


TCATCTTTAT 


AAAACAATTT 


ATTTGAATCA 


480 


CCNATTTCGG 


GCTCCTACTT 


CCTTTACCAA 


ACTTGCCATT 


TAATGGAATA 


CTGTTTAGCT 


540 


TCATCTATTC 


GTACTTCCAC 


TAGTTTACCC 


AATCATTTCT 


TTAGGTGCTT 


TGAAATTAAC 


600 


TAGCTTATTT 


TTATCAGTGT 


AGCCAGCAAG 


AACCTGATCA 




TACTACCTTC 


660 


ACAAAGTACT 


GTTACAGTTT 


GTCCTTCGTA 


CTTACTCATA 


GCTATTTGTG 


AATAATGACC 


720 


AACTTTTTTA 


TTCAAACGTT 


GCAATCGTTC 


CTTTTTGACA 


TTTAAAGGTA 


CATTATCTTT 


780 


CATTTTAGCA 


GCAGGCGTAC 


CATCACGTTG 


TGAATACAAG 


TACGTATATG 


CATGTTCAAA 


840 


ACCAACTTCA 


TCATACAGAG 


TTAAAGTTTC 


TTCAAATTGT 


TCCTCTGATT 


CATTTGGATA 


900 


CCCTACAATA 


ATATCTGTAG 


TTAATGCTAC 


ATTAGGAAGT 


CTATCTTTGA 


TTCGTTTTAC 


960 


TAAATCCAAA 


TAACTTTCTC 


GTGTATATTT 


TCTACCCATT 


ATTTTTAATA 


CTGCATTATT 


1020 


TCCAGATTGA 


ACTGGCAAGT 


GGATATGAGG 


AACGATATTA 


CCACCCTCTG 


AAATAACATC 


1080 


AATCATGTGA 


TCTGTAAAGT 


CCCAAGGATG 


ACTTGTTGTG 


AAACGAACTC 


TTGGAATCGC 


1140 


TATTTTAGAA 


ATTGCTTGTA 


AAAGATCTCC 


TAAGTCATAT 


TCTATATCCT 


GTAAATCTTT 


1200 


ACCATAAGAA 


TTTACATTGT 


GACCTAAAAG 


CGTTATTTCT 


TTGTAACCTT 


CACGAGCAAG 


1260 


TTCACGTACT 


TCATCTATAA 


TGTCTTCAGG 


TCTACGGGTC 


CGGTCCTTAC 


TCCTTGTNAA 


1320 


TGGAACAATA 


CAATATGTAC 


AAAACTTATC 


ACAACCATAC 


ATAATATTGA 


CCCATGCTTT 


1380 


ATGTTGCTTC 


ACGGACTTGT 


GGNAGATTAC 


AATAACGTCT 


NCTTCTTTTA 


GACATACTCA 


1440 


CAACAATGT 












1449 



(2) INFORMATION FOR SEQ ID NO: 907: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1589 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907: 



GTCGCTCATT 


TTGTATCATC 


ATTTTCTTAG 


CGTTGTGATA 


GTGCCGGGTT 


CTCCTCCAGC 


60 


NCTGCCTGCA CCATCTTATA 


TTGCATTNCN 


TGGGTGGNTA ATCCGCCGGG 


TTAATATTCT 


120 


TTGTGANTCG ATATCCTNAA 


TTGNAAAATA 


TGGATAATGA 


TGNATTAAGT 


CGCTTGATTT 


180 


TAAATAGAAG 


TGTNAATGNA 


GTTGATGATA 


TGATTGAAGA 


ACCTGAAAAA 


GANNAAACTA 


240 


AATAATAGAA 


GAGTCGCACA 


ATATACCTTC 


TTCATTCGGA 


GGCGTATCGT 


GCGA'ITTTTT 


300 


GTATTATANA 


TTGACATTTA 


AGACGAGGCA 


GCTGNACCTT 


ATATATAATT 


GCTAAGAGTT 


360 


TGGGCTGGAG 


CCATTTCTAA 


CAATATTTAT 


AATCGTTTAA 


AAGATTTCAC 


GAACCCAGAA 


420 


ACAATTAATT 


TGGAAATTTG 


GTCGGCGAAT 


AATAAACCTA 


ATGCGATGGG 


CGCCTGCAAT 


480 


AAGTGTTACT 


TCTAACATGG 


TATTGATTGC 


TGTGCTGAAA 


TTTAATAAGA 


CTAAATTTTT 


540 


TGTAGCATCG 


TATGCTAATC 


CCACCCTGGT 


ACCTAATGGA 


ATGATACCCG 


TTACCCATAA 


600 


AAATTGATGG 


CAGGTTCTTT 


TTTGTTTACG 


GACCAAAATA 


ATGACTTACC 


AAGCCTAATT 


660 


GCTAAACCTC 


CCAAAGAAAC 


TAGAGAATAT 


AGTGTGCACC 


ATTAAAACCC 


GTTGGAAGAA 


720 


TAAGGTTGTA 


AACCAATCCA 


TCCAACATGT 


ACCCACCGAA 


CCACATGATA 


GATATAATTT 


780 


TCTAGGTGCA 


TCAAAAATGA 


CGCAGAAGAA 


CATTGAAGCT 


AAAAAGCTAA 


AGATAAAGTT 


840 


TAAGATCCAA 


AACATAGTCT 


GATACTCCTA 


TACTAAAATT AATACGCTAC 


CAACGCCAGC 


900 


ACCGATGCCA 


AACGCAGTAA 


CCAATGCTTC 


TAATGATTTC 


GTTGTGAACA 


TCAACATGTG 


960 


TCCACCAAAT 


AAATCTTGTA 


TTGCGTTTGT 


TATTAATACA 


CCAGGAACAA 


TAGGCATGAC 


1020 


TGCCGCAATG 


ATAATAGTTG 


CCAAGTCACC 


TGTTGGGATA 


AGTGTATGTC 


CAATAACGGC 


1080 


GATAATCCCA 


ATAACTAATG 


AACCAATGAA 


TTCTGGGATA 


AACTGTGCAT 


GTAACTTGCG 


1140 


ATCTAAAATC 


TCAGTGACTA 


GGTATCCTAG 


ACTACCTGCT 


AATATCGCAG 


TTAAAACATC 


1200 


AATCAATCTA 


CCACCTTGTA 


AATATAAGAA 


ACTCATTGCA 


ATCATTGCTG 


CAGCAAAACC 


1260 


TTTAAAGGGA 


AGACTGCTGT 


CACGCTTAGC 


AACATATATT 


TTTTCAAGTT 




1320 


TTCGGCTAAA 


GAAATTTCAT 


TATTTGTAAT 


TTGACGCGAA 


ATTTTATTAG 


CTTGCGAAAT 


1380 


TTTTATTAAG 


TTTGTATCTC 


GAGAGGTAAT 


TCTAAATATT 


CTAGGAAACG 


ATTCCCGAAT 


1440 


GTAACGTAAA 


CTGGATGACA 


GTGTTTGTAA 


CAAAGCTTGT 


TACTTTCACT 


TGTAACCAAG 


1500 


'riTTTTTGCA 


ATACGTGTCA 


TGGTATCTTC 


TACAACGCTC 


GTGCCCTTCC 


TGCAACCAGA 


1560 


TTCTTAATAG 


TATGGCGGAG 


CAGGAAGCA 








1589 



(2) INFORMATION FOR SEQ ID NO:908: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:908: 



GCCCACTTGC 


TTGTTCTAAT 


CCATAGGGTT 


TTTAAAAGAT 


CCCTCCTAAG 


AAANCCTAAT 


60 


TTCATTTCTG 


AATAAGAATT 


TAATTTGAAT 


ACCCGTTTGA 


AACAANTTAG 


GTTGTTCCAA 


120 


ATTTGTCTTC 


ATCCAATATG 


AATAACATCA 


GTNGGGTGGT 


TTTTATAATG 


TTCAATATAA 


180 


TTCAATATGA 


TTTCTTTTAC 


TTCTTGCACC 


ATTTTGCCTA 


CTAGTGGTTC 


TGGCTTTAGC 


240 


ATTAAACATT 


TATTATATTT 


TGATAAATTA 


CGATATTCAT 


TCGGTGTTAT 


ATCCATATAA 


300 


CTTTTAAACA 


TTTTAGAGTA 


AGTGGAGCTA 


CTACTAAAAC 


CTAAATGCTC 


ACTTATGCTG 


360 


CTAATAGTAC 


TATCAGTAGT 


AAGTAGAATT 


TCAATCGATT 


TACCAATTTT 


CAAAGTATCA 


420 


GATGTCTTTA 


CACTCCGCTA 


CTTCCGAAGC 


CTTTTGCTCC 


ACGTTCTGAA 


ACACATGCCC 


480 


ATAAGTAAGT 


GGAATTGTGA 


TGACAAATTT 


GACTTTGAGA 


CAAATAATTT 


ATCAGCGACA 


540 


CTTTTTAATG 


TCAATCGNTG 


ATTTACATTG 


TGATAAATAT 


ATTCGAGCTC 


GTCACGTAAA 


600 


TCACCATAAT 


AGGCATATCG 


ATATTGATTG 


TGCGCAATCT 


TTTTATCCAT 


ACTTGCTTCT 


660 


TGTGNAATGA 


TTTTAATAAT 


ATCCTGTAAT 


TTTCTNATAT 


CAGCGTCATT 


GAGAGGTTGA 


720 


TTATCAAGAT 


ATCGATAAGT 


CAATTTTANT 


AGTAGACATT 


TCAAATTATA 


GGATGATTTA 


780 


ATCAACTTTG 


NTGTATAGTG 


GTAATTAAAG 


AAAGTAAAGC 


CTAATTCCGC 


AAACCAATCG 


840 


CTACTCAAAT 


ACAACATGAT 


CGNTTNNTTA 


TTATTCTTAA 


TACTAAATAC 


ATCCAGATGA 


900 


TTAATAATAT 


ATATATCATC 


TTCAATTATT 


TTACTTTTAC 


GAAATTTT 




948 



(2) INFORMATION FOR SEQ ID NO: 909: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909: 



GGGACGTTGN 


TGTTTTTTCC 


NGAGGCTTGT 


GCTTTCATGT 


GATGACCCCA 


CCCAAACATT 


60 


TGTAGCCGCT 


AACNTTTGAC 


NCCCACGCCT 


ATTGAATACT 


TTCTAATACA 


ATACTTTCTT 


120 


TCTTCTTGTC 


CTCTTAACAT 


ATAATCACCC 


CANAAGTATT 


TTTTTGACCC 


CCTCTTGAGA 


180 


ATGNAAAACA 


ACTGCACCTT 


CAATATAACA 


TGACATATAT 


TATTTTTTGA 


TTAAGNCTAC 


240 


AAAATTAATA 


AAAATTTAAT 


AATNCGATAT 


TAAATCGATA 


CTAATCATTT 


ACAAATATGT 


300 


GCATTTCCAA 


ATAATTTATA 


AACACTTTTA 


GTTGTGGCGT 


TGAATCCAAC 


ACAACCAACA 


360 


TTTAAGATAA 


ATATTTTATT 


AAGTAATATA 


GTTTCAATCT 


ATATATAATT 


AGTTAATTTG 


420 


TATTGGTATG 


AAAATAATTC 


AAACTANACA 


ANTATAAGTA AGTTTTCAAG 


ATTTTTCTAA 


480 


ATTAAAATAA 


TGATTCATCT 


CATTAAGGGG 


AAGATGCATC 


ATACAGTTTT 


GTGTATTCAT 


540 



794 



WO 97/30070 



PCT/US97/02318 



TTAAACANAG 


GGGNTGCNTT 


AATGAATCAA 


TATCATTCTA 


ATGCACAACC 


NNCCAAGTGC 


600 


ATGGCGTTTT 


TTTGTCTATA 


GTTTAGTGGG 


CATACTATGT 


l"i\ri _ i"iATTC 


CTTTTACGAT 


£ c r\ 

660 


TAATGGTAAC 


AACACTATTT 


TCGTCGATCA 


TGTTCATCTA 


GCCATTCGCT 


CAATCATAGG 


720 


TCCACTTATG 


CCCTATGTTG 


CACTGATTAT 


GATTTTAATT 


GGTACAGCGT 


TACCAATAGT 


780 


GAGACGTACT 


TTTATGACTT 


CAATCACAAA 


CTTGGTCATT 


ACATTATTTA 


AAGTTCCAGG 


840 


TCCAAGTGGA 


NTGGAATAAT 


GTATGNATTN 


AAAATCGGTC 


CATCAATACN 


ATTTAAAGCT 


900 


ANCTATGCTC 


CGTTCTTGTT 


TGAAAAATTA 


ATGATGCCAT 


TAAGTATCTT 


AATTCCAGTA 


960 


GGTGCAATTG 


CGCTTTCTTT 


ATTAGTAGGC 


TATGGCTTAT 


TAGAATTTGT 


CGGTGTTTAT 


1020 


ATGGAGCCTA 


TTATGAGACC 


TATTTTTAAA 


ACACCAGGAA 


AATCCGCTGT 


CGATGCAGTA 


1080 


GCTTCGTTTG 


TCGGCAGTTA 


TTCCTTAGGA 


TTATTGATTA 


CTAATCGTGT 


CTATAAGCAA 


1140 


GGGATGTACA 


ACAAACGAGA 


AGCCACGATT 


ATTGCGACTG 


GCTTTTCAAC 


AGTTTCAGCA 


1200 


ACTTTTATGA 


TTATCGTNGC 


TAAGGANCCT 


TAGGGCTAAT 


GCCGCATTGG 


AATTTATACT 


1260 


TTTGGATAAC 


TTTAGTCATC 


ACATTTGTCG 


TGACTGCAAT 


TACTGCATGG 


CTACCGCCAA 


1320 


TCACCAATGA 


ATCAACAGAA 


TATTATAACG 


GACAAGAAGG 


AGAACAAGAA 


GTTGCTATTG 


1380 


AAGGAAGCAG 


ACTGAAAACT 


GCATATGCAG 


AGGCGATGAA 


ACAAAATGCA 


TTAACACCAT 


1440 


CTCTCGTGAA 


GAACGTTTGG 


GACAATTTCT 


AAGATGTCTT 


TACACTCCGC 


TACTTCCGAA 


1500 


GCCTTTTGCT 


CCACGTTCTG 


AAACACTTTC 








1530 



(2) INFORMATION FOR SEQ ID NO: 910: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910; 



GGAAAGTTGC 


CACCANTTGC 


AAGAAATACC 


TAGCTCAAGC 


GTTATGGATT 


GACCCAATAT 


60 


TCAATCGGAT 


AGAAGAATAT 


AAGAGGTGGA 


TAAATTGGCG 


AAGAAAGCAA 


GGATTGTACC 


120 


AATAAATGAT 


AAACCATATA 


GTTTACCCAA 


ATCTGAAATG 


GAATTAATAG 


AAAGTCNCGG 


180 


GTATAACCGC 


TGGAATGGTT 


TCTAAGAGAG 


TAAAAGACGG 


TTGGGAACTA 


CATGAAGCAG 


240 


TAGGGATTAT 


GTCGGGGCAA 


GTGGTCTATA 


AATATGAGGA 


GGAGCAGGAA 


AATGAAAAAA 


300 


TTTAATGTTC 


AAATCACATA 


CACTGGCATG 


ATTGAAGAGA 


CTATCGAGGC 


TGAAAGTTTA 


360 


GAAGAAGCAG 


AAAATGAGGC 


GCGTGATATT 


GCGATGATGG 


AAGTACCATT 


TGATTGTGAT 


420 


GAATATGAAA 


TTAATGTAGA 


GGAGAAAGAA 


TAATGACTAA 


CACATTAACA 


ATTGATCAGT 


480 


TACAAGAGTT 


ATTACAAATA 


CACAAGGAGT 


TCGACGATAG 


AATACCAACG 


CTGAACTTAC 


540 


GAGATAGCAA 


AATAGCATAT 


GTAGTTGACA 


CTATAGAATA 


CTCAAGCTTG 


CATGC 


595 



(2) INFORMATION FOR SEQ ID NO: 911: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2005 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911: 



GGCACGAGCT 


CATACATCGC 


TAGTAAGAAT 


AATAATCTTA 


GTATTTGTTT 


AGTCATTCCC 


60 


CACACTCCCT 


TATATTTTCA 


AACAACTGAC 


TCACTTTAGC 


ATAATTCCAT 


CTCTCCTATC 


120 


TTAACCTTGT 


GCCTCGTANT 


TGCGCTCAGC 


TTCTTCTTTA 


CTCTCTGCCT 


CAACAACTGT 


180 


AAACGTCTGA 


TTATCTCTAG 


CAGTAGTAAA 


ATGTTCATGT 


GGTTGTCCTT 


GTTGAATCTT 


240 


TGAATGTTGT 


GACTAAGTAT 


TGTGTCATTC 


CTCATAGCTC 


CCTTGAACTT 


GTTTGAGCTT 


300 


ACTCATAAAA 


AACATTACTA 


AAAATGCTAT 


TAAGATATGC 


GTCTTTTGAT 


GTTTATAAGC 


360 


AAATGTAGAT 


ATCATAAAGA 


TAGTAGCAAG 


CATTAACATT 


TCATATATGT 


TTGTGTGTAT 


420 


AGTCTTTTTA 


CTCTTAAGAA 


AAATAATTGC 


TATGCGATAA 


AAGAGATAAA 


CGCCAAACCC 


480 


TATTAAAAAT 


ATTTCTAACA 


TGTCGCTCAC 


TTCCCCAAAA 


CCTCCTTGAC 


TCGATCTAAG 


540 


ATGTCTTTAT 


ACTCCGCTAC 


TTCCGAAGCC 


TTTTGCTCCA 


CGTTCTGAAA 


CACACTCGAA 


600 


TTCCTCCACT 


TGCTTTAGTT 


CAGGTGTCCA 


TATAGGCACG 


ATAACCAATT 


GAGCTAGTTT 


660 


GTCGCCTTCG 


TTGATTTGAT 


AAGTTCCATA 


TTGTCTTATG 


GCGTCACTCA 


AATCGATTTC 


720 


TCCTTTAATA 


TCAAAAACAC 


CTGGTGTGAT 


ATAACCATTC 


GATGCAATAG 


CGTCATTCTT 


780 


GATATTAATC 


CCTAAATTGC 


CGTGATATCC 


CGCGTCTATC 


TTGCCTGTTT 


CAATCACTAA 


840 


ATGCGTTTTA 


CTACTTACAC 


CACTACGACT 


AGTTAATAGT 


CCGACATAGC 


CCTCTGGTAT 


900 


ACTCACAGCT 


ACATCTGTTT 


TGATCACTGC 


TTTTTCTTGT 


GGTTCGAGTA 


CGACAGTTTC 


960 


AGCTGAGAAT 


ATGTCATAAC 


CTGCATCCGT 


CTTATGATTT 


CGTTCGGGCA 


TTCTAGCATT 


1020 


TTTTGATAAT 


AGTTTTACTT 


GTAATGTGTT 


AGTCATTTTC 


CTATTCCTCC 


TCATATTTAT 


1080 


AGACAACTTG 


ACCTGCCATA 


ATCCCTACTG 


CTTCATCAAG 


TTCAATACCT 


CNTTTAACTG 


1140 


AATGTTGAAT 


AGCATTTGTC 


ATTCCCTCAA 


GTATTTCATC 


AAACGCTTGC 


GCTTTCTTAT 


1200 


ACACGTCCTC 


AATCTCTTTT 


AGCAACCCCT 


CTGTGTCATT 


ACCGTATACG 


CACTAGCACT 


1260 


AATAACGGAC 


TGTTCGATTT 


TTTCGCGATT 


ATTCATTGGT 


GTCATCCTCC 


ATAAAAATTT 


1320 


TATTGTTTAA 


TTCCATTCCG 


AATTTAACTC 


TTTCATCATC 


GTTACCGAAT 


TCGTTTATTA 


1380 


AATCTTTTTC 


AACGCTCTTG 


CAATACCTAT 


CCCATGCGCT 


TQcrrrciTC 


TCCAGTTCTT 


1440 


TGTTACAATC 


TCGTAACTTC 


GCTATATCCC 


CAATAAGCTC 


ATCTCGTTGC 


TTCTTGTACT 


1500 


CTTCACGATC 


TTTTAATGCT 


TTGTGAAGTT 


TATCTAATAA 


CTTGTTAGAG 


TTAGTACAAA 


1560 


GATTTTTATA 


TTGTTCATCT 


GATAAGGTGA 


ACGTCATCTC 


ATAACCTCCA 


ATAGCATCTC 


1620 


ATTTTCAAAA 


ATATTTCCAA 


CAATTTCAAT 


AATATCGGCA 


TTTTCACTTA 


GTAATTCAGT 


1680 


TACATTGCTA 


AAAGTTATAT 


AAAAGGCTCC 


TTCTTTAAAC 


TCGATAAAAC 


TTACTTCTCT 


1740 


CGAATTAACA 


ATCTTGGAAC 


AATATCCCCT 


TCATAAATCT 


CCACAACCGG 


GCACATCTTT 


1800 


TAAATCCTGT 


GTATTGGTAA 


TAGGTTTTAC 


TTCAATTGAA 


ACTTTTATTA 


ACCTGGTGGA 


1860 


AATCAAAATG 


TACCCACTTA 


TTAAAAATCG 


GATTTCGGNC 


AATAATACTC 


AATAACTTNN 


1920 


NNTATCTTNA 


TCCCAAGCTT 


TTAATTTCAA 


CATCAATCTT 


ACCAACTCCC 


CATCTTTCCA 


1980 
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AATCAATGTC AACCGGCAAN GTCAC - 2005 

(2) INFORMATION FOR SEQ ID NO: 912: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912: 



CTACATGGTA 


ACCAAAGAAT 


TTTTAAAAAC 


TAAACTTGAG 


TGTTCAGATG 


TGTACGCTCA 


60 


GAAACTCATA 


GACGAGGCAC 


AGGGAGACGA 


AAACAAGTTA 


TATGACCTAT 


TTATCCAAAA 


120 


ACTTGCAGAA 


CGTCACACAC 


GCCCCGCTAT 


CGTCGAATAT 


TAAGGAGTGT 


TAAAAATGCC 


180 


GAAAGAAAAA 


TATTACTTAT 


ACCGAGAAGA 


TGGCACAGAA 


GATATTAAGG 


TCATCAAGTA 


240 


TAAAGAGAAT 


GAGAATGAAG 


TTTATTCGCT 


CACAGGAGCC 


CATTTCANCG 


ACGAANAGAA 


300 


AATTATGACT 


GATAGTGACC 


TAAANCGATT 


TAAAGGCGCT 


CACGGGCTTC 


TATATGAGCA 


360 


AGAACTAGGT 


TTACAAGTAA 


CGATATTTGA 


TATTTAGAGG 


TGGCGCATGG 


AAGTACATTA 


420 


CAGTAGTAAA 


ACNAACGAGT 


GGACNACNCC 


ACAACATTTA 


TTTGATGACC 


TAAGCGAAGA 


480 


ATTCAGTTTT ACATTAGATC 


CTTGTTCAAC 


AGACGAGAAC 


GCCAAATGCC 


GGAAGTATTA 


540 


TACAGTAAAA 


GATAATGGGT 


TAATTCAAGA 


CTGGTCTGAG 


GACATTGTTT 


TTATGAACCC 


600 


GCCATACGGT 


CGAAGTATTA 


AGCGTTGGGT 


CAAGAAGGCT 


TACGAAGAAA 


GTTTGAAAGG 


660 


CGCAACGGTA 


GTTTGTTTAA 


TACCCGCAAG 


AACAGACACG 


ACATATTGGC 


ATGATTACAT 


720 


TTTTAATAAG 


GCTGATGATA 


TAAGATTCCT 


ACGCGGTCGT 


CTGAAGTTTG 


GAGATAGTAA 


780 


AAACAGGGCT 


CCTTTTCCTA 


GCGCAATTAT 


CGTTTATAGA 


GGTGCACAAA 


TGAGTAAATA 


840 


CAACGCTAAG 


AAAGTTNGGA 


GTACAAAAGG 


AATTNGTTTT 


TGGATAGCAA 


AAGNAGAATG 


900 


TNGAATATTT 


ACCAAATATT 


TAGAAAGTAA 


TATTGAATGG 


CACTAACTAT 


GATCGTATCG 


960 


AAATACAACC 


GAAATTTGGG 


AAACAAAGAC 


CGATTACGTA 


TATAGCCGAT 


TTCTCTTTGT 


1020 


GGAAGGAAGG 


GAAACTGGTT 


GGAAGTTATA 


GACGTTAAAG 


GTAAGGCGAC 


TGAAGTTGCC 


1080 


AACATCAAAG 


CGAAGATATC 


CAGATATCAG 


TATTGAGATG 


TGATTTTACC 


GTGG 


1134 



(2) INFORMATION FOR SEQ ID NO: 913: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913: 



CNCCAACTGA 


GCTACCTTTA 


ATATCGCCGT 


CTATATGCAG 


TTTTAATATT 


GGAGACTGCG 


60 


TTGGGTGTAA 


TTGGGTACTT 


CTCTCTAAGT 


TTNTNCACAT 


CCCCANCACT 


ATTATNCCTT 


120 


GTGNACTCTG 


CTACATAATA 


NGGAACATTC 


ACANTGNTAG 


ATATTCTTCC 


ATTTNTATAA 


180 


TTTCACAAGT 


CTNTAAAATC 


GGCATATTGC 


CCGAAAAATT 


TAAAATTCTC 


TATTTCGTGT 


240 


TTAAGATTCA 


CATCTTCTAC 


TGTGCCCAAT 


AGAACTATCT 


TATTGGTNTC 


TANANACAAC 


300 


ACGATATTTG 


GCNCGAGCGT 


GNCAACTTAT 


CCACTACATC 


ATAAGTTGCT 


CTTACAGTAT 


360 


AATATCCTGT 


TGCTGCCCCC 


GTATTTCTAT 


TAAAATGTAG 


TACCATACCT 


TCGCTATTCA 


420 


TTTCACCTTG 


CTTGTTGCTT 


TTAGCAAAAT 


CTGTTGATAT 


TGTCCAAGTC 


CCTTTATCAC 


480 


CTTTTTTAAA 


TTCGCCATCA 


CGATATCCTT 


CTTTGTCAT A 


TAGATTCTCG 


AGATTCTTAA 


540 


TTGGATACAT 


ATCTAACGTT 


TTCGAAAAAG 


CTCTTTTTAA 


TTTGTGCTTT 


CTTTGGAACC 


600 


TTCTTTTGGA 


TTTTCATGCA 


CTTTTGCCAC 


AACCTGCTAT 


CGCAAATATT 


AAAATCATAA 


660 


CGCTTATGTA 


CAATGCAAAC 


CTTTTTGAAT 


ATCCCATCAT 


TTCACGTCTC 


TCTGCTTTAG 


720 


AATTATTTCT 


ACATCTAATA 


AACTTTAGCT 


ATCTTCATCA 


GACGGCCCAA 


AATTTAAATA 


780 


ATCTCTTAAT 


GTTGTCTCTT 


GATCTTTAAC 


TTTTGAAAAT 


TTATATTCTA 




840 


TCCAACTGAA 


CTTCCTTTTC 


TCGTGCC 








867 



(2) INFORMATION FOR SEQ ID NO: 914: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914: 



GTTTTCCTGT 


TCTCTTTCAT 


CAAGGCTTTT 


TCTTTGAATC 


AATTTCTTCT 


ATTGAAAAAT 


60 


AACCGGNTGT 


GGTCCTAGTA 


TTTCTATTCA 


TTGTATAATA 


TCATTGCCTC 


TTTGAAGTTA 


120 


AACTTTTACC 


TTTTGGGTTG 


AATTATCATT 


TCAGATCTAA 


CAACCCACGT 


CCCTTTGTCT 


180 


CCCTTTTTAA 


ATTCTCCATC 


TCGATATCCT 


TCTTTATCGT 


ATAAATCCTC 


AAGATTTTTA 


240 


ATTGGATACA 


TATCTAACGT 


TTTCGCAAAG 


CTCTTTTTGA 


TTTGTGTTTC 


TTTTGAATCT 


300 


TCAGGAGTGT 


CAAATACGCC 


ACTGCAACCT 


GATGCTAATA 


GAATTAAGAC 


TGACGTCCTA 


360 


TACACAACGC 


ACATTTTCTA 


AAATTATTCA 


TTGTTCCGTA 


TCCCCTTTTT 


TATCATAGCT 


420 


AATAGTTCTC 


ACTTGAGGTA 


AACTCCAAAC 


TATCAGTAAA 


AAATATATTC 


TCTTCTAGAT 


480 


TCTCCACGAA 


TGTAAACTCA 


ATGTCCTTAT 


GTCCGATTGA 


TGAACCTTNN 


AAATCTCCAG 


540 


TACCTTTCAA 


CAACAATGTG 


GGNAGNCNTC 


NCTGGNGGAT 


ATATCATATC 


TAACACGTAA 


600 


TTGCNCNACA 


TTATATTCAT 


TATTTCTCAG 


TCGATAT 






637 
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(2) INFORMATION FOR SEQ ID NO: 915: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915: 



AACTTTGAGT 


NTTTTGTTCN 


ATATGGAACT 


TTAAAGATTT 


GGAAATTATN 


TAGATGGAGA 


60 


TATATCATAT 


AATCCAGAGG 


TGCCNAGTTA 


TTCAGCTAAA 


TATCAATTAN 


CTAATGATGA 


120 


TNACAATGTA 


AAGCAATTAC 


GTAAAAGATA 


TGACATNCCA 


GCAGAATAAA 


GCACCCAAAG 


180 


TTATTTGTTG 


AAAGGGTACA 


GGGTAATTTN 


AAAGCCTCAT 


CAGTTGGATA 


TAAAGACAGT 


240 


GAATTTACTT 


TCGTAGAGAN 


NAAAGGTGAA 


AATATATACT 


TTAGTGATAG 


TCTACATCTT 


300 


GANCCAAGTG 


AGGATAAATA 


AACGTGACCA 


ATAAAGAGTA 


TGAAATCGAA 


CCCGGAAAAA 


360 


GAGAGTGTGA 


AATGATGAAA 


CGATTAAATA 


AATTAGTGTT 


AGGCATTAGT 


TTTCTGTTTT 


420 


TAGTCATTAG 


TATCACTGCT 


GGTTGTGGCA 


TAGGTAAAGA 


AGCGGAAGTT 


AAGAAAAGCT 


480 


TTGAAAAAAC 


ATTGAGTATG 


TACCCTATTA 


AAAATCTAGA 


GGATTTATAC 


GATAAGGAAG 


540 


GCTATCGTGA 


TGATCAGTTT 


GATAAAAATG 


ATAAAGGTAC 


ATGGATTATA 


AATTCTGAAA 


600 


TGGTTATTCA 


ACCTAATAAT 


GAAGATATGG 


TAGCTAAAGG 


CATGGTTCTA 


TATATCGAAT 


660 


AGAAATACCA 


AAACAACAAA 


TGGTTACTAC 


TATGTCGATG 


TGACTAAGGA 


CGAGGATGAA 


720 


GGAAAACCGC 


ACGACAATGA 


AAAAAGATAT 


CCGGTTAAAA 


TGGTCCGATA 


ATAAAATCAT 


780 


TCCAACAAAA 


GAAATTANAG 


ATAANAACAT 


AAAAAAAGAA 


NTCGAAAACT 


TTAAGTTCTT 


840 


TGTTCAATAT 


GGAAACTTTA 


AAGATTTGTC 


GAAGTACAAA 


GATGGAGATA 


TTTCATACAA 


900 


TCCAGAGGTG 


CCAAGTTATT 


CACCAAAATA 


TCAAGTAACT 


AATGATGACT 


ATAATGTAAN 


960 


ACAATTNCGN 


AAAAGATATA 


GATATACCGA 


CCGAATAAAG 


CACCANAGCT 


ATTGTTGANA 


1020 


GGTACAGGGA 


ATTTAAAAGG 


TTCATCAATT 


GGCTACACAA 


AAATTGAATT 


TACTTTCGTA 


1080 


GAGAAAAAGG 


GAGAAAATAT 


ATACTTTAGT 


GATGGGCTAC 


AG 




1122 



(2) INFORMATION FOR SEQ ID NO:916: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916: 
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CAATTCAGAA 


AGGGGGAGAA 


GCCTTAAATA 


TNAGAAGGTA 


TGTTTTTGNA 


GTTAAATAGA 


60 


AANACAAGGA 


GTGCGNNAGG 


ATTTTACTAT 


GTTAATGCGA 


TAAAGAAGGN 


TGTAAATGGC 


120 


AGACCTCAGG 


ATAACCAAAT 


AGAATATCCC 


GTTGAAATGA 


TTGACAATAA 


ANTCATTCCA 


180 


ACTAAAGATA 


TCNAAGATGA 


NNAAATAAAA 


AAAGAAATCG 


AAAACTTTAA 


GTTTTTCGCG 


240 


CAGTATGGCA 


ATTTTAAAGA 


TTTGACGAAG 


TATAAAGGTG 


GAGATATTTC 


ATATAATCCA 


300 


GAAGCCCCGA 


TTTATTCAGC 


AAAATATCAN 


CTANCCAATG 


ATGATTATAA 


TGTAAAGCAA 


360 


TTACGTAAAA 


GATATGATAT 


ACCGACGAAT 


AAAGCTCCAA 


AGTTATTGTT 


AANAGGCACA 


420 


GGAAATTTAG 


ATGGCACATC 


NATTGGATNC 


NAAAAAATTG 


AATTNACTNG 


CGGNNAGAAA 


480 


ANNGGAGAAA 


ACNCTCACTT 


TACANCTAAC 


CTACATTTTA 


AACCGAGTAA 


TGATGAATAN 


540 


TCGTGACTAA 


TAAAGAGTAT 


GAACTCGACT 


CAAGGAAACA 


ATATAAANAG 


AAGAGAGGTG 


600 


GTNAGGTGAT 


GANACCGTCT 


ANATAAATTA 


GTG 






633 



(2) INFORMATION FOR SEQ ID NO: 917: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1975 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917: 



CTCTTTAAAA 


TCGGNATACT 


GACTAAAAAA 


TTTAAAATTT 


TCAATTTTTT 


GTTTAAGTTT 


60 


TTCGTCTTCT 


ACATTATCCA 


AAAGAATAAT 


CTTATTATTT 


TTAAGTTCAA 


CACGGTATTT 


120 


TTTCTCATTC 


TGATTTTTGN 


TAATATCATC 


ATAAATTTTA 


TTTACAAAAT 


AATAACCTGT 


180 


TGCCTTTGTG 


GTATTTCTAT 


TTAGATATAA 


AACCATGCCT 


TCGTCATCTA 


TTTCACCCGG 


240 


TTTGTTACTT 


TTTGAAAAAC 


TTGTGAGTAA 


AGTCCAAGTC 


CCCTTATCGC 


CCTTTTTAAA 


300 


TTCAACATCT 


CTATATCCTT 


CTTTGTCGGA 


TAAATCCTCG 


AGATTCTTAA 


TTGGATACAT 


360 


ATCTAACGTT 


TNCGCAAAGC 


TCTTTGTGAT 


NTGCTCTTCT 


TGTGAATCTT 


CTTGNGGCTT 


420 


TCTCCGAAGT 


ATCGCTTGAN 


CTATCACAAC 


CTATTATAAA 


AATGCCCANA ATAATAACAC 


480 


TTATGANCAG 


TGCAAGCTTT 


TTTAAAGATA 


TCCCAATTTT 


TTCCACAATC 


CTAANAATCC 


540 


TTATCCAGAT 


ACCTAATTTT 


TTCNCTGGTT 


TAATAACCTT 


AAAAAATCCA 


ATTGACTTGA 


600 


TAATTTGCTG 


TTTTTCACCG 


GTTTTGAAAA 


AGTCAATTTC 


TAACTTTCTA 


ATAACCCTAC 


660 


AAGAACTGCC 


TTTCAAATCA 


CCCATCCAAT 


ATGCATTTTT 


AATATAGGTG 


ATTTTTCAAG 


720 


TCGAAATATT 


ATAACGACTT 


CTTAATTCCT 


TAACTATTTC 


ATCTTTATTG 


CTCATTTTGT 


780 


ATTCAACATC 


ATAACTAGGA 


ACATTTTCAT 


TAATTGAAAC 


ATCACCATTA 


TTATATTTTC 


840 


TCAATTCTTT 


AAGATTTGCA 


TATTGACCGA 


AAAATTTAAA 


GTTTTCTATT 


TTCTGCTTGA 


900 


GCTTATCATC 


TTTTACTTCA 


TCTAATAAAA 


TAATCTTATT 


ATTTTTCATT 


TCAATTTTAT 


960 


AATTCTTTTC 


ATCTGGTAGT 


TTACCATTCC 


GGTAAAATGT 


AGTAACCAAA 


ATAATGGCCT 


1020 


TTAGCGGTTC 


TAGTATTCCT 


ATCCAGATTT 


AAAACCATAC 


CTTCATCATC 


CAATTCACCC 


1080 
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GGTTTATCAC 


TTTTAGCGAA 


ATCTGTGATA 


CTATAGCTGA 


ATACAGAALC- 


CACU I \_. J. 


1 1 AC\ 


TATCACCCTT 


TTTAAATTCG 


CCATCACGAT 


ATCCTTCTTT 




1\,L l\.unun 1 


1 odd 


TCTTAATAGG 


ATACATATCT 


AACGTTTTCG 


V~AAAljl~ 1 v- I 1 


ill An l l lu l 


1\» 1 I V. l l luu 


1260 


AACCTTCTTT 


TGTTTCATCA 


C CTTT ACU AU 


AALCAIA. 1 AL. 


AALAAL1A1 1 


21 A & ft TV ft ft f2 A 
AAAA IV. AnuA 


1 19n 


TACTTATGTA 


CCAATCCAAA 


CCTTTNGGAA 


TATCCCATCG 


LTI I\_ UAUAA 1 




IjOU 


AGTCATCATC 


TTTATTTTTG 


GCAGGGTTGA 


AAATTTAATT 


/■"•7V » (TWUPTA 

GAATCCiAUTA 


1 IvxAl AA 1 1 X. 


X44U 


GCTATTTTCT 


TCTTTGGAAA 


AAGTCTTATT 


TCTTAACNTN 


NNATATCCAA 


CGGGAACTGC 


1 C A A 

lDUU 


CTTTTAAGTC 


CCCATCAATA 


TGCATTTTTA 


ATATTGGAGC 


TTTATCaGTA 


GGAATGTTAT 


1560 


AACGGCTTCT 


TAATTGCTTA 


ACATTTTCAT 


CTTTATTACT 


CATTTTAAAT 


TTCGCGTCAT 


1 M A 


AACTTGGAAC 


ATTACTATTA 


ATTGAAACGT 


CGCCATGGTT 


GTAACTTTTC 


* Jv Jv pnwiwnn iv 

AAACTCTTGA 


1 C O A 


AATCGGCATA 


TTGTCCGAAA 


AATTTAAAGT 


TTTCTATTCT 


TTCTTTAAGT 


TTTTGATCTT 


1740 


CCACCTTGTC 


TAATAAAATA 


ATTTTATTGT 


TTTTCATCTC 


AACTCTATAA 


TTTTTACAAT 


1800 


AAGAGNTTTC 


ATGATTGTCA 


GATATAGTAT 


CTACCAAATA 


ATATCCCTTG 


GCCGTTCGTG 


1860 


TATTTCTATC 


CAAATNTAAA 


ACCATACCTT 


CATTATCTAA 


TACACCTGGT 


CTATTACTTT 


1920 


TGGCGAAATC 


TGTGTAAATC 


GTCCACATCC 


CTTTGTCACC 


TTTTTTAAAT 


TCGCG 


1975 



(2) INFORMATION FOR SEQ ID NO: 918: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1048 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918: 



TTTGTCATGT 


TCAGGTCGAT 


TAACACCCGC 


TAAATGCACG 


ATAAAGTCTG 


CTTTCAaCAA 


60 


TGCTGACTCT 


AATTCTTCCT 


CTTTAGTGTG 


TCGATGTACT 


tcgaaaatat 


GATGATCTGT 


120 


CGGTGATGTT 


AAATCTGCTT 


NCAAGTTTTT 


TCCTACAAAA 


CCTTGTGCTC 


CTGTAATTAC 


180 


AATATNCAAC 


GTCAATCTCT 


CCTATCTCAT 


TGAAGCTTTA 


TAATCATNCA 


ATTCGTTTCT 


240 


AACATATTCT 


AGTGTTAAAA 


GTTTCTCTTT 


TATCTCTTCC 


ACCGNTAAAA 


TATGTGTATT 


300 


ATCGGAGTTA 


TATTCAATAA 


GATTGCGTAA 


TCTTTTCGGN 


TACCGGTTTC 


AACATAATTA 


360 


CTATNATTTA 


AATCTCTGGA 


GGTCTGGCCG 


GNACTCTAAA 


ATAATCACCC 


ATATCTTCAN 


420 


AATGGNGNGT 


ATTCTTCNCT 


CGGCaaNAAN 


GTNTCTGCTT 


TTTCTCTCCA 


TGGCACGAGC 


480 


GNACCAATGA 


TTTCAATTGC 


ATTATCAGCT 


TCAAACAATT 


CTAATAATGC 


GGTCGCAAGA 


540 


TCCCCTACCG 


TTGAGCTTGG 


TGCTTTTTGA 


ACCATAATAT 


CTCCTGTCTC 


TGCATGCTTA 


600 


AATGCATGAA 


CAACTAGTTC 


TACCGCATCT 


TCTAAGCTCA 


TTAAAAATCT 


TGTCATATCA 


660 


GGATCTGTAA 


TCGTTAAAGG 


TTCTCCAGCT 


TTGATTTTGT 


CGATAAACAA 


AGGTATTACT 


720 


GATCCTCTTG 


AAGCCATCAC 


ATTACCGTAT 


CTTGTNCCAC 


AAATAAGCGT 


TTGTTCACTA 


780 


CGAATATTTC 


TTGATTTGGC 


TACGAATACT 


TTCTCCATCA 


TTGCTTTTGA 


AATACCCATA 


840 


GCATTAATAG 


GATACGCTGC 


CTTATCTGTA 


GATTAAACAT 


ATGACTTTTT 


TAACAATTTG 


900 
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AATGAATAGC ACTTGGTAAG ACATTTTCTT GTACCAATAA TATTTGTCTT CACTGCCTC& 960 
ACTGGAAAAT TCACATGACG CACTTGTTTT AAGCTGCTGA TGGATACGTA TCACATCTCG 1020 
CATGTGGTTC TCACTTGGAT ATCAGACA 1048 

(2) INFORMATION FOR SEQ ID NO: 919: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919: 



GAATTCATCA 


TGTTNCAAAT 


CCATTAGGTG 


TAGGAGGTAT 


CAATGATGCA 


ACAACTATCT 


60 


CTGAAACATA 


GATTAAACAA 


TGGNGATTCA 


GTTTATGGCA 


TTTCCAATTC 


TATACCGGAC 


120 


CCATTGATGA 


TCGAGGTTAT 


CGCAGCAAGC 


GGGTATGACT 


NTGTTGTGAT 


NGATACAGAA 


180 


CACGTGGCGA 


NTANTGATGA 


GACACTAGCG 


CATTCTAANT 


CGCGCAGCTG 


ANGCAGCGCA 


240 


TATTATACCA ATTGNTCGTN 


TCACTGTAGT 


GATAGATAGA 


GATATCATTA 


NTGTGTTAGA 


300 


TCTGGCTGCG 


AGAGGCNTTA 


TTGTGCCACA 


CGTTANGATC 


NNCGAGACAG 


TTGGAGCAAT 


360 


NTTGTGATNT 


TANCCCGGTC 


TCTCCC 








386 



(2) INFORMATION FOR SEQ ID NO: 920: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1677 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920 : 

ATAAGAAATC CACCGGTTGA AGATAACCCA CCTAATAACN TCCAAGTTGA GNNNTTGNTG 60 

CCTTGGNCAA TTGGTTTTTT CATTTTAATC CTTAATTATT AATATTTTCT TTTTNCCTAA 120 

GAAACTTCAA TTCACCTGAA GCCCAAACCT AAAATATTAG TGAATGCGAT TGAGAATAAT 180 

GATAACCCCN NNAAGTAGCA ACATTATTAA GTNCTNAAGA TAATAAAGTA GATTCTGAAG 240 

AAGCGAAAGT ATACATTAAC TATATCAAAG ATGAAGTTGG GCTTAAGCAA TTTGTCAGCG 300 

ACCTTAAAAA TACGGTACAT AAATTGAATA AGAGTAAGAC CAGCGTAGCT TCTTATATTC 360 

AAACCAGATC TGGTCAAAAT ATATTACGTG TAAGTAAAAA TGGCACGCGT TATATCTTTT 420 
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TCGAT AATA 1 


/*N *f|W|U|l >\ / P M 1 1 


r^r* a f*r*P AfT A 

uLALL In^uni 


Ajt^AACCAAT 


TGTTAAGCCG 


aaagaaaaa^ 


480 


L.AAAA lAiun 


oil In/iAiU J. 


fKTfiRTAAGA 


AAAaGATGGT 


TATAGCTGAA GCAAATAAAG 


540 


qir» a A X T* 
IvJAl Qjd- AA I 


AbulAAl 111 


nlALLvlUOUn 


CATATAGAAT 


TCCAGCTATG 


AAATCAACTG 


600 


Af» A A/WWH 


111 1 VjJVAIovt^ 


TATTTAAAAT 


x xuini x x x m\sj 


ACAAAGTAAT 


TCTGAAACGG 


660 


TAGATGTTAL 


iXjAAijA l ru i 




ATAfA Af**W2 , P 


AACTTTAAAA 


GGCGATACAA 


720 


AATTAAATGA 


TAGTTCTAAA 


AAAvjIAAL i a 


t a a a tv; a cm 

1 Ann iunLuu 


TGAAATGGCA 


TTTTCAAGTT 


780 


Kill /"^/"•CP 7v 

C CAAAAC L» 1 A 


IXKjTv-C AI A J. 


/V 1 A/**A A A AT* A 


A Af!ATATTAP 
nnunlnl InL 


CATTTCAGCT 


TCAGGTAAAG 


840 


CuAftAobliiA 


AALAi 1 1ALA 


APAr'AnAPfJA 


AdAPGATTAA 
nunv.un l l xw 


AGCCAGCGAT 


TTAANATACA 


900 


a ta r* Ad a rsn a 


1 rl/lv 111 unrt 


TTTTGGGACA 


GNGAAGATAT 


CGNNGGACTA 


TGTTGAAAAG 


960 


& & if* a a a a Af"* 


A Af?A AA AC^Af? 


V_ X. iunnunnv 


AAATTCIATACJ 

nnn x x \ja x n\j 


AATTCTTTGC 


TGGATATTCT 


1020 


1 lAvj^AaAAlri 


nluLluCul 1 


TAATTAHTTA 


OX* X X X X V3XVX X 


TTGTATCGTC 


ATATATAAAA 


1080 


AAAWjA I\_A 1 


Ul 1 i 1 I AXuA 


luA 1\7 1 AHAVj 


a AnnrypnTAT 

nnuv.u iulnl 


ctaaaggaag 


TTTAATGATG 


1140 


ATTAGTTCAC 


CAC AAA 1 1 A 1 


Tk TV TV^/^TV^ A A 

ALiAivjV- iuAA 


AAAUAluu 1\7 


ataagattac 


CGCAACCGTA 


1 900 


AGATTAATAA 


ATGAAAACGG 


TAAUCAAbIA 


UA1 AAAvjAAI 


atgagcttga 


ACAAGGCTCG 


1 0£O 
XzDU 


CAAGACCGCT 


TGCAAl 1AA1 


LAAuAuAiLA 


r* A A A A AT* Af A 
uAAAAAlAuA 


ulAl 1AAAAL 


mrnjv j\ ArpA 

1 lAAAXAAbA 


1 

J.J*W 


TGC ATC» Iv3 i t. 


JllWA HI* A f A 

A IvJUAA 1 ALA 


a Ar* a Ar* a Trr* 

AALAAUA 1 


TTTPAAAfiTA 
1 1 IVAAAulA 


tttacaatta 


GTGTGAATTA 




CTTTGAAGGG 


ATTGGTTTTT 


GGGTGAAAAA 


ATTTATGGAG 


tgctgaggtt 


TATATCAATN 


1440 


TNATTACCCT 


NNATAAGGTT 


GGTAATGATN 


GTNCAATGAA 


acttggtgaa 


GTAATGCTTA 


1500 


ATATATAAAA 


GAGCATATTT 


AAATTTAGTT 


TCTGTTCGTG 


atagttataa 


ACTAATCATT 


1560 


ATATAAAGGA 


GATAGTCATA 


TGAATTGATT 


TTAACGCATT 


ttaacaagcg 


TTCACCGATA 


1620 


TTTATTATTA 


atttaggatc 


TTTGGACGGG 


CCATTGGGAG 


cattatcgat 


AGATATT 


1677 



(2) INFORMATION FOR SEQ ID NO: 921: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921: 



GGAACTCGTT 


TTTAAAAGTA 


ACCAATCAAA 


CAAACTCTCA 


GATTATTACT 


GTATCAGTTA 


60 


CAACTGGGAA 


TAAATCTGAA 


TCTGACAAAA 


TTGTTAATAG 


AATATCTAAA 


GTTTGTGCAC 


120 


ATGATATGCC 


TAAAATTATG 


AGTGTTGATA 


ATGTTACGAT 


ACTTTCCTCA 


GCACATGATA 


180 


ATGCTGTAAA 


AGTATCTCCA 


ATAGTATCTG 


TCAACTTAGT 


GATCAGCATT 


ATTGTTGGTA 


240 


TTGTTTTAGC 


AATATTGATT 


ATTTTCTTAA 


AAGAATTATT 


AGATAAGCGT 


ATTAAGACAG 


300 


AAGAAGATGT 


TGAATCGCAA 


CTAGGATTAC 


CTATTTTAGG 


TTCAATACAA 


AAATTTTAAT 


360 


TTACGAGGAA 


TTACCATGAC 


GAATACACGA 


AGAAGTACAT 


CAAGTTTAAT 


TGGCCATGAA 


420 


CAACCAAAGT 


CACCTATTAG 


GGAGAAATTT 


CGGAGGCATA 


AGGATCAAAT 


ATTATGTTTG 


480 


GAAATCCTGA 


CAGGGCAGGT 


CAAAGCATTG 


TNATCACTTC 


AGAGGCAACA 


GGCGCAGGTA 


540 
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AGTCTACAAA TGCAGCAAAT TTAGCAGGGG CATATTGCGC AAGCAGGGTA TAAAACAATA 600 

ATCCGNNGAC GGGGATATGN GTAAACCTAC GCAGCATTAT ATTTTTAATT TGCCAAACAA 660 

TGAAGGCCTA TCAAGTTTAT TGCTAAATTG GTCAACTTAT CAAGACAGTA TTATCTCAAC 720 

TGAAATTCAA GATTTAGACG TCTTGACGTC TGGGCCAATC CCACCGAATC CGTCAGAGTT 780 

AATTACATCA AGGGCATTTG CAAATTTGTA TGACACATTA TTGATGAATT ATTACTTGGT 840 

CATTATCGAT ACGCCACAGT GAACACAGTT ACAGGTGCGC AATTATTTCA AAGTTTACCG 900 

GAATTTGGTC TACG 9 14 

(2) INFORMATION FOR SEQ ID NO: 922: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922: 



GATCCATCAT 


CTCTGTTTCA 


TTTGCGGCCC 


CGTCATCGAT 


ATTAGNCAAA 


TATATGGTTA 


60 


TTGAATATCA 


ATCAATACTT 


AATCAATCTC 


CATAAATAAG 


TGATAATAAC 


NAAGAAGATN 


120 


TATCCGACCC 


TTTGTCTTTG 


TTCCAAAATG 


ACACCTAGAA 


TGTTACCTGC 


CTGCTTTTTC 


180 


CATAAGTGCT 


TTTGCTTTTT 


TAACTTCATT 


TTTATCATTT 


TTTTCACTAT 


CAATTACTAA 


240 


CAAGACTATC 


TTTAATAGCA 


CGAGCGCANA 


NAGTTGTGCA 


NCAGTCACAG 


TAATAACTGG 


300 


CGGTGTANCG 


ACAATAATAA 


TGTCGTAACG 


TTTATTAAAC 


AGATCAACTA 


ATTCTTTGAA 


360 


CCTTTCAGAC 


CCAATTAACT 


CAGATGGATT 


TGGAGGTACA 


GGGNCAGCTG 


TTAGCAAATC 


420 


TAAATTTTCA 


ATTTCTGTCG 


ACGTAATTGN 


TTCTGACATA 


GTCGNTCGAC 


CAATGATTAA 


480 


GCTTGATAGT 


CCATTATTAT 


TGTGCTCATT 


AAAAATATAG 




GCTTACGCAT 


540 


ATCGCCATCA 


ATAACTAATG 


TCTTATAGCC 


TGCTTGTGCA 


TAAGTAATCG 


CTACATTCGA 


600 


TACAACTGTA 


CTTTTACCTG 


CACCAGGCTT 


TTCAGAAGTA 


NCCAATAAGC 


GCTTTACTTC 


660 


ACCATTTGCT 


TTTGAAAACA 


TGATGTTTGN 


ACGTATACCT 


CGNNACTTTT 


CACTAATTGT 


720 


TGAGTTTGGG 


TTTTCATATA 


CAAATAGTGT 


TGTTGTCGTA 


TTTTCCTTTT 


TTGACATAAG 


780 


TAGCAACCAT 


CCTTAATTAA 


ATTTTTGNAT 


TGAACCCAAT 


ACAGGCAATC 


CTAATTCTTT 


840 


CTCTACATCT 


TCTTCATCTT 


TAATTCGCTT 


ATCGAAAATT 


ACTTTGAAGA 


AGATATATAT 


900 


AAGCGCTCGT 


GCC 










913 



(2) INFORMATION FOR SEQ ID NO: 923: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923: 



CAATAAGCCA 


CCCCANTTTT 


TGNTCCATTT 


GTTTGAAGGA 


AGANGTTTTG 


ATTCATTNTG 


bU 


GTTGTCCCAT 


CCACCNGGGA 


GCAGTCCCCA 


AGACCTCCCC 


CNTTGGCATT 


AAGCCCNGGG 


120 


AATNTTGATA 


ATATATAAGN 


ATTTNTGCCC 


NGGCCTAAGA 


CCAACCAATT 


NAAAGATNTT 


180 


GNATCCNTTA 


AGTNCTGTTG 


TAAACATTTN 


CCCAGGACCA 


AGTGCCNAAT 


ACGGGATTAG 


240 


CTTCATCACA 


TGNCCTAGAT 


TATAAAGAAA 


TATTANATGT 


TATCGAATAG 


ATAATAATTT 


300 


AAATGAAATT 


GAGTGAAATG 


AATATTTTAA 


ATATAGCATT 


ATAGTTTGGN 


CTAAAAATTT 


360 


ACACAACGGA 


AGGATGTAAT 


TATTATGACA 


NCCAACACAG 


TTACATTACA 


CACAGCACAT 


420 


ATTGTAAGTT 


TAGGAGATAT 


CGAAGAAGCT 


AAAGCAAGCA 


TTAAACCATT 


TATTCGTCGA 


480 


ACACCACTAA 


TTAAATCTAT 


GTATTTAAGT 


CANAATATCA 


CTAAAGGTAA 


CGTATATTTA 


540 


AAACTTGAAA 


ATATGCAGTT 


TACAGGTTCA 


TTTAAATTTA 


GAGGTGCTAG 


CAATAAAATT 


600 


AATCATTTGT 


CAGATGANCN 


CAAAGCTAAA 


GGTATTATTG 


GCGCATCAGC 


CGGAAATCAT 


660 


GCACAAGGTG 


TAGCTTTGAC 


AGCAAAACTA 


TTAGGTATTG 


ATGCAACGAT 


TGTGATGCCA 


720 


GAAACAGCAC 


CAATTGCTAA 


ACAGAATGCT 


ACCAAAGGAT 


ATGGTGCAAA 


AGTCATTTTA 


780 


AAAGGTNAAA 


ACTTTAATGA 


AACTAGACTT 


TAGATGGAAG 


AATTAGCGAA 


AGAAAATGGT 


840 


ATGACAATTG 


TTCATCCATA 


TGATGATAAG 


TTTGTCATGG 


CTGGTCAAGG 


AACAATAGGA 


900 


TTAGAAATTN 


TAGATGATAT 


AG 








922 



(2) INFORMATION FOR SEQ ID NO: 924: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 924: 



CAAGCCAGTT 


CAACTNACCG 


ACCCGGTATA 


TTTGGGGAAT 


ATACGAANCN 


CNNTCAGATG 


60 


ATCNAAATAT 


GANTGANACN 


CGGTNGCGTT 


CTCNAGNCAN 


AACAGNGTTA 


AATAANCGTA 


120 


AGTCGGCAGT 


TATGTCANAT 


GAGATTACTT 


CTNCTGATTG 


GGNAGTTACG 


NNTCCACACG 


180 


AGANNATATC 


ANNTGGTGAN 


NCAGNCAGAG 


TAAAACATAG 


AGATTTTGAC 


CCGCCATTGT 


240 


ATGTAGAGGC 


AGAAGTTATT 


GCCGAAGAAT 


ATAACATAAT 


TTCAGAAAAT 


AGCACATATA 


300 


CATTCGGTCA 


ACCTAAAGAG 


TTCAAAGAAT 


CAGAATTACG 


AGAAGAGTTT 


AACAAGCGAT 


360 


TGAACATAAT 


ACATCAAAAG 


TTAAACGATA ATATTAGCAA TATCAACACT ATAGTAAAAG 


420 


ATGTTGTAGA 


TAGTGAATTA 


GAATACTTTG 


AACGCAAAAT 


ACACAAAAGT 


GATACACCGC 


480 
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CAGAAAATCC 


AGTCAATGAT 


ATGCTTTGGT 


ATGATACAAG 


TAACCCTGAT 


GTTGCTGTCT 


540 


TGCGTAGATA 


TTGGAATGGT 


CGATGGATTG 


AAGAAACACC 


AAATGATGTT 


GNAAAATTAG 


600 


GTGGTATAAC 


AAGAGAGGAA 


AGCGCTATTC 


AGTGAATTAA 


ACAATATATT 


TATTAATTTA 


660 


TCTATACAAC 


ACGCTAGTCT 


TTTGTCAAGA 


AGCTACAGAA 


TTACTGGAAT 


AGCGAGTACT 


720 


TAGTAGATAA 


TGATTTGAAA 


GCGGACTTAC 


AAGCAAGTTT 


AGACGCTGTG 


ATTGATGTTT 


780 


ATAATCAAAT 


TAAAAATAAT 


TTAGAATCTA 


TGACACCCGA 


AACTGCAACG 


ATTGGTCGGT 


840 


TGGTAGATAC 


AAAAACTTTA 


TTTCTTGAGT 


ATAGAAAGAA 


ATTACAAGAT 


GTTTATACAG 


900 


ATGTAGAAGA 


TGTCAAAATC 


GCCATTTCAG 


ATAGATTTAA 


ATTATTACAG 


TCACAATACA 


960 


CTGATGAAAA 


ATATAAAGAA 


GCGTTGGAAA 


TAATAGCAAC 


AAAATTTGGT 


TTAACGGTGA 


1020 


ATGAAGATTT 


GCAGTTAGTC 


GGAGAACCTA 


ATGTTGTTAA 


ATCAGCTATT 


GAAGCAGCTA 


1080 


GAGAATCCAC 


AAAAGAACAA 


TTACGTGACT 


ATGTOAAAAC 


ATCGGACTAT 


NAAACAGACA 


1140 


ANGACGGGAT 


TGTTGAACGT 


NTAGATACTG 


CTGAAGCTGA 


GAGAACGACT 


TTNNAAGGGG 


1200 


AAATCAAAGA 


TAAAGNTACG 


GTTANACGAA 


TATCGAAACG 


GATTGGAAGA 


ACAAAAACAA 


1260 


TATACTGATG 


ACCAGTTAAG 


TGATTTGTCC 


AATAATCCTG 


AGATTAAAGC 


AAGTATTGAA 


1320 


CAAGCAAATC 


AAGAAGCGCA 


AGAAGCTTTA 


AAATCATACA 


TTGATGCTCA 


AGATGATCTT 


1380 


AAAGAGAAGG 


AATCCCAAGC 


GTATGCTGAT 


GGTAAAATTT 


CGGAGAGAAG 


AGCAACGCGC 


1440 


TATACAAGAT 


GCTCAAGCTA 


AACTTGKAGA 


GGCAAAACAA 


AACGCAGGAN 


CTTAAAGGCT 


1500 


TAGANACGCT 


GAAAAGANAG 


CTAATGTTTA 


TACAGACAAC 


AAGGTCAAAG 


AAAGCACAGA 


1560 


TGCACAGAGG 


AAAACATTGA 


CTCGCTATGG 


TTCTCAAATT 


ATACACAATG 


GTAAGGAAAT 


1620 


CANATTAAGA 


ACTACTAAAG 


AAGAGTTTAA 


TGCTTCTAAA 


AGAACACTAT 


CAAGAGTGTT 


1680 


AGCAGACATC 


ACTGTAAATG 


CTATGAAAGG 


CATCTATTTA 


AGGTATGACG 


AAAATGGGGC 


1740 


GATTACTTCA 


CATACTATTG 


ATAAAGATGG 


CGTGAAAATT 


AGTGGCGATA 


AAGTTGATAT 


1800 


AACAGCGAAT 


AGAGAATTTA 


ATGTATTCGC 


AAATAATATT 


AATAACAAAG 


TTGGTAAAAA 


1860 


TGACATTGTT 


AATAGCCTAA 


ACTTATCAAA 


TGAAGGTCTT 


GACATCAATG 


TGAATAGAAT 


1920 


TGGTATTAAA 


GGCGGAAATG 


CTAACCGTTA 


TGTACAAGTT 


CAAAATGATT 


TTATTGAACT 


1980 


TGGCGGAATC 


GTACAACGAA 


CTTGGAAAGG 


CAAACGATCA 


ACCGATGATA 


TATTCACACG 


2040 


TCTTAAAGAT 


GGACATCTAA 


GGTTTAGAAA 


TAATACCGCA 


GGCGGGTCAC 


TTTATATGTC 


2100 


ACATTTTGGT 


ATTTCAACAT 


ATATTGATGG 


AGAAGGCGAA 


GACGGAGGTT 


CATCCGGTAC 


2160 


TATTCAATGG 


TGGGATAAAA 


CTTACAGTGA 


TAGCGGTATG 


AATGGCATAA 


CAATCAATTC 


2220 


TTATGGCGGT 


GTAGTCGCTT 


TAACATCTGA 


CTACAATCGA 


ATTATTATCG 


ATTCATATGC 


2280 


TTCAGCTAAT 


ATTGAAAGTA 


GAGAAGCGCC 


GATATATTTA 


TCTCCGAACA 


CCCAAAAATT 


2340 


AAACCTGGNT 


TTAANCCGAT 


TCGCATTCAC 


ATTATCAAAC 


GCTGATAGGT 


NCATTACGAA 


2400 


ACTGGCGGTT 


ATATCATGTT 


GGGTTCAAGA 


TGNAANCTAT 


AAGTOCGGTG 


CTGGATTAAG 


2460 


ATTTTCTAAA 


CGTACCAATA 


AAGGATTGGT 


TCAAGTCGTT 


AATGGTGACT 


ATGCTACAGG 


2520 


CGGAGACACT 


ACAATTGAAT 


CAGGTATGGC 


CAAATTCAAC 


TTAGTTAANC 


GGAAGAGATG 


2580 


GAAATAGTTA 


CGTTAGCATT 


CAAAGTTATG 


ATTTATTGGC 


GGTAGGTTCT 


GATAATGCTG 


2640 


GCGATAGAGT 


CGCTTCTAAT 


TCTATTTATA 


AGCGTACTTA 


TTCAGCACCT 


GCTAACTTAC 


2700 


ACATTACTTC 


TGCTGGAACA 


ATTGGGCGTG 


CTACTTCTGC 


CAAAAAGTAT 


AAAATTTCAA 


2760 


TCGAAAACCA 


ATACATCAAT 


GAAGACGATC 


AGTTCAGTCA 


TTCAAAAGAG 


ATTTTAAAGC 


2820 


TTCCAATTCG 


TACATGGTTT 


GACAAATATG 


AATCGGAAAT 


AATGGCTAAA 


GAATTGGAAA 


2880 


GTGGTAAAAA 


GTTATCTGAT 


GATACTTTTA 


AACTTAGTCG 


ACATACTGGC 


TTAATAGCGG 


2940 


AAGAGGTTGA 


AGAATTAGGA 


TTTAATGAAT 


TTGTTATTTA 


TGATGACAAC 


GGAGAAATCG 


3000 


AAGGTATCGC 


ATACGATAGA 


CTTTGGGTTC 


ATCTANNACC 


TATTATTACC 


CNAAAACCAN 


3060 
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CANTCAAANA NCGNAAAACN TANTGGNGGN NTTAACNTCT NATNGACNGC AACCAGGGTC 3120 
CNCATNCCAC CTCCACATTA TACAANTCAC TTTCTCNCGT CACTA 3165 

(2) INFORMATION FOR SEQ ID NO: 925: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925: 



AATGGGNAAT 


TGTCGGATGA 


ATTGTATCAA 


CCAATTACAA 


GCGATTAATT 


AAGAATCTAA 


60 


TGTCGAAGAG 


AGTGAAGAAT 


ACTTGACAAT 


ATTAATAGCC 


AGATACAGTT 


ATACACTCAA 


120 


GTAAAAGTAA 


CAACACCNAT 


TTTAGTGAAA 


GCACCAATCG 


CTGTCGTCGT 


ATTTTACTTA 


180 


AAGAAGTAAG 


AGATTCAATT 


TTTAGAGAGA 


AAATGGTAGG 


TGAAGGCTTA 


GCAATC AAAG 


240 


CTCATGAAGA 


ATCCAAAGTA 


ATCGCACCGT 


TCAATGGTTT 


AATATCTATG 


ATTGTACCAA 


300 


CTAAGCATGC 


AGTTGGTATT 


CAATCAGAAG 


ACGGTGTGGA 


CATAGTCATT 


CATATTGGCG 


360 


TGAAGNCAGT 


TGACTTGGAA 


GGTAAAGGGT 


TCAAGTACTT 


TGTAAACCAA 


AATGATCATG 


420 


TTGAAGCAGG 


CCAAACGNTG 


TTGCAATTCG 


ACCAGCAATA 


TATACAACAA 


CNAGGCTACA 


480 


ATGCTGACGN 


TATTGTCGTT 


ATTAGCAACT 


CTGCCGATTT 


AGGAAAAGTA 


GAACTGACNA 


540 


TGAATGAAAT 


CATTACGACT 


GAAGATGTTA 


TTTTTNAAAT 


ATTTAAAAAC 


TAGGAGTGTG 


600 


TTGTAATAAT 


GACAAAANTA 


CCGCAAAATT 


TTATGTGGGG 


TGNCGCTCTT 


GCCGCAAATC 


660 


AATTTGAAGG 


GTGAATATGA 


AAAAGGTGGT 


AAGGGGTTAA 


GTGTAATTTG 


ATGTATGACG 


720 


AGTGGTGCAC 


ATGCCAAAGC 


ACGTCAGATT 


ACAGAATCAA 


TAGACCTCAA 


TCACCTATAA 


780 


TCCAAATCAT 


TGGAGGTATT 


GATTTTTATC 


ATCGTAATAA 


GGGAAGATAT 


TGCCTTGTTT 


840 


AAAAGAAATG 


GGATTGAAAT 


GTTTACGTAC 


GTCGATTGCG 


TGGACACGTA 


TCTTTCCGAA 


900 


TGGGGATGAA 


GATGTGCCAA 


ATGAAGAAGG 


ACTTGCCTTT 


TATGATCGTA 


TCTTTGATGA 


960 


ATTAATTGCA 


CAAGGTATTG 


AACCTGTTGT 


GACGTTATCA 


CATTTTGAGA 


TGCCACTTCA 


1020 


TTTAGCGAAA 


CATTATGGTG 


GATTTAGAAA 


TAGAGAAGTT 


GTCGATTATT 


TTGTGCATTT 


1080 


TGCGCGTGTT 


GTATTTGAAA 


GATATAAAGA 


TAAAGTTACA 


TATTGGATGA 


CGTTTAATGA 


1140 


AATTAATAAT 


CAGATGGACA 


CATCAAATCC 


TATCTTTTTA 


TGGACGAATT 


CTGGGGTAGC 


1200 


ATTGACAGAA 


AATGATAATC 


CTGAAGAAGT 


CTTGTATCAA 


GTAGCACATC 


ATGAACTTTT 


1260 


AGCCAGTGCT 


TTAGCAGTTC 


GTCTTGGTAA 


AGAGATTAAT 


CCGAAGTTTA 


AGATTGGAAC 


1320 


AATGATTTCA 


CATGTACCCA 


TTTATCCATA 


TTCGTGTCAT 


CCGAAAGATA 


TGATGGAAGC 


1380 


ACAAATTGCG 


AATCGCTTAC 


GTTTCTTTTT 


CCCGGATGTC 


CAAGTGAGAG 


GTTATTATCC 


1440 


AAGCTATGCT 


AAAAAAATGT 


TGGCACGAAA 


AGGATATGAT 


GTTGGATGGC 


AAGAAGGGGA 


1500 


CGACAGTATT 


TTACAGCAGG 


GCACGGNTGA 


TTATATTGGC 


TTTAGTTATT 


ACATGTCTAC 


1560 


GGCTGTAAAA CATGATGTTG 


ATACTACAGN 


TGAAAACAAC 


ATCGTCAACG 


GTGGGTTTGG 


1620 


ATTCATCTGT 


GGAGAATCCG 


CATATCGCAA 


CGAGTGATGG 


GTTGGGCGAT 


GGATCCAGAT 


1680 
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GGCTTAAGAT ATACAATTGG ACTGTGTTAT TTGA _ 1714 

(2) INFORMATION FOR SEQ ID NO: 926: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926: 



AAATACTTCT 


TCATAGAATA 


TGTTTGTAAC 


GTTAGATTTA 


GTAGTTAAAG 


TAAGGTCATA 


60 


AATTTTCAAT 


ATGCCTTGAA 


TTAAATTATG 


TTTAGACTGA 


GCAAAACTCT 


CATTTTTAAC 


120 


ATCCGCTGTA 


CGCAAGGACA 


ATTACTTTAT 


CAAAATGACT 


TCCAATTTCT 


NAAAGATGTA 


180 


CTACAAAGTG 


TTGNTGNAAA AGTTATTGCG 


GAAGGTAATG 


TCATTACACC 


GGATATGTAT 


240 


NAACGCGTGA 


TGGACTTAGG 


CGTTCATTGT 


TCAGTCGGTG 


GTGGTGCGAT 


AACACGACCA 


300 


AANGAAATTA 


CGAAACGTTT 


TGGNCANGTT 


ATGGAAGATT 


AAATGATAAC 


GATAAAAAAA 


360 


CGAGATGACC 


ANCATTAATT 


AAAGGTACCN 


NATTATCTTA 


GGTGGNTGAA 


TGAATGTAAT 


420 


GGGTTCANCT 


CGNNTTGTGT 


GTGTACGATA 


GTGATTTTAT 


TTTCACNTTG 


AGTTATCCAC 


480 


AAATANGTAA 


AGCGACGGGG 


ATGGTGATTA 


ATAGCGACAA 


NNGCCACGCC 


TTAAAAAACA 


540 


CATGATGATG 


AGTNTCCAGA 


CAGGGTATTT 


TAATTTCANA 


TGNTAGTATT 


ACAATAGCAC 


600 


TAC 












603 



(2) INFORMATION FOR SEQ ID NO:927: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927: 

GGCACGAGCG GAAAGGTTTG AATTCATCTG ATGAATCATC ATAATCCTTG ATAGAACGTT 60 

CATATTTATC TAAATCTGGC ATGCGTTCAT CGTCAAACTG AGTTAATTGA TAGTGTTTAA 120 

TAATACTGTT TAATTTCTTA GCATAGTTTG GATCTGTAGC ATATGTTTTA GATAAGTGTG 180 

ATGTTGCATC TTTATAAGAA TCGGCTTCCG ATTTCCATGT TGGTTTATAA ATTGTTCGAT 240 

TGCCATCAAT ACCATTTTTA ATAAGGTCAG AGTAATCTTT TAGTGATTCT TTCGTGCTTG 300 
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GATATTTTCG 


GAATCCAGCA 


1 1AA1AL1A1 


ACAATTTATT ACCATCAGCT TCCTAATGT£ 


JDU 


TTAAAAGGAA 


CAGAATTCCC 


I l\ZAAAAuCA 


CCTTTGATAC 


CGAATAAATT 


ATRGTTTGGT 


jt o a 


GACTTAGCTA 


AAGCACTACG 


ACCTGAGTCA 


GATTCTAAGA 


TTGCTTGCGC 

X A A x VTVTVTV' 


AATCATGACA 


IDA 

4oU 


GACGCATAAA 


TATCGTTATC 


TTGACCAATG 


CGATGTGCAT 


CTTTAGCAAT 


TGATTTGACA 


D4U 


AATTGACGTG TATCTTTTGA 




TTAAATTGTC 


CGCTATCATC 


ATTGTTAGAT 


oUU 


ATACTAGGAT 


CTGTTTCGAA 


TAATGATGTT 


GCACGTGTAT 


CCTTTTGATT 


AACATCGNTA 


660 


TTGAATGATT 


GAGCAGGTTT 


AGATTTATGT 


TTCAATTCAT 


CTTGTGTTGG 


NAACTGTGGA 


720 


TTCTTTGGTA 


TTAGATTTNN 


CATTTTTGGG 


CTTTTNNTAG 


ATTGAGATGC 


ATAATCTTTG 


780 


NGGGGTTTTC 


TTTNGGATCT 


TCACTGTATT 


GGNCCCCAAA 


ATAGAGTCTA 


AAGCCCATCT 


840 


GACAATGATT 


GATATCTTTC 


C 








861 



(2) INFORMATION FOR SEQ ID NO: 928: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928: 



GATTCTACCA 


GCCGCATTAT 


CCACACCGGC 


ATTATTAAAC 


AACACATCGA 


TTCTTCCAAA 


60 


CTGTTCCTTT 


ATGTCAGACA 


CAAAGTCTAC 


CACTTGTTGT 


TCATCTGAAA 


TATCCACATT 


120 


ATACGCCTTC 


GCATTGTCAC 


CGTTACTTTT 


AATTTTATCG 


ACAGTCTCCG 


ATACCGCTTC 


180 


AGCTATGTCT 


ACCGCCAATA 


CATACGCACC 


TTCTTGAGCT 


AAAGCGATTG 


CAGAAGCTTG 


240 


TCCGATACCT 


GTACTTGCTC 


CTGTTACGAC 


TGCAACTTTA 


TTTTCCAAAC 


GTTTCATTCA 


300 


GTCACTCCTA 


TACAACATAT 


CTTAAACTTA 


CAGGCCTTAT 


TTTCCACTGN 


TCAATGCTTC 


360 


ATAAACCTAA 


TAGATAAGTT 


TCCAGGCAGT 


CTTAGCTACT 


CTCTGCTATA 


CAAATTCTTA 


420 


TTGAATTACT 


CAAAATATTT 


ACAATATTAG 


TCTTTTTCAT 


TTCTTTGAAA 


ATTGGTACTT 


480 


GTGGAGAGTA 


TTTGTGGGTA 


CAACTTTAAA 


TATTTTACAA 


AGGGGGGAAT 


CCAATTGATG 


540 


ACAATCAAAA 


CATCAACAAA 


AGGGGTCTTA 


TCCGTCAATC 


CGTTCCTTGA 


ATAGTGTTGA 


600 


AAAAATTGGA 


AATAAATTGC 


CAGATCCTAG 


CGTCTTATTC 


TTTTTAATGT 


GTGTGGGCTT 


660 


AGCCATTATG 


ACGTGGGTTA 


TCTCATTATT 


NAATGTATCT 


GTTAAGCATC 


CAGGTACGCA 


720 


TCANACCATT 


TATATTAAAA 


ATATAATTAG 


CCATGATGGA 


TTNACGATGA 


TAATGAATGA 


780 


TACGATTAAA 


ANTTTCTCAG 


AGNTCCCAGC 


ATTAGGCTTA 


GTACTAGCAG 


NGATGATTGG 


840 


TATTGGTGTT 


GCAGAGNAAA 


ACAGGATACT 


TCGATAAGTN 


AATGATTTCT 


GTTGNGAATC 


900 


GNGCACCACG 


ATTTTTAATT 










920 



(2) INFORMATION FOR SEQ ID NO: 929: 
(i> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DMA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929: 



GGCACGAGCT 


AGCACAAACA 


GTTTGAACTA 


AAGAAGATTT 


AACTGAGTAC 


AATAAAATGT 


60 


CTAAGTCTGA 


ACAAGCCGAT 


ATAAAAGATG 


TTGGCGGATT 


TGTCGGTGGA 


TATTTAAAAG 


120 


AAGGCAAACG 


GCGTGCTGGT 


CAAGTCATGA 


ATCGTTCAAT 


GCTAACACTT 


GATATCGATT 


180 


ATGCAGCCCA 


AGATATGACT 


GACATATTAT 


CTATGTTTTA 


TGATTTTGCA 


TATTGTTTAT 


240 


ATTCAACACA 


TAAGCATAGA 


GAGATAAGTC 


CAAGACTGCG 


TTTAGTGATT 


CCTTTAAAAC 


300 


GAAATGTAAA 


TGCAGATGAG 


TATGAAAGCT 


ATTGGGCGTT 


AAGTCCGCAG 


ATATCGTTGG 


360 


GCATGGATTA 


CTTCGATGAT 


ACAACTTATC 


AACCACATAG 


GTTAAATGTT 


ATTGGCCTTC 


420 


CACCTAGTTA 


ACGATGCGGA 


ATTTTCCTTT 


ACCTATGAAG 


ATTTACCTTT 


GTTAGACCCC 


480 


GATAAAATAT 


TAAATGAATA 


TGTTGATTGG 


ACTGACC 






517 



(2) INFORMATION FOR SEQ ID NO: 930 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1216 base pairs 
<B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930: 



CCTGTANANT 


CACCACCAGG 


TTCACGGAAA 


TGGAACTTGN 


ACATCGACCA 


AATCCAGGTG 


60 


ACCCCCAAGT 


GACCTTTCGC 


ATCTATGATG 


CCCAACACCA 


TTGCTTGGGT 


CAATTGCAGG 


120 


TGCAATTTGT 


TTAATTACCT 


TACCATCAAT 


TAAAATATCT 


GCTTGTTGTA 


ATTCTCCATT 


180 


TTGTAATACT 


TTACCGTTTT 


TAATTAATTT 


CATCATTTTC 


GTCCCCTTAC 


CTATTTTAAT 


240 


AATTCATCAA 


TGACTGCCAT 


TCTTAAGTAA 


ACGCCATTTT 


CCATTTGCTT 


AAAAATTCTT 


300 


GATTTTGAAG 


CTTCTACTAA 


ATCGCTTTGT 


ATTTCTACTC 


CTCTATTCAC 


AGGTGCCGGA 


360 


TGCATAACAA 


TAGCATGTTC 


TTGTAATTTG 


TTATAGCGCA 


CTTCATTTAA 


GCCATGCTTT 


420 


TGATGATAAT 


CATCTGCTGC 


AAATCTAGTT 


TCTTCTGCAA 


GCCCATGTCT 


TTCATGTTGA 


480 


ATTCTTAATA 


ACATNACTAT 


ATCTACTGTT 


TCTATAACAT 


CATCTATATT 


TACATAAGGT 


540 


GCTTCTAAAG 


AATCATCAAT 


CCAAGCATTT 


GGGCTATTAA 


ACATTACGTT 


TGCACCTAAT 


600 


GCTTTTAAAC 


TATGGTAATT 


ACTACGTGCG 


ACACGTGAAT 


TTTTAATGTC 


TCCACAAATC 


660 


AATACATCCA 


AGCCTTCAAA 


ATATCCATAT 


TCTTCATATA 


TCGTCATTAA 


ATCAAGTAAA 


720 
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CTTTGTGTTG 


GATGTTGTCC 


ACTACCATCA 


CCAGCATTCG 


CAATTGGGAT 


GTTAATATTC 


fO\) 


GCTAATTTTT CATAGTAGTT ATTAAACGGA 


TGTCTAATGA 


CTAATAAATC 


ACAGCCAATA 


840 


CTTTCTAAAG 


TTTTACATGT 


GTCATATAAA 


GATTCAACCT 


TTTGAAACAG 


ATGATGTTGA 


90U 


TGTTTCAAAG 


CTAATCGTTT 


TTAACCCTAG 


CTCAAGTvjCC. 




T^^^n^^ a 'a "a f rn *o^ 
1 1 JXAAAAL 1 


you 


ACATTTTGTT 


CGAGTAGAAT 


TTTCAAAGAA 


TAAATTTGCG 


ACATATTTCC 


CTTCAAAGTT 


1020 


TGGTAGTTGA 


CGTTCACCAG 


ATTTAAATTG 


ACTTGCCTTT 


TGGATAAGTT 


TGTATATTTG 


1080 


ATCTGTAGAT 


AAATGTTCCA 


TTGATAATAA 


ATGATTCATA 


AAAGCGCCCT 


CCTTAATTTG 


1140 


TAAATCTTAA 


TTGTTTTGTT 


TTTTCTCTTT 


AGGTAAGATT 


AAGTTCAAAA 


TAATTCCTGA 


1200 


AAGTGCAGCT 


CGTGCC 










1216 



(2) INFORMATION FOR SEQ ID NO: 931: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931: 



TAGTGNCGGN 


GNACCTGTAA 


GTAGATGATG 


NTGGGGAATA 


GCGNACCCTN 


CTAANTACCG 


60 


NAAGGGGAAN 


NATATTGCAG 


TANCCACATG 


ANTTGCCCAN 


CACCNTCTTC 


ACAGGGGNAT 


120 


TAATANAGAA 


NCCAGAACGT 


NCGCCGCNAG 


CCCNCANNCA 


GGNAAATACT 


ACATCACTGG 


180 


TGACTTAGCT 


CATGTAGATG 


ATGATGAGTA 


CTATCTGNGT 


TGANGGNCGT 


CGAGATGNCA 


240 


TTATTATTAG 


ATCAGACTAT 


ACGANTGGGC 


CTGTTGAAGA 


TGAAGATNCG 


TTANCGNATC 


300 


ATCCANCCGT 


TAAAGAATGT 


GCTGTAGTAN 


CANAACCTCA 


TGATATACGA 


GGAAATATTG 


360 


TTAAAGCATT 


CGTTATTTTA 


CAGGACCATA 




TGATACGTTA 


GTTAAAGAAC 


420 


TACAACAATT 


TGTTAAAAAT 


GAAGTTGCAC 


CATATAAATA 


TCCGAGAGAG ATTGAATTCC 


480 


CTTTCGTAAT 


ATCAGGCATA 


TTTTGAGTTG 


CAAATAATTC 


ACCTGAACGG 


TTTGGATAAA 


540 


TGAAGAAACT 


ATATTCTTCA 


CCTACTTCTA 


ATTCATCATC 


ATCGTTCATT 


TCTGATTGGT 


600 


TTAACTTTAC 


GTTTTCACCG 


TTTGGTCCTT 


TTAAAAGGTA 


AGTTGAGCCT 


TGTAACCCTA 


660 


CTACTTCAAG 












670 



(2) INFORMATION FOR SEQ ID NO: 932: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932: 



CTGAGAGGCA 


TGACGATGAT 


TCAGATTATC 


GGTGTAATTT 


TAGTGACGAT 


TGTNTTGCTN 


60 


AATCATTGGA 


ACTTTTGGAT 


TTTAGCGATA 


GGATTTATCA 


TTTTAATAGC 


ACCAGNGACA 


120 


GGTGTTGCGA 


CACTTGGTTT 


TNCAATAGCT 


ATGGATGAAA 


GTAGTAGTGG 


TAGAGGGAGT 


180 


TCATCAAGTT 


TATTAGGTTT 


AGTGCAGTTT 


TTATTTGGTG 


GTGTNGCATC 


GCCATTAGTG 


240 


GGGGTAAAAG 


GTGAAGATAA 


TCCTATACCT 


TACATAATTA 


TTATTATAGC 


TACAGCAGNT 


300 


ATATTAATTA 


TTTNACAGAT 


TTATAATATG 


AAGGTATTNT 


AAAACAAATC 


GATGATAAAC 


360 


ATAGNTTCTT 


CTATTATATT 


GAATTGANAA 


ATTAANAAAC 


AAAACTTGAG 


CTATTCATAG 


420 


ACCTATGAAT 


TTAGCATCAA 


AAATAAAAAT 


GCCGAGATTA 


TTATGTGGAA 


ATCTCGGCAT 


480 


TTTTATNTTG 


ATAGCACCGC 


AACATATGTA 


CTATNATCTA 


TCCAGNTGGT 


TTAACTTNAT 


540 


CTAGTTGTTT 


TATAGCATTT 


CTTTATTAGA 


ATGANACCGN 


ATTATTGGAN 


TTGAAGTTTA 


600 


NTCGTATTAG 


CCAATAAAAT 


GTCAGCAACT 


TCTTCAGGTG 


ATTCATNACA 


ACCNTTCGAA 


660 


AGCCAATTAA 


TAAACACCCC 


AGNTTGACCG 


CCTATGATAT 


ATGTAACATA 


AAGCTGTTTA 


720 


TTGGCTACTG 


TAGTACTATA 


GTCATTCATA 


ATTTGTTGGA 


TAGTAATCTC 


TCGCTAAATT 


780 


TATATAGTCT 


ATGAATAATG 


TTTTCATTGG 


ATAGGTAACT 


AAAATGTCTT 


TGAAAAACTG 


840 


CTCATTGTGC 


TTGATATACT 


TAGCAATAAT 


AATGAAGAAT 


TTATAAAGTT 


TAAGCTTATC 


900 


CTGTTTAATA 


TTTTCGAAAT 


TATGATGTAG 


CGCGCTTAGA 


AGAAATTGGA 


ATTTTTCTTT 


960 


ATGCTTTGAG 


TTGATGGTAT 


CCAATAAATC 


AAATTTATCT 


TTGTAATAAT 


CATAAAAAGT 


1020 


AGAACGATTA 


ATATTACAAT 


ACGCGCAAAT 


CATTTTAGTG 


GATATTTCTT 


GAAAAGGATA 


1080 


TTCAGCCATC 


AAATCAACAA 


GTGCGTTTTG 


AATTTCTATT 


TTGGCTTGTC 


GGTTCATATT 


1140 


GTTACACCTT 


TGGGGAATTT 


TTAANCCAAC 


ACTCTTATTA 


AAAATTGGTG 


GGGGTCTTTT 


1200 


AGTTATCATG 


TCTCTATAAT 


AAATAAAGTG 


AATTTTAAAA 


GCAAATCGGA 


GGAAATGTTT 


1260 


CATGAAAAAG 


ATGATATTAA 


TTAATGTCAT 


TACGGTTGTT 


GTACTGTTAG 


CAATTGGGAT 


1320 


TGCTGGCTTT 


TATTTCTGGA 


ATAAAACAAC 


AAGCTATGTA 


ACAACTGATA 


ATGCAAAAGT 


1380 


AAATGGCGAT 


CAAATTAAAA 


TAGCAAGTCC 


TGCATCTGGA 


CAAATTAAAT 


CACTTAATGT 


1440 


TAAACAAGGT 


GACAAACTCG 


ATAAAGGTGA 


CAAAGTAGCA 


ACTGTTACTG 


TACAAGGTCA 


1500 


AGATGGCGAA 


ACGAAAGATA 


TGGATCTTAA 


AATGCCTCAA 


AAAGGTACAA 


TTGGCAAATT 


1560 


AGATGGTATG 


GAAGGTTCAA 


TGCGTGCAAG 


CTGGTAACCC 


AATCGCTTAT 


GCATACAATT 


1620 


TAGATTGATT 


TATATGTNAC 


AAGCAAATAT 


TGACGAAAAA 


GATATTAAAG 


ATGTTGAAGT 


1680 


AGGTAAAGAT 


GTTGATGTGA 


CTATTGACGG 


TCAAAAAGCT 


TCTATTAAAG 


GTAAAGTTGA 


1740 


TAGTATTGGA 


AAAGCAACTG 


CTGCAAGCTT 


TTCATTAATG 


CCATCATCAA 


ATAGTGATGG 


1800 


TAACTACACT 


AAAGTTTCTC 


AAGTAATACC 


TGTTAAAATC 


ACTCTAGAAT 


CAGAACCGTC 


1860 


TAAACAAGTT 


GTTCCTGGAA 


TGAACGCTGA 


AGTAAAAATC 


CATAAAAATT 


ANGGGGGGTC 


1920 


ATTAAATGAC 


TACGACCTNC 


ATTATTAGCT 


ACATTATTTT 


AGCGCTCATT 


ATTGTTGGGG 


1980 


TTATCAATTT 


ATTTTTAATA 


AGATCCNAGA 


AAAAAAGGCA 


AACGCCANCA 


AAAGGAACAA 


2040 


CAATTTACGA 


CACGTCAATC 


AAATCAATCT 


AAATTTAAAG 


CTAGTGATTT 


AGACAAAACA 


2100 


ACTGATCAAT 


CGACACAACG 


TATGACGCAT 


GAAGAGTTGC 


GTGTTGACAA 


TCAAGATGAT 


2160 


CATAGCCAAG 


TTAGTCTAAA 


TGGTTACACA 


AAGGGGTCTG 


AGAAAGATCA 


AGAAGCATTC 


2220 


ACTAATAATA 


ATGGTGAGGA 


AGCAGTTGCT 


GCTAAAAATC 


CTGAATCAGA 


AGAATATAAA 


2280 


GTGAATGAAA 


AAATAAAAAA 


AGAACAG 








2307 
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(2) INFORMATION FOR SEQ ID NO: 933: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 : 



CCCCAATCAA 


NNACCCAATT 


TGTATTGATT 


TGTGTGTNAA 


CCATCAATNN 


ATTGATACAA 


60 


TTTAATTCCA 


AGTNNNATTG 


CCCGTAATTT 


NCACNNNAAG 


AGANNAAAGA 


GAGGATGTAT 


120 


CGTTGTGATA 


AGAAACATTT 


AAAGCGTTTG 


TAATTTGATA 


AAGATGAGAG 


CGGTAAAGTG 


180 


ACACCCAACT 


TTCAAACAAT 


TATCGCCTAC 


TTGATTTACC 


TAAAGGAGAT 


GTGCTGATTA 


240 


AAGTACATTA 


CTCTGGGTAT 


AAATTATAAA 


GATGCTTTAG 


CGACTCAAGA 


TCATAATGCA 


300 


GTCGTAAAAT 


CGTATCCTAT 


GATTCCAGGA 


ATAGATTTAG 


CTGGAACAAT 


TGTTGAATCC 


360 


GAAGCACCAG 


GCTTTGAAAA 


AGGAGAACAA 


GTAATTGTAA 


CGAGTTATGA 


CCTAGGTGTT 


420 


AGCCATTATG 


GCGGTTTTAG 


TGAATATGCG 


CGTGTAAAAT 


CAGAATGGAT 


TATCAAGCTT 


480 


CCTGATACTT 


TAACATTAGA 


AGAATCAATG 


ATATATGGCA 


CAGCTGGTTA 


TACTGCCGGT 


540 


TTAGCAATTG 


AAAGACTTGA 


AAAAGTTGGA 


ATGAATATTG 


AAGATGGTCC 


TGTACTCGTT 


600 


CGCGGTGCTT 


CAGGTGGTGT 


CGGTACTTTA 


GCAGTACTCA 


TGCTTAATGA 


ACTTGGTTAT 


660 


AAAGTTATCG 


CAAGTACAGG 


TAAACCAAGA 


TGTTAACCGA 


TCCAATTACC 


TTGAACTTGG 


720 


TTGCCAAAGA 


AGTTATCGAT 


CCGACTTCCT 


GTTGAAGATG 


ATCATAAAAA 


ACCCACTCGC 


780 


ATCATCAACT 


TGCCAAGCTN 


GTGAAGACCC 


TGTTGGTGGC 


GAAGGTATTA 


ATTATGTTAC 


840 


NAAACGTTTA 


AATCATAGTG 


GATCAATTGC 


AGTTATTGGT 


ATGACTGCCG 


GCAATACTTA 


900 


TACTAATTCT 


GTATTCCCTC 


ACATTTTAAG 


AGGCGTAAAC 


ATTTTAGGAA 


TTGACTCAGT 


960 


ATTTACTGCT 


ATGAAATTAA 


GACACGAGCG 


TTTGGCGTCG 


TCTCGCAAAA 


GATTTAAAGC 


1020 


CTGAAAATTT 


ACATGAGATC 


AAGCAAGTTA 


TTACATTTGA 


TGAACTTCCA 


GAACAACTTA 


1080 


ACAAAGTAAT 


TAATCATGAA 


AATAAAGGCC 


GCATTGTTAT 


CGATTTCGGT 


GTAGATAAAT 


1140 


AGTAGTCATG 


AAAAAGACAT 


CCCGTTATGC 


GAGATGTCTT 


TTTTAATTTA 


GTATTTGATA 


1200 


TACATACCGC 


CTGAATCTGG 


TTCGGTAGGC 


ATAAATCCAA 


ATTTTGCATA 


TAATTTATCC 


1260 


GCCGGGTAAG 


TCCTGCGGAT 


AAAGACTAAC 


GTATGCACTT 


TCGACAAGAA 


TACATTTTTA 


1320 


ATAATNCTTC 


ATAATATGTT 


CCATAATTAG 


CCTGCCGTAA 


TCTTGACCTT 


GATAACTTTT 


1380 


CGAAACAGCA 


ATATCAACAA 


TTTGAAAAAC 


AGTTCCGCCA 


TCGCCAATCA 


CTCTACCCAT 


1440 


ACCAATTAAC 


CGATCTTTAT 


CATACAAGGT 


TACTGTAAAT 


AAGGCATTAG 


GTAATCCTTT 


1500 


TTCAGCTGCT 


TCGCGCGTCT 


TTGGACTCAT 


ACCTGCGTTA 


ATCCTTAATG 


CGCAATAATC 


1560 


CTCGCAAGTC 


GGAATATCAT 


ATGTCACTTT 


AACCATTATT 


TACCCCACTT 


TTCATCACAC 


1620 


AATATATCAA 


CCTAGTATAA 


ATGTTTATTT 


ACAATAGTCT 


TATTCGCTTC 


TTTAAACACT 


1680 


TCATGATGAC 


TTGAAACATA 


ACCCTCTGCA 


TTCGCATCTG 


GTTGGATATA 


TGTTTTAGCA 


1740 


AGGTTCGCTG 


CATTTGCACC 


ATCACTAAAT 


GCACTTGCAA 


TTAGATGTGA 


TTTTGCATCA 


1800 
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TGATAAACAA TATCTCCACA CGCATAGATA CCAGGTATAC TAGTTGTCGT ATTACCAAA^ I860 

CCTTTAACAC GACAATCATC ATGCATATCT AGCTTTGAAG ATGTTTCACT CAATAATGTA 1920 

TTACAACGAT CAAACCCATG ACTAATAATG ACATCGTCAA ATTTAACTGT ATGCCTATCG 1980 

CCACTTTCAA CATGTTCCAA ACACTCACTT ATATGCGTTT CATCATCATT GCCGACCAAG 2040 

TATT 2044 

(2) INFORMATION FOR SEQ ID NO: 934: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934: 



CCAAGTAAGT 


ATAGAAGCAG 


CATCCTTAGA 


AAAGAAAAAT 


GTAGATTTGA 


ACGAATTATT 


60 


ACAGCGTCTC 


AATGACGTTG 


AACAAACGGA 


ATACACCAGG 


TTCGCTTNAC 


CCTAAATTTT 


120 


TAAATGTTTC 


GTCACTTATT 


CAATATATTC 


AAGCAGCATA 


TCATGAACCT 


CTTAGAGAAG 


180 


AATTTAAAAA 


TTTAACACCT 


TATGTGACGA 


AATTATCGAA 


AGTACATGGA 


CCTAACCATC 


240 


CATATTTAGT 


CGAGTTAAAA 


GAAACATATG 


ATACATTTAA 


AAGTGGCATG 


TTAGAGCATA 


300 


TCCAAAAAGA 


AGATGATGTT 


GATTTTCAAA 


ACTAATTAAA 


TATGAACAAG 


GTGAAGTAGT 


360 


AAACGATATT 


AATACAGTGA 


TTGATGATTT 


AGTATCTGAT 


CACATTGCAA 


CGGGACAATT 


420 


GTTAGTGAAA 


ATGAGCGATT 


TAACATCTAG 


CTATGAACCA 


CCGATAGAGG 


CATGTGGTAC 


480 


GTGGCGACTC 


GTTTATCAGA 


GATTAAAAGC 


ACTTGAAGTG 


TTAACACATG 


AGCATGTTCA 


540 


TTTAGAGAAT 


CATGTTTTAT 


TTAAAAAAGT 


ATCATAAATA 


ACGCGATTAG 


AAACTGTTGG 


600 


CAAAGTAATG 


TCAGCAGTTT 


TTCGCTATAC 


TTAACAGAAA 


TTTAGTGATA 


TGAACAGCAT 


660 


TATTTGAAAA 


GAAAAATGGT 


CAACTTAGCA 


TAAAAATTGA 


TATGAAAATT 


TAATGGTATA 


720 


GATAATTAAA 


TAGTAGCGTG 


TTTTTrTAAT 


AATTTATTCA 


TGAATTTTAC 


ATGCACTATT 


780 


ATGATAAAAT 


AAACATAATT 


ATAATTCACT 


GAGGTGCTAT 


CGTGCTATCG 


CTAACAATGT 


840 


TATTACTTGA 


GCGTGTAGGT 


TTAATTATTA 


TTNTGGGCCT 


ATGTGTTGAT 


GAATTATTCC 


900 


ATATTTTTAA 


AAACTTTAAT 


GAATCCGTCG 


ACGCTACATG 


GAAAGCACGT 


TGGCAATTAT 


960 


GTATTATTTT 


CAGTTTGTTT 


GCCTTAATGT 


TCTAATTTAA 


CTGGTATCGT 


CATCGATCAT 


1020 


CAACATAGTT 


TGTCAGGAAG 


TGTGTACTTC 


CGTTTAGATG 


ATGATGTATC 


TTTAGCTAAC 


1080 


ACACGTGTAT 


TAACGATAGG 


TGTCGCAGGA 


TTAGTTGGTG 


GCCCTTTTGT 


AGGTCTATTT 


1140 


GTTGGCGTTA 


TTTCAGGTAT 


TTTCAGAGTG 


TATATGGGTG 


GGGCGGATGC 


ACAAGTTTAT 


1200 


CTTATCTCAT 


CTATATTTAT 


CGGNATTAAT 


TGCTGGTTAT 


TTNGGGCTTA 


CAAAGCTCAA 


1260 


AGGACGCAAA 


GCGGTTACCC 


GAGGTATTTG 


GCGAAAAAGT 


GGCCAATGAA 


TTGGGANTTG 


1320 


GTTATTGGGA 


AATGAATCAA 


ATGTNGGAGC 


AATTTTAACA 


TTTTTCCCCA 


CGACAAAGCA 


1380 


TATTGCGGTT 


GACTCATATC 


ATTAATTGCA 


CTACCAATGA 


TTATTGTTAA 


TAGCGTTGGT 


1440 


ACGGCGATTT 


TTATGTCTAT 


TATCATTTCC 


AACATTAAAA 


GCAAGAAGAG 


CAAATGAAGC 


1500 
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PTflwiV^r AAA. 


l~ AUA 1 \xACGT 


ACTGCAAI 1 vj 


A IV* A AOOAOA 

ATUAACCAGA 


CA1 TCCCGTA 


TTTTAAAGAA 


1560 


LAaAl JluAAlA 


oAUAAJXAjCjC 


AO AOOA A A rrvp 

ACAGCAAA1 1 


PPPHWRTT* 


TTAAAAATTT 


AATGAAAGTA 


1620 




PKHnvnspK AO 

CAATTACAAG 


OA A A A A fTV* A A 

CAAAAATGAA 


ATCTTATCGC 


ATGTAGGTGC 


AGGTAGTGAT 


1680 


CATCACATAC 


^ a a^a iv imri a 

CAACAAATGA 


AATATTAACA 


AGTCTGTCTA 


AAGATGTATT 


GAAATCAGGA 


1740 


X AOHWIV* A A ao 

AAGTTUAAAG 


AAGTGCATAC 


TAAAGAAGAG 


ATTGGTTGTA 


GTCATCCGAA 


TTGCCCGCTT 


1800 




qw-u-tmp A O A OO 


KPUVnPKPIl'liP 

ACTTGAGATG 


PKmppnvnPmik 
CATGGTTCTA 


TCGTCGGTAC 


ATTGAAGATG 


1860 


TATTTTACAA 


AC LaJTAATGA 


TTTAACTTTT 


GTGGAACGTC 


AACTTGCAGA 


AGGATTGGCA 


1920 


A A A ^P^P^P^^f A 

AAlAl 111 IA 


PTliPPPA A A 1* 

0 1 AUCCAAA 1 


TGAACTTGGT 


GAAGCCGAAA 


CXtCAAAGTAA 


GTTATTGAAA 


1980 


uA IvjV, IuAvxA 


rp#Ti A A P<lip A rpm 

i x aavjTCAi r 


A O A OOO A O A A 

ACACaGCACAA 


GTGAGTCCAC 


ATTTTTTCTT 


CAATTCAATT 


2040 


AALnLuA 111 


CAGCTTTAGT 


CTAOA AH* A A a ni 

TAGAATAAAT 


AGCGAAAAGG 


CACGAGAGTT ACTATTAGAA 


2100 


TTGAGTTATT 


TTTTCAGAGC 


GAATTTACAA 


GGCTCAAAGC 


AACATACGGA 


TTACTTTAGA 


2160 


TAAAbAGllA 


AGTCAAGTGC 


GTGCATACTT 


ATCACTCGAA 


CAAGCACNGT 


TATCCAGGAA 


2220 


uAX 1 1 AAlAl 


OA a rn A mni a act 
LAA 1 Al 1 AA 1 


GTTGAAGACA 


AATATCGCGA 


TGTGCTTGTA 


CCACCATTTT 


2280 


lAAi IvAAAl 


TTTAGTTGAA 


a a mooo a mn iv 

AATGCCATGA 


AACATGCGTT 


TACGAATCGA 


AAGCAAGGTA 


2340 


a oo Tv *t> a rmv^ a 
ACGATATTGA 


CGTGTCAGTG 


ATTAAAGAAA 


CTGCAACACA 


TGTACGTATT 


ATTGTACAAG 


2400 


ATAATGGTCA 


GGGTATTTCT 


AAAGATAAAA 


TGCATTTGTT 


GGGAGAAACA 


TCTGTAGAAT 


2460 


O A O A A Ifwimpfi 

UAUeAATCTGU 


AACTGGTAGT 


GCTTTAGAAA 


ATTTAAACTT 


ACGCCTAAAN 


GGATTATTTG 


2520 


gaaaatccgc 


AGCATTACCA 


ATGTGAATCC 


GACATCGAGN 


GGTACCACTT 


TTTGGTGTGT 


2580 


ACCTTCCTTA 


TGAAAGACAA 


GAGGAGGAAT 


AAATATGAAA 


GCATTAATCA 


TAGATGATGA 


2640 


GCCAl 1AGGA 


CGCTAATGAA 


TTAACATATT 


TATTAAATGA 


AATTGGTGGT 


TTTGAAGAAA 


2700 


TTAATGAGGC 


AGAAAATGTA 


AAAGAAACAT 


TGGAAGCACC 


TACTTGATCA ATCAATATGA 


2760 


CATTATATTT 


TTAGATGTCA 


ATTTAATGGA 


TGAAAATGGG 


ATCGAATTAG 


GAGCTAAGAT 


2820 


rtv** A a a a rnr^ 

TCAAAAGATG 


AAAGAGCCAC 


CTGC6ATTAT 


TTTTGCAACT 


GCACATGACC 


AATACGCAGT 


2880 


ACAGGCATTT 


GAATTAAATG 


CGACAGACTA 


TATTTTGAAA 


CCGTTTGGTC 


AAAAACGTAT 


2940 


TGAACAAGCA 


GTCAATAAAG 


TGCGTGCGAC 


TAAAGCCAAA 


GATGATAATA 


ACGCAAGTGC 


3000 


AATTGCGAAT 


GATATGTCGG 


CGAATTTTGA 


TCAAAGCTTA 


CCTGTTGAAA 


TTGACGATAA 


3060 


AATTCACATG 


TTAAAGCAAC 


ANAATATTAT 


TGGGATTGGC 


ACACATAATG 


GTATTACAAC 


3120 


CATACATACA 


ACGAATCATA 


AATACGAAAC 


AACAGAGCCA 


TTGAATCGTT 


ATGAAAAACG 


.3180 


ATTGAATCCC 


ACTTATTTTA 


TACGTATTCA 


TCGTTCATAT 


AG 




3222 



(2) INFORHATION FOR SBQ ID NO: 935: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1660 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935: 
TATTGTGATT CCTTGTNNTT GAGTGGAAAA TCCCTTAAAT AATTGGTGCC AACTCATCCA 60 
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AGCTTTGAAT CAACCCATAT CATCANNCCG CNTTTGGNCC CCAATTGTCA TCGTATCCTT 120 

GGAATTGAAG GATACCCAAA TGATGCATAA TCAGCTTCTG TANAAAGCTG TAATGTAATG 180 

TCATCAAGGT CACCTTTTCT AACAAGCACC TTATTAATAG AAGTATGATT CGCTTGCCAA 240 

GTACCTTGAT TATTTTGTTC TAAATGAATG ACTTCGCCTA ACGATTTCAA CGTAATATCT 300 

GCACGCTCGT CTTCGCTAAT AGTATATGTC TTACCATCTC GCAAATTGAG CATCTTCAAT 360 

TGTTTGTTAT ATTTTATAAT CAATTTATGC ATTGTCTTTG CCTCAGTCCT ATACTATTTT 420 

TTTCTTTCAG CTTCTTGGCG TTTTTCTTTA TCTTTTTGTG CTTGTTCTTT TTGTTTCTTT 480 

TCGTTCTCTT CTTGTTGCTT TAATTTCTCA TCTTTCGCTT TTGCTTTCTC TTCTTCAGAT 540 

TTCGCTTTTT CATCTTTAAC TTGTTTTTCT TTGTCTAAAA TATCTTGCAA TTTAACGNNA 600 

GACTTTTTCG TTTCTTCAGA ACGTTTATCA TTCGATAAAT CTCCGTTATT TTTAATCTCA 660 

NTTAATTTAT TAANCAACGC TAACTTTGTA ATATCGTTAT CATCTAAAAT AAAGTGGCAA 720 

TATTAATCGC TTCATCAAGA GGTCCTTGTC CTAAATTCCA TCCAAGATAA TAAAGTAGTC 780 

TTGGTTTGAA TTTGGTGTCA CGTTATTAAG TAAATTTTCG GTCTTATCTN TTTNCNAAAC 840 

CTAGTNGAGT GGGGTGGAAA TAACCCCTGG CATAAATGTN AAGTNCCCCT CTAGCTAATT 900 

TTTTTCCATC TAAATCATCA CACGTATTTA GTACTTGCGT ATAATC ATC C CTCACANATG 960 

CGTGGNATCC TTTTTCAATG CGCTCATTAT GCTTCATTAC TGANAAATAT ANAAAGGCTA 1020 

AAAATGCAAT TAATAACACA CTTAACGTTG TCATGCCGAT AGCAACCCAT TTGAAAACGG 1080 

TATGCCCTAC TTTGCGTACA TATGCATAAT TTTGACTATA ATCTTGTTCT TGTTTCTGAT 1140 

ACTGTTCATC TAAAAATGCC GTTAGTAAAT CTAACGTTGC CGCTTCAATA ACTTTAGTTT 1200 

CAAATGGCGT TCCTTTATGT AGTTCTAAGT TTCCTTCAAC TAAAGCATCA AATGATTGTT 1260 

TCTCATTGAA TGCACAGATA ACCAGTGCTT TATATCTTGT TAAAAATTCA GCTTCTGACA 1320 

CCGGNAATGG ATCAACAACA TTTTGTAAAC CTCTTGTTTT AGCAATTGGT AATCCATCTC 1380 

TTGTGAAAAA TAACTCATCT GGTGCCCAAA CAAATGTGTA ACGTGTACGA TTTACTTCTT 1440 

CTAAATTTTT GATATTGAGT AAGTATCGTA ATTTTTCATT TTTAGTAAAT GATTTAAGAT 1500 

TATCACAAGG TGTATGATTG TCATTAAGGT CATAATGTAT TTGGAAACTG TCACGTAGTT 1560 

CAGTTAATTC AGCATCTATA AAATATGGAG AATGTTGTTC TAATAAGTAC ATTAAATGAA 1620 

AATGTTCTGG NTTAATTGAA GACTTAGGAA TCTCTCTCAG 1660 

(2) INFORMATION FOR SEQ ID NO: 936: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEENESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936: 

GGTCGTGGTT CATTGGTAAA AATCCTAAAA GCCNCCAATG TAAAGTAACA NTTCAAAGCG 60 

TCATGGAAGA NTCATTAANN CTTANCTCNT TGGTCCATCA AACNGATGTA TTTTAACTGA 120 

TTGATCTNAA CCCACCGACG GCGTTGATAA TGTTGATCCA ATTAAATCCA ATCCAACCCT 180 
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TCTGGAAGCG 


TTGAACCCAA 


CCAATTCTAA 


GAAGCTGATA 


TTATNACCAC CAACCTGTT£ 


240 


GTTGTTNAAN 


AATACTACCC 


TATTATTTGC 


TAAATCTTTT 


oLUCL 1 1 I v~ 1 


uiUUiAAuAAA 


300 


AAACCAAACC 


ATGTTAATAT 


TGTTGCTTGT 


GAGAATGCTA 


1 In 1 uuV. /\/iV_ 


TVS A T 21 r* A T V PP 
l\iAl ALA 1 1\3 


360 


AANAAAGCAC 


GAGTACTTGA 


TATTACTGGC 


CCTCTTGGTA 


AnttATATTT'A 


ill luL 1 AAL. 


420 


TCAGCAGTTG 


ATAGAATTGT 


ACCATTACAA 


NAGNATGAAA 


A1A1A1 lAun 


/"•/■"•fiwr 1 & nirwnT 
Lul lAloul 1 


480 


GAGCCATTTT 


ACGAATGGGT 


TGTTGAAAAA 


GATGCATGGT 


21 TVyi»nnr« * /*» x 
A Ivjvj 1\,LauA 


AL 1AAAULA1 


540 


ATTAAATATG 


TTGATGATTT 


AACACCATAT 


ATTGAGCGTA 


AAI 1 AI I AAL 


TGTGAATACA 


600 


GGACATGCAT 


ATTTAGCGTA 


TGCTGGTAAA 


TTTGCAGGTA 


AAGCTACCGT 


ATTAGACGCT 


660 


GTAAAAGATA 


GTTCAATTGA 


AGCTGGCTTA 


CGCCGTGTTT 


TAGCTGAAAC 


AAGTCAATAT 


720 


ATTACTAATG 


AATTTGATTT 


TACTTGAAGC 


GGAACAAGCT 


GGTTATGTTG 


AAAAAATAAT 


780 


AGACCGNNTC 


AACAATTCTT 


ATTTATCTGA 


TGAAGTGACA 


CGTGTCGGAC 


GAGGTACGGT 


840 


TACGTNAAAT 


TGGCCCTAAA 


GATAGAATTA 


TAAAACAATT 


AACATATCTT 


TATAATAAAG 


900 


ATTTAGAACG 


CACTGGTTTA 


TTAAATACAG 


CTGCATTGTT 


ATTGANGTNT 


GATGATACAG 


960 


CAGGACCAAG 


AAACTGTTGG 


AGAAAAATNN 


TNACATTAAA 


GGACCACG 




1008 



(2) INFORMATION FOR SEQ ID NO: 937: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937: 



AAGTTATTAT GGATTTTTGA 


CCCAGGAACN 


CGATGATNCA 


ATAACATTAA TTTAAGNGGG 


60 


GGCCAATTGA 


CAATCCCACT 


AGAGATATTA 


NCTGTACAAA 


CAGTCGCAGG 


TAATCCAATC 


120 


AAGTTGACNA AGAANACGAC 


ANACNCCTTG 


GACGTATTGG 


ATATTATGGG 


ACCCCAAGAT 


180 


ATAGCAGTAG 


CGAAAGGTGC 


GGATAGCCCG 


TTAATTAAAC 


CANCTGCCTT 


TGCTTCTGAA 


240 


ATACATGGGG 


AATCTGGATT 


AGATGGTCCG 


AAACTACCGT 


CGACACCATC 


ACGTCAAGCA 


300 


GTTGCAATGC 


CAGCATCAGA 


TGTGATTATA 


AACAAAGTGA 


TGACGAGTGA 


TACACCTGTA 


360 


ACAATTGTAG 


CGACAGGTCC 


TCTTACGAAT 


GTAGCAACGG 


CATTGATTCG 


TGAGCCAAGA 


420 


ATCGCTGAGC 


ATATTGAATC 


TATTACTTTG 


ATGGGTGGTG 


GTACATTTGG 


AAATTGGACG 


480 


CCTACAGCAG 


GAATTTCAAT 


ATTTGGGTAG 


ATGCTGAAAG 


CAGCGAAGCG 


TGTTTTGGAA 


540 


AGTGGGATTA 


CTATAAATGT 


GTTTGGTTTA 


GATGTAACAC 


ATCAAGTTTT 


AGCCGACAAT 


600 


CACGTGATTG 


AACGCTTTGA 


AAGTATCAAT 


AATCCTGTTG 


CACAGTTCGT 


CGTAGAATTA 


660 


TTGCAATTCT 


TTAAGAAGAC 


ATACAAGACT 


CACTTTAATA 


TGGATGGTGG 


TCCAATACAT 


720 


GATGCTTGTA 


CAAATTTGTA 


TTTGTTACAA 


CCAGAATTGT 


TTACAATGGT 


ACCCGTTAAT 


780 


ATNGACATCT GAACATCAAA 


GTCCACTAAC 


TTATGGCACT 


ATGGCTGTCG 


ATTTAAATCA 


840 


TGTTACAGGT AAGCCNGCCA 


ANGCTTATTT 


TGCTACAGCA 


GNTGNTGTTG 


AAGAAGTGTG 


900 


GNACTTGGAT 


AGGACCATNA 


GNTANGTCTC 


GTGCCGATAT 


CAAGCTTATC 


GTACCGTCGA 


960 
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CCTCGAGGGG GNCCGGCCNA A . 981 
(2) INFORMATION FOR SEQ ID NO: 938: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938: . 

TTTAACAACC CTTGTTTAAN TAAGGAATCC ACTTGAATTA TCAAGTTGTG AACCACCTNC 60 

TTTTTACATA ATCTAAACCG TACTTTTGAA NACCCNCNNT TANCAAATAT CTNTTAATTC 120 

TTCCACCTNC NTTNACGTCT TTGTTGTGTA CATGTTGGTG AGTTGNNAAG TAAGCTATCT 180 

GAGATAATTG CTGATAACAT TAAACCGCCA ATTTCAGGGT TAATTTCANA GCCACGTTCT 240 

CTAAACATTT TGTATAAAAT TGTAGCTGTA CACCCAACTG GTTCAGCACG ATAACATAAA 300 

GGGCCAGCAG TTTCGAAATT TGCAATTCTG TGAGGATCAA TTACATGCTT AATGGTAGCA 360 

GAGACAATCG TATCNGAACT TGGTNGGAAT TCGTTATGAT CAACTAAGAT AACATCTTGA 420 

CCATCTAAAT CATCCGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC TAACGCGAAT 480 

TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT ACCTCTAAGT 540 

TGTTCAAATT CTGCCATAAT AATCGCAGAT GAAATTGCAT CAGTGTCTGG ATTCTTATGT 600 

CCGAAAATAT ATGTTTTAGC CATTGTCAAA TATCTCCCTT GTAAATTGTA TTCTTTATAA 660 

TTATTTTAGC ACGATTTAAA CGTGTTGTTC TACCTCTGCA CCTAATGAGT TTGTGGAAAT 720 

GTGCAAGTGC CCATTCATGC CCTTTGTGGA TTAAATGAAG CCACTCCCTC AGCAGGTACT 780 

GAAATTTTAT AACCTAAGTT GTATTGCAGG AAATTGCTGT ATTGGTAACA ACGCAAATAT 840 

CGGGTACAAT ACAACCAACG ATTTCGACTT GGATTAATAC TTCTTTCTCT CAAATAAACT 900 

ATCAAGCGGG GTACCAAAGA ACGAATCATA GCGCGTTTTA TCAATGAAAT GTACATTAGG 960 

TTGCGCTTTA ATTGTGTCGN ATAATTTACC TACTTTACCG TATAATTCTC TACCACTTGT 1020 

ATCTACGATA TTGTGTGGTG GGAATAATTT ACTTTCAGGA TGATGAATGT CATGTAAATA 1080 

ATGTAAATCC ATTAAAAAGA ATATATGGTC TTGATAATAA TTAAAGTCAT TGATACGAGA 1140 

AACAATAAAA TCTTCAATAT TTTGTCCAGG TTTACCGCAT GTTAGTAAGC CGTCATCTGC 1200 

GATAAAGTCA TATGAATAGT CAACAACTAA TAGTGCGCGA TGTGTCATTC AACATCTCTC 1260 

CTATATAATA AAATAAACTC CCATCTATTC ATAAATGCTA GAGTAATTTT TAACCCTAAT 1320 

GGTTAATATA TAAATATAGA TTAANGTATA TAGATTAATC AACTTNGNTG GAAGAGCAAA 1380 

TCACGCAATC ACAAATAATA TTAGAAGTTT TTGGCGATNG TTCAAAATAG CTGCTATAGA 1440 

TACTAATTGT GAAAACTTAG ACTATTCAAG TGT 1473 

(2) INFORMATION FOR SEQ ID NO: 939: 

(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 1372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939: 



AAGAGTTGTC 


GGATCATNAC 


CGCCAAGTAA 


AAAGCGGATG 


ATTGACCTCC 


CACCTGGATT 


60 


GCATTGTATT 


TGCCAAATCG 


TATCCAAGTA 


CAATGGTACC 


CGTAGTTTAA 


TAAGTCTCCC 


120 


AAGCACCTTG 


AGAAGTCGCC 


AGTAAAGCAA 


CTTGAATTAA 


AGCAGTGGAA 


TAAACCCTAC 


180 


TATGATTTGG 


ACTTGCCACG 


GGATATCACT 


TGTCCTATGG 


CTTGGAATGC 


AATTGTAATT 


240 


AACGTCCACA 


AACCCTGTAT 


GATATTCATA 


ACGTTTGTGA 


TGATACCTAT 


TACCACAACA 


300 


CCTAAAACTT 


GCATGAATAT 


TTGCCCTAAA 


ACTTGCAATA 


TAGGCATTAT 


CGGTTGTAAG 


360 


GTAGATTGGA 


TTTTGCCCCA 


CAATTCAGTT 


AACCAGCCAA 


CTACACCTTG 


AATCGCACCA 


420 


GAAACCGCCG 


TTTTAACACC 


GTTCCACGCT 


TCAGTAATAG 


TGTTTCTGAA 


ATTCTCGTTT 


480 


GTTTTCCATA 


AATAAACTAG 


GACACCGATA 


AATGCGCCAA 


TTACTGCAAT 


TACCGCTAAA 


540 


ATAGGTGCTG 


AAATCGTTCC 


AAAAACACCT 


GTTAATGCTT 


CCAAAGCTCC 


AGTAACTAAA 


600 


CTTGATGTTC 


TAACGAATTC 


TAAAATCTTT 


TCGACGACAC 


TGAATAAACT 


CAAGCCAAAC 


660 


ACATTTGTAA 


GCACACTACT 


TATAGCAACA 


ATCGGAGCCA 


TTAAAGCCCA 


AAATACACCA 


720 


CCTAAAATAC 


CCATAACACC 


AGCAACTTGT 


GCTATAGCTG 


GGTGTGTCTC 


GAATAGCTTA 


780 


GCGATAAAAC 


CAGCTAGATT 


AGTGATAAAG 


TCTAACAATT 


TACTAGCTAT 


AGGAGCCATT 


840 


GCAGTGCCAA 


ATGCTACTAA 


TGCTTTTATG 


ATGTTACCGA 


TTAATTGCAT 


AATAGTAGGA 


900 


CCATTCTCTT 


GAACGTAACT 


TATAAAGTCT 


TTGAACCCTT 


GTGATTGTCC 


TACTTGTTCT 


960 


GACCACGCTC 


TAAATTGAGA 


AGTTAATTTA 


ACTAACCAGT 


CAAAAATATT 


AGAACTGTTT 


1020 


TGGGCAAAAG 


CAATCATTAA 


NTTACCAATA 


CCAGCGAATA 


CATTACCAAT 


TTCTTGGCCA 


1080 


ATCTTAGGTA 


AGTTAGTGGG 


AGGGTAGGCC 


AATAAACGNT 


TTAATAGCAT 


CTTCAGACCA 


1140 


GGCTACACTA 


TTAGCCCAAT 


TGGGGAAAGN 


TATAGACAAN 


NTCCTGTNAT 


CCTTGAGACA 


1200 


CGAATTTGAA 


CAATGGCATT 


AATTGAGTGA 


GAATGTTAAA 


TAATCCGTCG 


CCAAATCGTC 


1260 


CTGCAGCGTT 


CAATAAATCT 


CCGAAGATTG 


CACCACCTAT 


GCTATTCAAT 


GCTTCAAACG 


1320 


CTTTCTTAGC 


CGTTTCAGAA 


TGTTTAACCC 


AATCNTCAAA 


CTCGCTCGTG 


CC 


1372 



(2) INFORMATION FOR SEQ ID NO: 940: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 940: 


- 




GGCACGAGCG 


TAGATTTAGC 


ATTTTCAAGA 


GACCAAGAAC 


ACAAAGAAAT 


ATGTACAGCA 


60 


TCGTATAATG 


GAAGAAAGTG 


AACGTTTCAA 


TGAATGGATT 


GAGCAAGGCG 


CAGCAATCTA 


120 


TATTTGTGGC 


GATGAAAAAT 


GTATGGCGAA 


AGATGTCCAT 


CAAGCCATTA 


AAGATGTATT 


180 


GGTAAAAGAA 


CGTCATATTT 


CTCAAGAAGA 


AGCAGAGTTA TTATTGCGAC 


AAATGAAACA 


240 


ACAACAACGC 


TATCAACGTG 


ATGTTTATTA 


GCGATTGGTG 


TTAAATATTT 


TAAGGTGTAA 


300 


TGATGTAAAA 


AGATATAAAG 


GATGTTGCTC 


AACATGAATA 


TGCCATTAAT 


GATAGATTTA 


360 


ACAAATAAAA 


ATGTCGTCAT 


AGTTGGNTGG 


AGGCGTCGTT 


GCAAGTCGTC 


CGGGCACCAA 


420 


ACATTAAATC 


AATACGTTGA 


ACATATGGAC 


GGTCATCAGT 


CCGACAATCA 


CTGAAAAACT 


480 


TCAAAATATG 


GTAGATAACG 


GTGTCGTCAA 


TATGGAAAGA 


AAAAGAATTT 


GAACCAAGCG 


540 


ATATTGTAGA 


CGCGTATCTA 


GTTATTGCAG 


CAACCAATGA 


GCCACGTGTC 


AATGAAGCGG 


600 


TAAAACAAGC 


CTTACCTGAG 


CATGCCCTTT 


TTAATAATGT 


TGGAGATGCA 


TCAAATGCCA 


660 


ATGTTGTATT 


TCCAAGTGCA 


CTACACCGCG 


ACAAGCTAAC 


TATCAGTGTA 


TCAACTGATG 


720 


GTGCGAGTCC 


TAAGTTGACA 


AAATCAATTA 


TGGCAGAGCT 


TGAGGCGTTA 


TATCCACCAT 


780 


CATACAGTTC 


GTATATCGAC 


TTTTTATATA 


CTTGCCGACA 


GAAAATAAAA 


GTACTTGATA 


840 


TAACATATAT 


ACGAAAAGCA 


ACAGTTACTG 


TCACAAATTG 


TGTCACAAGA 


ATATTTAAAT 


900 


CATGACAAAC 


AAGCTCAATT 


TTTAGCGTGG 


TTGGATGTAA 


GATAATAATA 


GCGGACCGTC 


960 


TAACCGTCTA 


AGGTAAGTCT 


TCTTATTTTA 


ACTTTAACGC 


TTAATCATTG 


AAATTAAGAC 


1020 


ATGGGTGGCT 


TTGTGAATAG 


TCTAATAATG 


AAGGATTTAA 


GCGATAATGA 


TATGCGTTTT 


1080 


AAATATGAAT 


ATTACAATAG 


AGAAAAAGAT 


ACGTAGAACA 


AACTTAATAA 


AATAGATGGA 


1140 


TAAATTGAAA 


TCTGGTTGAA 


GTCGTTACTA 


TCATAGCGAC 


CTTTAGCCAG 


ATTTTTTGTG 


1200 


CAATAGAAAG 


CAATAATAAA 


AATGATAGAT 


CAANATGAAA 


TACAGGACAG 


GATATACAAG 


1260 


GATTAGTCAT 


GCCATGTTAT 


CAAGTAGGAA 


AATCAAACTT 


CACTATTGAT 


AGTTACGCAN 


1320 


AAAGATTTTT 


TTGATAAAAT 


GAGATAACTT 


AAAGATAAAA 


AATTATATTA 


ATTATAATAT 


1380 


TTAAGTTAAA 


GAGGGGGATT 


ATGTAAATTG 


TATTAAAAGT 


GGAGGGAGAA 


AATAATATGA 


1440 


ATAGTGAG 












1448 



(2) INFORMATION FOR SEQ ID NO: 941: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941: 

TGCCCAAGAN CCCCTGTANT GGTAATGTCC CGGGGNAAAA TGCCANTTGA TGTGTNNNGG 60 
TGATATGANC CCGCCATTGT AATATTATTA TTCCAGATAA ANCCGAAATA TGANNCGATG 120 
CTCAATATAG CAGACCCACA GGTTCAATCT CGTATTTTAC CAAGGCGGAT ATCTATATAA 180 

820 



WO 97/30070 



PCT/US97/02318 



CTATCGTAGT 


CGTAATAGAT 


ATACTAAAGA 


AATTGTAATA 


TTAACTAACT 


CGGATCAACX 


240 


TCGTAACATC 


AAGGTATTGG 


GTGAAACTTT 


ACCAGATTTT 


AATTTTCACA 


TCGCTGGCTA 


300 


TTACAGAGAT 


GTCTGATAAG 


CTAATGCAAC 


TTGATCAATA 


TGCTAATGTG 


CACTTATACC 


360 


CATCTATTAA 


TATTGACAGG 


GTAAATGAAC 


TNTACCAGTT 


ATGTGATATT 


TATCTTGATA 


420 


TCNATGAGGG 


TAATGAAATT 


TTAAATTGCG 


GGTGGAACAA 


GCGTTTGACT 


ATGAGTTGTT 


480 


GATTTTAGGT 


TATCGTCAAA 


CCGCACACCC 


ATGCTAAAGT 


AACATTATCA 


GAGCACTTAT 


540 


TTGAGCATAA 


TGATGAGATA 


ACAATGGAAA 


GTAAAGACCA ATTAATACAA ATGTTAGAAA 


600 


GTTTGAAAGA 


TCAACAACAG 


TTTCGAGATG 


CTTTATTAGC 


ACAAAAAGCA 


CATGCACACG 


660 


GCACGAGCGT 


GAACAATTCG 


AGCAAGTTTT 


TAAACAGGCG 


CTCGAAAGCT 


AAAAAGAAAA 


720 


AGCAGTCTAA 


AATAAAGCAT 


CATTTAATTT 


AAAAGATTTG 


ACGTAATATG 


ATGTTAGCGA 


780 


CATGGACAAT 


ATGATATTAC 


GTCATGCTTT 


TTTGTTTACT 


TAAATATAAG 


CTAAGTAATA 


840 


AGTTGATAAT 


TACTAACAAC 


AATAACTAGA 


TAGATAAGTG 


TAAATTTCTT 


GTAAACAGGT 


900 


ATATAATAGT 


ATGTAATTCT 


TTGAAGGAAG 


TAAGTGAGAA 


AGGAGTATGT 


TGATGGCTAA 


960 


ATATCGAGGG 


AAACCGTTTC 


AATTATATGT 


AAAGTTATCG 


TGTTCGACAA 


TGATGGCGTC 


1020 


AAGTATCATT 


TTAACGAATA 


TCTTGCCGTA 


CGATGCCCAA 


GCTGCATCTG 


AAAAGGATAC 


1080 


TGAAATTTCA 


AAAGAGATAT 


TATCTAAGCA 


AGATTTATTA GACAAAGTTG 


ACAAAGCAAT 


1140 


TCGTCAAATT 


GAGCAATTAA 


AACAGTTATC 


GGCTTCATCT 


AAAGCACATT 


ATAAAGCACA 


1200 


ACTAAATGAA 


GCGAAAACAG 


CATCGCAAAT 


AGATGAAATC 


ATAAAACGAG 


CTAATGAGTT 


1260 


GGATAGCAAA 


GAAAATAAAA 


GTTCTCACAC 


TGAAATGAAC 


GGTCAAAGTG 


ATATAGACAG 


1320 


TAAATTAGAT 


CAATTGCTTA 


AAGATTTAAA 


TGAGGTTTCT 


TCAAATGTTG 


ATAGGGGTCA 


1380 


ACAAAGTGGC 


GAGGACGATC 


TTAATGCAAT 


GAAAAATGAT 


ATGTCNCAAA 


CGGCTACAAC 


1440 


AAAATATGGA 


GAANAAGATG 


ATAAAAATGA 


TGAAGCAATG 


GTAAATAAGG 


CGTTAGAAGA 


1500 


ACTAGAACAT 


TTGAATCCAG 


CAAATACACC 


AATCCGAAAG 


ATGCATTGAA 


AGATGCATCC 


1560 


GAAAGATCCG 


GCAGTGTTCT 


ACCACAGAAT 


TATTATCTCC 


TGAATGTACG 


CTAAAACGCC 


1620 


ATATAATGAG 


GTCCCTGAGA 


CT 








1642 



(2) INFORMATION FOR SEQ ID NO: 942: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942: 

CTAGAAATAT AACTTATAAG TTCAACATGA ACTGGCACTT CTTTTAAACG TGCTCTATCT 60 

GATTTCAGCG CAGTAAAAGT TGCATCTATA AAATCTTGTT GATGGNTTAA ACGATATGTA 120 

GCAACAAAAG TATCTGTATC GGACTCAAAG TOTGGTAAAT GTGCAATTAC ACGCTTAATT 180 

TCGCCAGTAG AATCAGCAAT CGTATGTCCT GTAATATTTT CTAATTCTCT ACCTAATTCT 240 

GGCAAAGTAA TATCTNGACC CTCTAAAATT CTAANATCTT GTTCATGCAT GCCGTTTCTC 300 
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TCCTTTATTA 


ATTACGTTGG 


TCTTATTTCC 


ACCGTTCTGG 


TCCTCAATCA 


TGGAACACAT 


360 


TTTTAATTAA 


TACATACCCT 


NAAAATAAGG 


NTTNAAACGC 


NGCACCCNCT 


CATCTATAAA 


420 


CAAGTAAGCA 


ANACGANCAT 


GANTGGTGTC 


TGNCCACGCA 


GCTTTTTGAN 


CCTTATCTGT 


480 


TGATGTAGGN 


TTNTTGTGAT 


NATCTTTATC 


CGATGACTTA 


GTTTGATCCT 


NTTGATTGNT 


540 


TATCTTGTGA 


TGNCTCNTTA 


TCTTTATCTT 


TGTCTTTATT 


TTTGTCTTCG 


TCTTTTTTAT 


600 


CATCCTTGTT 


ATCTTCTTTT 


GAAGGGNTTA 


AATCTTTCAC 


NTTCTTTTTC 


AAGGTCTTTT 


660 


TGCTTATCTG 


CTAATTTTTC 


TTTTTCTTGT 


TGAAGCTTTT 


CTNTTTTGTC 


TNTTAATTCT 


720 


TTATTTTCTT 


TTTTGTAAAT 


TATCGATATC 


CTTTTCCAGA 


GTGGCTTTCT 


CTTGACTTTT 


780 


ACCACCAACC 


TGTTAAAAAT 


AGCATGACCG 


CCAGAATACT 


GATGATTAAT 


TTCTTCATAT 


840 


AAAACTCCTT AAACGTCATT 


AAATCTTTTT 


TAATTTTAAC 


ATGAATTAGC 


AACATAATCC 


900 


TAATTCTAAT 


CAATATCTTT 


ATACAATTTC 


ACATGACTCA 


CTTGATAACC 


TAAAGCTTTA 


960 


TTCAAAGATA 


TCATTGGCAA 


ATTANGTTTA 


TGAATTGTAC 


TGCTATTCGC 


TTTGCATTCA 


1020 


TAGTTTTTGC 


CCATTTTTCA 


AGCGCAATCT 


TCAGTTGCGT 


AGCTATTCCC 


AGTTTGCGAA 


1080 


ATTGTGGGTC 


AACATATAGC 


AGTCCATGTT 


ACCATACTTT 


TTCATTACTA 


AAATGTCCCC 


1140 


AAATAAACGC 


TATTAATTGA 


CCTTCATTTT 


CATATATATA 


AATTTTATCA 


TTTGTATGTT 


1200 


CTAATCTTGA 


ACATATCATC 


TCGTACCTAA 


GAGCAATACT 


CAGTTTAGTA 


CATTTATAGT 


1260 


TGGATTCTGC 


TTGCGACAAT 


AATGTCTCGT 


GTATATTAGC 


TATTTGCTTG 


ATATAATTAT 


1320 


GTTCATCTTT 


ATTAAGTGTC 


CTCATTTTCC 


TCTCCAATAT 


TAATCGTTAT 


ATTTATGCGT 


1380 


TTAATATGAT ATGATTGACT TTATTAATAC TATAAAATCA TTTCCATTTT AATTAGGAAT 


1440 


TAAACATTTT 


TAAAGGAGCG 


CTTAAGTAAC 


AATGAAAAGT 


ACTGCGCAAT 


TAACAAAAGA 


1500 


AAATAATGTT 


AAATCATTAA 


GATTAAGCAA 


TACTGATAGA 


GAAATCTTCG 


AGAATTATAT 


1560 


GACTTATATG 


CGCTCTGACT 


TTCGCGTCAA 


CCCTCATGAT 


ACAGAGTTAA 


TTATTAACCG 


1620 


TATACTCAAA 


CAATTATTAA 


GCGCAGAACA 


ACATGGTTTA 


TTAGCCTTAG 


ATTTTTTCAA 


1680 


TCACGACCCA AAAGCACATG 


CAATAAAAGA 


ACTAAAAGCG 


ATGCCTAATG 


AAACATTTAA 


1740 


AAATATTTTT 


AAATATATCT 


ATCAACACAT 


CGTTCTATTA 


ATTGGGATTG 


TTAGTTTTCT 


1800 


CAAAGGATTT 


TTAGGATTTT 


TTCATGGAAA 


AAAATGGAAG 


TAATTTATAT 


TTTGTTTCAT 


1860 


TCCCGTTTTC 


AGTAGTCGTT 


GGATTCTTTA 


TTGTCTTTTT 


ATTtATATGG 


GTTAGTTTTA 


1920 


AAACAATACA 


ACTGCAGTGT 


TTTAGCAATT 


CAAATTGGAT 


ATGGATATTC 


ACCTATTTAG 


1980 


CAATCCATTC 


TACTAATTGT 


ACGATTTTTC 


TATGTATTCT 


TTATTCCACA 


ATCATTACTT 


2040 


GCCTTTGGAC 


CATATATTCA 


GGTAAGCAAT 


TGGGTATTTA 


TTATATTTTC 


ATTTATAGTT 


2100 


ATGCCTATTG GATTACGCAT TGANAGAAAT ATTTCGAAAA AACATTCTCA TACATTTTTA 


2160 


TAAACAAAAA 


GCTTATTACA 


TCATATGCGA ATTTGTCTCG 


NGCC 




2204 



(2) INFORMATION FOR SEQ ID NO: 943: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 : 



GAAGANATAN 


CGTGGTNTTG 


AAGATTTGGG 


ANGTTAATAG 


ATNAAGGGGA 


ATATGCAN1 I 


bu 


ATTACCGGGA 


GNNNAGTATC 


CAAGNAAATC 


TAGAGTCAAT 


AGGTTAATAA 


TCTTATGCCT 


ion 


TTTTAATTTT 


GAATAAAAGT 


GGGGTGGTGT 


AATGTTTGGA 


TTTACCCANC 


CGACCCGAAC 


ion 


AAGATTGNCG 


TTTNACGCGA 


TTAGAAGNAA 


ATGATAAGCC 


TATGTTTGNN 


AAATTCGACA 


240 


GAATA6AAGA 


CAGTCTGAGA 


ACGCAAGAAA 


AAATTTATGA 


CAAGTTAGAT 


AGAAATTTCG 


300 


AAGAACTAAG 


GCACGAGCGA 


CAAGGTAGAA 


GATGAAAAGA 


ATAAAGAAAA 


GAATGCCAAA 


360 


AATATTAGAG 


ACATAAAAAT 


GTGGATTCTA 


GGATTAATAG 


GGACGATCTT 


CTAAGTACGA 


420 


TTTGTCATAG 


CCTTACTAAG 


AACTATTTTT 


GGTATTTAAA 


GGAGGTGATT 


ACCATGCTTA 


480 


AAGGGATTTT 


AGGATATAGC 


TTCTGGGCGT 


GCTTCTGGTT 


TGGTAAATGT 


AAATAACAGT 


540 


TAAGAGTCAG 


TGCTTCGGCA 


CTGGCTTTTT 


ATTTTGATTG 


AAATGAGGTG 


CATACATGGG 


600 


ATTACCTAAT 


CCGAAAAATA 


GAAAGCCCAC 


AGCTAGTGAA 


GTGGTTGAAT 


GGGCGTTATA 


660 


TATCGCTAAA 


AACAAAATAG 


CTATTGATGT 


ACCTGGTTCT 


GGAATGGGAG 


CACAATGCTG 


720 


GGAATTTACC 


TAATTATTTA 


CTCCGATAAA 


TATTGGGGAT 


TTAGAAC ATG 


GGGAAATGCT 


780 


GATGCTATGG 


CTCAGAAATC 


CAATTAATAG 


AGGTAGAGAT 


TTCAAGATAA 


TTAGAATCAA 


840 


AAGACTTGGT 


ACACAACCAG 


GCGACTGGGT 


TTTTGG 






876 



(2) INFORMATION FOR SEQ ID NO: 944: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944: 



AANATAAGTT 


AATATTGCCA 


GGGCCTTTAA 


CCCNTGCCNA 


TCAAGTCAGT 


TGTAACCACC 


60 


CTCTTTAAAA 


GCTTGAATTG 


TCGTCTTCAG 


TGATAGGATT 


TCCTGAAATA 


ACTTCATATT 


120 


TTTTAGAAAC 


ATGTTTGTTT 


GGACTCATTA 


ATTCATGATT 


AAAATCACCA 


TCATTCGTTA 


180 


TCAATAAAAG 


CCCTTCTGTA 


TCTTTATCAA 


GACGACCAAC 


CGGAAAAATA 


TTTAGATGTT 


240 


GGTATTCAGG 


TATTAAATCA 


ATAACGGTTT 


TTGAATGATG 


ATCTTCAGTT 


GCTGATATAT 


300 


AACCTTTTGG 


CTTATTTAAC 


ATAATATAGA 


CATTTTCAAT 


GTATTCTATT 


AATTCTCCAC 


360 


GAACTGTTAT 


CTTATCGTTT 


TCTGGTTCTA 


TATGTGTTTT 


TGGTGATTTA 


ATTACTTGTT 


420 


CGTTGACATT 


TACAAGGCCT 


TTTTTAAGTA 


ACTGTTTGAC 


CTCATTGCGT 


GTACCGACGC 


480 


CCATATTTGC 


TAAAAATTTA 


TCTATTCTCA 


TCGTAAAAAC 


CTAACTCTAC 


GTCTTAATTT 


540 


TTCAGGAATT 


TCACCTAAGA 


ATTCGTCCGC 


AAGACGCTCG 


TGCC 




584 



(2) INFORMATION FOR SEQ ID NO: 945: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945: 



AATATCATTA 


NTTTGTAGCC 


GGATTTGTTG 


TTACCCTTTA 


ATTTCCCANA 


AAACCTGGGT 


60 


ACAAGTAACC 


TTGATTAAAA 


AATCGTGGTT 


CAATATTTTC 


TTGTATTATG 


ATNATCTCCA 


120 


CAAAATTGAT 


ACGGTGATAT 


AACATCGTCC 


GGTAAAAANC 


CATTGTCATT 


GAGTNCAATT 


180 


GTTAAATCTT 


CTNCCAACTT 


ACTGGCTTTA 


AAAGACTCAT 


ATAACTTTCG 


TGANTGATCG 


240 


TTAAAGTAAT 


CAAATAATTT 


AATCATGTAG 


CACCTCTTGA 


NCTAATGTTT 


CCCATTTTAA 


300 


AATAATATCT 


TGAGTCATAA 


ATTGCTGTGC 


CACTTCATAA 


GAGATGTCAT 


GTGGTGCCTG 


360 


GGGACCATTG 


TTAAAATACA 


TTACAATGGC 


ATGAGCTAGT 


TTTGCGATAA 


CATCATCCAC 


420 


ACGATCTTCG 


NCGGTATCAA 


AAGGTACCAA 


GTAGCCATTT 


TCCCCATCTC 


GAATAAAGGT 


480 


TGGGTTACCA 


TAATTCACAT 


TTAATCCAAT 


CATACCTAGT 


CCTGAGNCGA CCGCTTCCAT 


540 


TAGTGTAANC 


CCAAAACCTT 


CGCTAGTTGA 


TGCAGNAAGA 


AATAACTCAT 


NATCAATATA 


600 


AATNTCATCC 


AAGGTTNACA 


TGCCCTAGTA 


NACGGAATAT 


AATCTTGTGC 


GCGGGGTGTA 


660 


TCAATNATTT 


TACGCAGTCG 


CGTCTTCTCG 


CTTACCTTCT 


CCATACATAC 


CAAATGTTAA 


720 


TTCTGGCACT 


TGTCGTTTAG 


CCGCGATAAC 


CGCAGTTTGA 


CAAGCCAATC 


AATATGTTTC 


780 


TCANTTGCTA 


AACGAGATGC 


NCTAATCATC 


GCATATGGCT 


TTCTTGATNA 


TNTAGGACAT 


840 


GATNACGCAT 


CAATGCTTCC 


CACCGGAATA 


GTATAGACAC 


GTGGACGATA 


ACCTTGATAT 


900 


TGCTCNAATT 


GTCGACCAAC 


CANATGATTT 


TGAATATCTG 


TTGCTGTAAT 


AAAGAAATCA 


960 


ATGTATTTAG 


CTTTTGAAAA 


TTGATATTCA 


TAATAATTGT 


TCCATAGTAT 


ATGCTGCTCG 


1020 


CTCATCATAT 


TATTACTATA 


ATGATCAGCA 


TGAATCACAA 


CACCAACTTT 


ACTATCACCT 


1080 


TTATGCTGCA 


AAACAGCCTG 


NCCAATATCA 


GAAGCGCGGT 


CTAATATGAC 


AATATCGTCT 


1140 


CGGGNTAAAT 


TCAATCGTTG 


TAAAAAGTAT 


GCAATAAATT 


CCGTTTTGTT 


ATACAACACC 


1200 


GCATCTTCAA 


ACACATNTAT 


AGAGCGGTCT 


CCATCAATAT 


ATTCGTTATA 


AGCGATGGAA 


1260 


CCNTCTTCAT 


TATAGAATTG 


TCGCATATAT 


AATTTCGCTT 


TATTATCAGC 


TGGTGCATAA 


1320 


TACTCAGAAA 


ATATACGCGT 


ATAACTATAA 


AAATCTTTAC 


GTACTAACAT 


ACTATTAATT 


1380 


ACAAATTCTG 


CACGATCCAC 


AACATCTTTG 


TGNTCATTNN 


GNAGATAACA 


TGTTACAAAT 


1440 


GATGATTTCC 


CATTAAAATA 


TAGACGGACT 


ATCTTACCAT 




AAAACTAATT 


1500 


TCATGACCAA 


GCTCACGTTC 


AATGTCATCT 


AACGTGTACG 


TTGTTGGTGC 


TATAGAAATA 


1560 


TCACTAAAAT 


ACTGATACAA 


CCAAATAACT 


TCTTGATCTT 


TTAAACCCAA 


TGTTGNGCGT 


1620 


TAATGTCTGT 


ATGTTCTCTG 


ACTGTACAAA 


ATCGAAAAAC 


ACAAATTTAG 


TGTCTGGATT 


1680 


NGTACGTCTC 


AATAATTNAG 


NACGGNAAGN 


TTGTGCATAT 


TCAACACCGC 


TACTCGCCCA 


1740 


GCCTATACCN 


AAGNGTATAT 


TATATATTGT 


CATGCGCTAC 


CCCTNGGCAT 


TTATGGAAAA 


1800 


TGTATAACTG 


GCATACCCTC 


TTTATCAAAT 


GTAATCATGC 


TTTGACAAAT 


AG 


1852 



(2) INFORMATION FOR SEQ ID NO: 946: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946: 



AANCACCAAT 


CCNATTGGGA 


GGNAATCCAA 


ATCAATNCCC 


GGANNCCCAA 


TCCAAGTTAA 


60 


TTAAGTCCAA 


GGTTTTGGAA 


CATTACCAAA 


TATGATTCCG 


ATGAGGTCAA 


ATGNCAANCG 


120 


GTGTTAATAA 


ACTACGAAAT 


GNTGTGNAAA 


TGATAGTAGA 


NCAAGTTGCG 


CATACAGTNT 


180 


CTCNATTATA 


TGATGCTTTA 


GAATCGAATG 


AGCAACAACA 


GCGCAGTTAC 


AATCAATAAT 


240 


TTGTAACTAG 


AAGATAATAA 


AGAGAACGCT 


CTATAGAGAC 


GAATTGAAGG 


TTTGATTTTA 


300 


ATGTCTGTTA 


GTAAGAATCA 


TATCAATGAG 


ATGCCTATAG 


TACTCAGATT 


ATATTAAATT 


360 


AAAACCGTCA 


TTAATTGTTT 


TTTTAGAAAA 


CATATAGTAT 


CATTTTAAAT 


GTAGTTGACA 


420 


TACTACGTAC 


TCAAATAATC 


TATAACAATT 


TCATATATAA 


TTCTTTCGGG 


GCAGGGTGAA 


480 


ATTCCCAACC 


GGCAGTAAAT 


AAAGCCTGCG 


ACCTGCTAAT 


ATGTTTCATA 


TTAGTGGCTG 


540 


ATCTAGTGAG 


ATTCTAGAGC 


CGACAGTTAA AGTCTGGATG 


GGAGAAAGAA 


TGTTAATTAT 


600 


CGACAAAGAT 


AATGTAGCGT 


ATTTGTAAAA 


ATGTGTACAA 


ATAGGCTTAT 


TTAACGATAA 


660 


ATTTTTCTCC 


TTTGCATCTT 


AATTCATGAT 


GTGAGGATTT 


TTTGTTTATA 


GAGGTGATCA 


720 


TTTGAGTCAA 


TTTATGGATT 


ATGCGATTCA 


ACTTCCAAAT 


ATGGTACAAG 


GTCANACAGG 


780 


TGTTAATCCA 


CCCGTTGGCG 


CTGTTGTAGT 


TAATGAAGGT 


AGGATTGTTG 


GTATTGGTGC 


840 


ACACTTGAGA 


AAAGGTGACA 


AGCATGCGGA 


GGTTCAAGCA 


CTTGATATGG 


CACAACANAA 


900 


TGCTGAAGGT 


GCGACGATTT 


ATATTACGTT 


AGAGCCATGT 


AGTCATTTTG 


GTTCAACACC 


960 


ACCCTGTGTT 


AACAAAATTA 


TTGATTGTAA 


GATAGCANAA 


GTAGTATTAC 


NCAACANAAG 


1020 


ACAATTCCGT 


TAGACACACA 


TGGGTGATGA 


GACGTTACGG 


GGCTCCACGG 


TATTTGAGGG 


1080 


TTGAATTGCG 


TTGGATGATG 


AACGGGCATC 


ACAATTATAC 


CAAAGACTTT 


TTTTAAAGCA 


1140 


AAAAGCAAAG 


CAACTTGCCA 


CAAAATTACA 


GTGAAAGTNT 


CTTGAAAGTT 


TAGATGGGTA 


1200 


AACAAAGCGA 


ATTGATAATG 


GACAAAGTCA 


ATGGATTACT 


AACAAAGAGG 


TTAAACAAGA 


1260 


TGTCTATAG 












1269 



(2) INFORMATION FOR SEQ ID NO: 947: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947: 



TTTATGGGTC 


AAGATTGATC 


CCAAAAGAAG 


TGGGATTNAA 


CATTAGGACT 


TTAAAAGACG 


60 


TCCCAAGAAG 


ACAAATATCA 


AGGGCCCGAT 


GGTATTCCAA 


TTTGAAAAAA 


GCTGGGCCAC 


120 


ATCGCGTTAA 


TCGGGAAGTC 


CAGGATATGG 


GAAGACCAAC 


GTTCTTACAC 


AACATTATTT 


180 


TCGATGTTGC 


AAGACACCAT 


CGTCCTGATC 


AAGCACACAT 


GTACTTGTTC 


GATTTCGGTA 


240 


CCAATGGTTT 


GATGCCAGTC 


ACAGATATAC 


CACATGTTGC 


TGATTATTTT 


ACAGAAGATC 


300 


CAAGAAGACC 


AAGATTGCTA 


AGGCGATACG 


TATATTTAAT 


GATGAAATCN 


GGATCGTCGT 


360 


AAGAAGATTT 


TAAAGTCAGT 


ATCGTGTTAC 


TAGTATTTCT 


GGAAATATCG 


GAAAATTAAC 


420 


CTGGTGAAAC 


CAATCCCGTA 


TGTCTTTAAT 


CCTTATTGAT 


AACTTTGACG 


CAGTAAAANG 


480 


ATTCACCTTT 


CCAAGAAGTT 


TTTGAAAATA 


TGATGATTAA 


AATGACACGT 


GAAGGGCTAG 


540 


CATTAGACAT 


GCAAGTAACC 


TTAACTGCTT 


CAAGAGCTAA 


CGCAATGAAA 


ACGCCAATGT 


600 


ACATTAATAT 


GAAAACGCGT 


ATCGCCATGT 


TTTTATATGA 


TAAATCAGAG 


GTGTCGAACG 


660 


TAGTAGGGCA 


ACNANAATNT 


GCGGTTAAAG 


ATGTAGTGGG 


TCGAGCATTG 


TTAAGTAGTG 


720 


ATGACAATGT 


ATCATTCCAT 


ATTGGCCAAC 


CATTTAAACA 


TGATGAGACT 


AAATCATATA 


780 


ATGATCACAA 


TTAATTGATG 


AAGTATCGGC 


GATGACAGAA 


TTTTATAAAG 


GGTGAAACAC 


840 


CAAATGATAT 


TCCTATGATG 


CCAGATGAAA 


TTAAAATATG 


AAGATTACAA 


GAGAATCATT 


900 


AAGCTTACCA 


GATATAGTTC 


CCAATGGTCC 


TTTACCAATT 


GGATTAGATT 


ATNGAAGGTT 


960 


GTTACACTAC 


AANAAATTAA 


ATTAACTGAA 


CCAGCAATGA 


TTTCATCAGA 


AAATCCGAGA 


1020 


GAAATTGCGC 


ATATTGCCGA 


AATTATGATG 


AAAGAAATTG 


ACATATTAAA 


TGAAAAATAT 


1080 


GCGATTTGTA 


TCGCAGACTC 


AAGTGGAGAG 


TTTAAAGCTT 


ATAGGCATCA 


AGTGGCTAAC 


1140 


TTTGCCGAAG 


AAAGAGAAGA 


CATTAAAGCG 


ATTCATCAAC 


TAATGATTGA 


AGACTTAAAA 


1200 


CAAAGAGAAA 


TGGACGGTCC 


ATTTGAAAAA 


GATTCACTTT 


ACATTATCAA 


TGATTTTAAA 


1260 


ACATATATTG 


ATTGCACGTA 


TATCCCGGAA 


GATGATGTTA 


AAAAGCTTAT 


TACAAAAGGA 


1320 


CCAGAACTTG 


GCTTGAACAT 


TTTATTTGTC 


GGCATTCATA 


AAGAATTAAT 


AGATGCTTAT 


1380 


GATAAACAGA 


TTGATGTTGC 


ACGTAAAATG 


ATTAACCAAT 


TTAGTATAGG 


TATTCCGTAT 


1440 


TTCAGACCAA 


CAATTCTTTA 


AATTTAGATT 


TATTCAACGA 


GAACCTGTTA 


TTAAAGAAAA 


1500 


CGAAGCGTAT 


ATGGTCGCGA 


ATCAAGCTTA 


TCAAAAGATT 


AGATGGTTTA 


AATAGTAATG 


1560 


AATTAAATAG 


GAGGGAGGTA 


TGTTATGAAT 


TTTAATGATA 


TTGAAACAAT 


GGGCAAGTCG 


1620 


AAATTTAAAG 


ATATTAAAAA 


GCATGCTGAA 


GAGATTGCGC 


ATGAAATTGA 


AGTTCGTTCT 


1680 


GGATATTTAA 


GAAAAGCTGA 


ACAATATAAG 


CGATTAGAAT 


TTAATTTAAG 


TATTGCACTA 


1740 


NATGATGTTG 


AAAGCACAGC 


AAAGGACGTA 


CAAACTGCAA 


AATCTAGTGC 


TAATAAAGAC 


1800 


AGTGTATCTG 


TTAAAGGAAA 


AGCGCCCAAT 


ACGTTATATA 


TCGAAAAAAG 


AAATTTGATG 


1860 


AAACAAAAGC 


TTGAAATGTT 


GGGTGAAGAT 


ATCGATNAAA 


ATAAAGAATC 


CCTCCAAAAA 


1920 


GCTAAGGGAA 


TTGCTGGCGA 


AAAGGCAAGT 


GAATATTT 






1958 



(2) INFORMATION FOR SEQ ID NO: 948 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948: 



CCAAATGTGA 


ATGCCAACAA 


CAGTAGACCA 


AATTATCAGG 


GAATGATGGT 


GCCATTCCAC 


60 


CAACCAATAG 


CCATTATAGA 


TGCCGAAATC 


AAAATTGGTT 


TAGAAAGCGA 


AGATTTAGAA 


120 


ATACTGGCAA 


CTAGTAAAAG 


ACGTTGCTAA 


AAGTATCTAA 


GAAGGGATTT 


AAGCAGAAGT 


180 


TGTTGCGATG 


AAGTGTGTTG 


GTGCTACTAC 


TGTAGCGACG 


ACGATGATAT 


GTGCTCCAAT 


240 


GGCTGGTATT 


CAATTTTTTG 


TTACAGGAGG 


TATTGGGGGC 


GTCCATAAAG 


GTGCAGAACA 


300 


TACGATGGAC 


ATTTCAGCAG 


ACTTAGAAGA 


ACTGTCTAAA 


ACAAATGTCA 


CTGTTATCTG 


360 


TGCAGGTGCC 


AAATCAATTT 


TAGACTTACC 


TAAGACGATG 


GAGTATTTAG 


AAACAAAAGG 


420 


CGTTCCAGTT 


ATTGGATATC 


AAACGAATGA 


ATTGCCAGCA 


TTCTTCACTC 


GCGAAAGCGG 


480 


TGTTAAGTTA ACAAGTTCGG 


TTGAAACGCC 


AGAACGACTT 


GCTGACATTC 


ATTTAACAAA 


540 


ACAGCAGTTA AATCTTGAAG 


GTGGCATTGT 


TGTTGCTAAT 


CCAATTCCAT 


ATGAGCATGC 


600 


CTTATCAAAA. GCATATATTG 


AGGCAATCAT 


AAATGAAGCT 


GTTGTTGAAG 


CGGAAAATCA 


660 


AGGTATTAAA 


GGTAAGGACG 


CCACACCGTT 


CTTGTTAGGG 


AAAATTGTAG 


AAAAAACGAA 


720 


TGGTAAAAGT 


TTAGCAGCAA 


ATATAAAACT 


TGTTGAAAAC 


AATGCGGCGT 


TGGGTGCTAA 


780 


AATTGCTGTC 


GCTGTTAATA 


AATTATTGTA 


GGTGATGATA 


CATGAATATT 


TTATTCGCTA 


840 


TCACAGGGAT 


AGCATTTGCA 


CTATTTGTTG 


CGTTTTTATT 


CAAGTTTTGA 


TCCGTAAAAA 


900 


CATAGACTTC 


CAAAAAAACG 


TTAATAATGA 


TATTTATTCC 


AGTTGATTGA 


ATGTTAGTTT 


960 


ATTTATGAAT 


GAAACACAAC 


GAATGGTTT 








989 



(2) INFORMATION FOR SEQ ID NO: 949: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949: 

CNTTATTGGA TNATNNCGCA GAATCCCGTT ATCAAGATTT GTTGGATAAA TATCCCCCAN 60 

NCTTTTCNNA ATAATNAAGC CNCNAANCNA GAGTTAGAAG ATTTAGANTC AGCAGACCCA 120 

AGCATTGCTG CAGCGTTTAG ATTTAATGAA ATTCCAATTA GAAGAATTGT CTGAAGCACA 180 

TCTAAAAGAA GGCGAAATAG AGCAACTTGA AATAGATATT AAACGCATTC ACAACTCTGA 240 

AAAATTAAGC TTAGCGCTTA ATAACGCACA TATGACATTA ACGGATGAAA ATGCTATCAC 300 

TGATCGTTTG TATGAACTAA GCAACCATTT GTTAACAATA AACGATATAG TTCCAAATAA 360 

ATATGACAAA TTAAAAGAAG ATATTGATCA ATTTTATTAC ATTTTAGAAG ATGCAAAACA 420 
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TGAGTTNTAT 


GATGAAATGG 


CTANTACAGA ATTCGACGAA 


CAAGTGTTAA 


ACGAGTATGA 


480 


ATCTCGTATG AATTTGCTGA ATAATTTAAA ACGTAAATAT 


GGAAAAGATA 


TTTCAGAATT 


540 


AATCGCATAT 


CAAGAAAAGC 


TTAATAATGA 


AATCAATAAA 


ATTGAAAACT 


ATGAACAAAG 


600 


TACATCGCAG 


TTACGAGAAG 


AAATAAATGC 


ATTGTATAAT 


CAAGTTATAG 


AGGTTGGACA 


660 


AGCGTTATCA 


AAGCAGCGTC 


GTATAGTCGC 


TAGGGAGTTA 


AGAGACCACA 


TTGTATCGGA 


720 


AATTCAAAAC 


TTACAAATGA 


AAGACGCAAA 


TCTTGAGATT 


TCATTTAAAA 


AATTAGAAGA 


780 


ACCGAATATT 


GATGGAATCG 


AATTTGTAGA 


ATTTTTANTC 


ACGCCAAATA 


AAGGGGGAAC 


840 


CATNAAAAAG 


TTTANATAAA 


ATNGCATCAG 


GTGGAGAACT 


NTCTAGAATT 


ATGT 


894 



(2) INFORMATION FOR SEQ ID NO: 950: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950: 



AAATAATCCG 


ANAATTTCAG 


GCGTTAAATA 


CAATATCGCG 


CCATAATAAG 


CTAGGTGTTG 


60 


ATGGTGCCGT 


ACCTTGTGCT 


GGCTTNTCCA 


ACACACTTTT 


TCACTTCATA 


CTGACGTCCA 


120 


TTTTTAGTTN 


ATGGAGTCAA 


TAATTCCATA 


NCGATGAGTA 


TCTGCTTCCG 


GAACTTCTTG 


180 


GACACCTATA 


ACTGAGTGCC 


CCGTTTNCTT 


CATAAAACGT 


CAATCCAACT 


GTTTCACTGC 


240 


TGGCACTTCA 


GANTCNACAA 


TATCGTCANC 


TAATAATACC 


GCACATGGTT 


CATTGCCGAT 


300 


AAATTGGCAC 


GAGCGCAGAA 


CTAATCGCAT 


GCCCTAAACC 


TTTTNGTTCT 


TTCTGTCTTA 


360 


CATAAAAAAT 


ATTCGCAAGT 


TCTGTCGAGT 


ATTGAACTTT 


TTCTAGTAAT 


TCAGATTTAC 


420 


CTTTTTCTTT 


TAACACCATT 


TCTAATTCTT 


TTTGACTATC 


AAAATGATCG 


GCAATCGCGC 


480 


GTTTGTGGCG 


ACCTGTCACT 


ATAATAATAT 


CTTCAATTCC 


AGCTCTTGCA 


GCTTCTTCAA 


540 


CGATATATTG 


TATTGTGGGT 


TTATCTAAGA 


TAGGAAGCAT 


TTCCTTTGGC 


ATCGCTTTAG 


600 


TTGCTGGTAA 


AAATCTAGTC 


CCTAAACCAG 


CAGCGGGAAT 


GATTGCCTTT 


TTTATTTTTN 


660 


ACAAAGTTAA 


TGTGCTCCTT 


TTCCCAAGTA 


TTAAATCTAT 


GTATCAACGT 


CATTTTAACA 


720 


CTAATTAGAA 


CGCCTACATA 


GTGTCATTGA 


GTATGTAATT 


ATCTCTTGGG 


AAATGTGGAT 


780 


TTAATTTTAA 


AAAAACAGGT 


TTACTACATA 


TTAATTTATG 


AAATAACCCC 


GTCAATTTGT 


840 


GTTTAATTAT 


GCTGTGATAT 


TCTTTTTATT 


TCTGCGTAAT 


AATACTAAAC 


CTAGAATGCT 


900 


GAATAATCCG 


CCGAACAACA 


TACCTTTGTN 


TGNNGATTCT 


TCTCCACCTG 


TTTCAGGTAG 


960 


NTCAGATTTC 


TTAGATTGTT 


TNTGNTTAGT 


TGGTGCCACT 


GCTTTAACCT 


TTTCATTGAT 


1020 


TTCAATAACA 


GGTGTTACTA 


CCTTACCTTG 


NNCCACTGGT 


TAAGAAGGTT 


TTATAGGTTC 


1080 


TTCTTNTAGC 


AGGGTGGGTA 


CTGGTTTACC 


AGGNNCAGAT 


GGGTACCTCT 


NGTGTTGG 


1138 



(2) INFORMATION FOR SEQ ID NO: 951: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951: 



TTATTGTTGT 


GTTTTGTGCG 


ATATACCAAC 


TTTTAGCTAT 


TAAAxjAATAI 




DU 


ATAATAAAAT 


TATGAATAAA 


TTATGTCATT 


/m^m* x « rum 7v 

GTGNAAATTA 


»tv"^ TV rp*T<TV TV rr»7v j\ 

TGATTAATAA 


AATCAA1 IL.T 


1 on 


AAAAAACCGC 


CCGCNATAAC 


CAATGATTTC 


ATGTAAACAT 


TTATTTTNAA 


AATTTTGATA 


1 Qf\ 

loU 


TTTTGTTCCA 


CATAATATTC 


GCAATTAAAC 


U'l'TTTT'l'UTA 


TAGAATTTTC 


TTTAATATCv- 




TGAGAGACAT 


GTNCTATAAT 


liTTTOTGANA 


TAATTCACAC 


AGTATAAAGG 


AGTGGNTGTA 


■a n n 
JUL; 


TATGTTAACT 


ATACCTGNAA 


AAGAAAATCG 


TGGATCGAAA 


GAACAAGAAG 


TGGCAATTAT 


360 


GATTGATGCT 


CTAGCTGACA 


CAGGGANAAA 


AGCATTAGAA 


GCATTATCTA 


AAAAGTCACA 


420 


AGAAGAAATT 


GATCATATTG 


TTCATCAAAT 


GAGCTTAGCA 


GCTGTTGATC 


AACATATGGT 


480 


GCTAGCACAA 


TTAGCACATG 


AAGAAACTGG 


AAGAGGTATA 


TACGAAGATA 


AAGCGATTAA 


540 


AAATTTATAC 


GCTTCTGAAT 


ATATATGGAA 


TNCAATAAAA 


GACAATAAGA 


CAGTAGGGAT 


600 


TATTGGTGAA 


GATAAAGAAA 


AAGGATTAAC 


GTATGTAGCG 


GAACCAATTG 


GTGTTATTTG 


660 


TGGTGTTACG 


CCAACAACAA 


ATCCTACGTC 


GACAACTATT 


TTTAAAGCGA 


TGATTGCAAT 


720 


TAAGACAGGA 


AATCCAATCA 


TTTGTGCATT 


CCATCCAAGT 


GCACAAGAAT 


CGTCGAAGCG 


780 


TGCAGCAGAA 


GTTGTATTAG 


AAGCGGCAAT 


GAAGGCAGGT 


GCACCTAAAG 


ATATTATTCA 


840 


GTGGATTGAA 


GTGCCTTCTA 


TCGAAGCAAC 


AAAACAATTA 


ATGAATCACA 


AAGGTATTGC 


900 


ATTAGTTCTA 


GCAACAGGTG 


GTTCGGGCAT 


GGTTAAGTCT 


GCATATTCAA 


CTGGCAACCG 


960 


GCATTAGGGG 


TGGCGGGGAC 


CAGGGGTAAC 


GGGGCCCGTC 


TTTNCAATTG 


ANAAAACAGC 


1020 


ACACATTAAN 


CGTGCAGNAA 


NTGATATCAA 


TGGGTCAAAC 


ACATTTGATA 


TGGTATTGAT 


1080 


TGGTGCTTCC 


TGACAAAGTT 


GGTGGCCATT 


GATAAAGAAT 


TTATAAGGTT 


GTTACTTATG 


1140 


ATTTTA 












1146 



(2) INFORMATION FOR SEQ ID NO: 952: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952: 
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TTTGTGCTTC 


GTCAATCTTG 


TCACCANCTA 


ACTCTAAACG 


CTTTCCTTTT 


GTCGTTTTCG 


60 


GGNTCCCACC 


ACAATTAATC 


TGTTTTGATT 


TCTGACATAT 


CAATCTTCCT 


AGTTAACCCA 


120 


AGCCAAATCT 


TTACCATACT 


CAATTTCTTT 


ACCTTGGAAC 


AAGCTGTTTT 


TCTTGTTGAG 


180 


TACCACATAT 


CTACCTTTGA 


CGGTATTAGA 


ACTAAGCTCT 


ATATAAAAAT 


CCAATGCCAT 


240 


TTTATAGGTT 


GTACATAATT 


GCTTTAAAAC 


TTCATATCTA 


GTNTGATAAG 


AAGTCCATGA 


300 


CGTAGTACGT 


AAGCCATCGT 


ATTCGGTNTG 


TTCAGAAACT 


TCCCAACCTG 


TATCGCTCAA 


360 


CACATCTTTC 


AATGCTTCTG 


AAGTTGTCTT 


TTTCTCAAAT 


TTGCCTGGTG 


CATACGGTTT 


420 


AGCTGTTGTT 


ATATCAGCAA 


GATAAGACGC 


TATACATTCT 


ATCTCTGTGT 


AGCCGTCCAT 


480 


CGTATCTTGA 


ACCCAGTTAA 


TAATAAATTC 


ACGCCATTGT 


TTGTTTGAAT 


CCCTTATAAT 


540 


AACACGATGT 


CGTTCACGGA 


ACNTTTCAGC 


TCTTTCTGAT 


GATATGAGCA 


GTTCAAGCAT 


600 


TTCTGAATTG 


TCATTAACAT 


TACGTTTATG 


AATCGCTCTA 


ACTAAGGAAG 


GATCATCAGT 


660 


AGAAAGGAAA 


TCTATAATCT 


TGTCGTTAAA 


ATCTAAAACA 


TGTATCACAC 


TCTCATCTCC 


720 


TTTCTATAAA 


TATCTATCTT 


GCCATTTAAC 


CGTCGTATCA 


AAGACGTTTT 


CAGGTTGTAT 


780 


GATTAATTCA 


CTGTACCCAG 


AATCAACATT 


GAAATAATTA 


CTTCCAAACG 


ATTTCTCGCT 


840 


CCAACATTGG 


TTCCTCATTT 


GATGACAACA 


CTTNNNGCTT 


GNATATCTAT 


TTTCACTAAA 


900 


TCACCTTTTT 


GTATAATGAC 


ATCCCTTGCG 


CCTTTCGGNT 


TCGGTAGAAT 


CTCCGTATTG 


960 


AATGAACCTA 


ATCCATTCAT 


CTCCATCCAC 


TTATAACCGT 


NATACTTCGC 


ACTATAGATA 


1020 


GCTATGATAG 


AAGCTGGACG 


CTGATAAAAC 


TTACCGCCAT 


CTATCCACTC 


TTTCTCATCC 


1080 


ATATCAATAG 


GTTTACGTCT 


ATCTGGGTCT 


TTAATGTGAT 


CAAATTTCCA 


AGTTTTAATA 


1140 


GAAAATTTAT 


TACCTACTCT 


TCTGAGCCGC 


ATATAAACAN 


CGATTCTGTC 


CAAGTTATAC 


1200 


ATTATCGGTT 


TATTCTGATA 


GTCGTATATC 


TTTTTGGGGT 


CTCCTTTTTG 


GTTATACG 


1258 



(2) INFORMATION FOR SEQ ID NO: 953: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953: 



TCGAATTTGG 


TGTTGNCCGT 


TCCTGTATNA 


CATTCTGNCT 


TATCTAATGG 


GGGACGTTAT 


60 


GATGAGTGCC 


CACNNATTAG 


GGATGATCGT 


NCGAGAGTAA 


GTGTTGGTGC 


AAGGTCAAGT 


120 


GTGTTCGCAC 


CTTTCCACAA 


ANTTAGGGTT 


AATCATCATT 


GATGAAGAAC 


ATGAATCTAC 


180 


ATATAAACAA 


GAAGATTATC 


CGAGATATCA 


CGCTAGAGAA 


ATTGCCCAAT 


GGCGAAGTGA 


240 


ATATCATCAC 


TGTCCAGTCA 


TTTTAGGAAG 


TGCAACACCA 


TGTCTTGAAA 


GTTATGCACG 


300 


AGCTGAAAAA 


GACGTTTATC 


ATTTGCTATC 


ATTACCAAAC 


AGAGTGAACC 


AACAAGCTTT 


360 


ACCTGAAATT 


GATATAGTAG 


ACATGCGTGA 


AGAATTGAGT 


GAAGGTAATC 


GGTCAATGTT 


420 


TTCAAAAGAT 


TTACGTGAAG 


CCATACAATT 


AAGATNAAGA 


TCGACAGGAA 


CAAGTTGTTT 


480 


TATTTTTAAA 


TCCGACGTGG 


TTATGCATCG 


TTTATGTTAT 


GTCGGGATTG 


TGGATATGTA 


540 
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CCGCAATGTC 


CAANCTGTUA 


lAl 1 Iv-AI 1A 


2k CfZT 2k TP IT A 
Av.ulAi\^ilA 


AAAPCSAPACiA 


PTTATTAA AA 


600 


TGTCACTATT 


GTGGTTACCA 


TiO TA P/5PP Zk 


fW12k ATP A 2kT 
V.CutAn J. v- nix 1 


f^TPPAAATTf: 


TRARARTGAA 


660 


CACATTCGaC 


AAGTAGGTAL, 


1V2PT 2k P TP A f2 


AAAriTTfiAAP. 


AAPTATTGCA 


APAAflAATTT 


720 




fVAT*?k »1wP2kf2 

UCAxAAx x Alj 


fiATrsriATnTA 




CAAAGAAAGG 


TGCACATGAA 


780 


AAGTTATTGA 


CTGAA1 ±%JwV 


A AA AfZf^TAAP 




TACT AGGTAC 


TCAGATGATT 


840 


GCliAAAVsVjAT 


T AG ATT ATv, U 


A A 2k T A TT A PT 
AAA X AX 1AV. 1 


*1 w i'Tkf ,: J | u l*P_pJTf^ 


THPTGAATTJC 

X VJ\— X X VJV^. 


NGATACAATG 
nun x n\.nn x v> 


900 


TTAAATTTAC 


CTGATTTTCG 




POT a n*r ATP 


A APT ATT A AP 


ry A Af^TnnPT 




GGTAGAfcjCTG 


GTCGTCATCjA 


AAAuuUVjuI 


PA Af2TPATPA 


TTPAAAPHTA 


TAATPPAftAT 
X t\f\ X VrUAuA X 


1 ft^n 

X uz u 


CATTATTCAA 


TATTGGATTG 


TTCAAAAAAA 


TGATTATTTA 


Ai- A 1 1 X 1 AIT, 


ulWiuuAAAi 


XUoU 


GGAATATCGT 


CAAATTAGGA 


AAu 1 A i L.L.Ak~ 


VjVj 1 AI 1 1AX i 


ATTTYSPTTAM 


TTTPAPA ATP 


1 1 a n 


TCACATAAAG 


AAATGAAGAA 


LAj 1 lAluuAA 


bL A 1\,V7*U A\j*— 


ATP.TTPATA A 


A UTTTT ATT 2V, 
rtn X X X XnX X/\ 


i *>nn 
xzuu 


CAGCATTTAC 


AAGAAAAAGC 


GCTTGGTACT 


A (VTYY* A TPP 


TPPraV AftP A 


PTTfiPft AftA A 


XZ DU 


TCAACAATGA 


ATTTAGATTC 


CCAAATTTTA 


f^*TV2k 7A7ATlkT2k 

kj X \jAAA 1 A 1 A 


A A irfC Jk app 
AAAVt IvjAAL L. 


TV2^iATTATTA 
xuuAl 1A1 XA 


i ion 


CAAGCCANTC 


AGTTTTTAGA 


TGACTATTAC 


CATGAAAAAT 


TTATAAAAGA 


AAAATTAGCA 


1380 


TTGAAGATTG 


ATATTGGANC 


CACAGATGAT 


GATGTAACAT 


TACTAATTAT 


TAGAAACAAG 


1440 


NNCAAGTATT 


GTACGAGTAT 


TTGAACCCAG 


TGTGTGAATA 


TTTACTTNAN 


GNACAAGAAA 


1500 


AAGGGCAGAA 


TATACAACTG 


TTAACTATTT 


AAATTAGCAG 


TTTATATTCT 


GTCTTTTTAT 


1560 


ATGGCTTTAT 


AACTTACGTG 


ATTTTGGTTT 


GATAAGGAAT 


TTATTAGTAT 


TTTCATTTAC 


1620 



(2) INFORMATION FOR SEQ ID NO: 954: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954; 



ATAAGTTTTT 


AAAATGTATT 


AAATTTGTGG 


AGGCATGTAA 


ACAATGAAAG 


TATTAAACTT 


60 


AGGATCGAAA 


AAACAAGCAT 


CATTCTATGT 


TGCATGTGAG 


TTATATAAAG 


AGATGGCATT 


120 


TAATCAGCAC 


TGTAAACTAG 


GTTTAGCAAC 


TGGTGGTACA 


ATGACAGATT 


TGTATGAACA 


180 


ACTTGTTAAG 


TTATTAAATA 


AAAATCAGTT 


AAACGTAGAC 


AATGTATCCA 


CGTTTAATTT 


240 


AGACGAATAT 


GTAGGTTTAA 


CCGCATCACA 


TCCGCAAAGT 


TATCACTATT 


ATATGGATGA 


300 


CATGCTTCNC 


AAACAATATC 


CTTATTTTAN 


TAGAAAGAAC 


ATTCATATTC 


CAAATGGAGA 


360 


TGCCGATGAT 


ATGAATGCGG 


AAGCGTCAAA 


ATATAATGAC 


GTTTTAGAAC 


AACAAGGTCA 


420 


ACGTGATATT 


CJvAATTTTAG 


GTATTGGTGA 


AAATGGTCAT 


ATTGGATTTA 


ATGAACCTGG 


480 


TACGCCGTTT 


GATAGCGTTA 


CTCATATCGT 


TGATTTGACT 


GAAAGTACTA 


TTAAGGCTAA 


540 


TAGTCGATAT 


TTTAAAAACG 


AAGATGATGT 


TCCAAAGCAA 


GCCATTTCGA 


TGGGACTTGC 


600 


TAATATTCTT 


CAAGCCAAAC 


GTATCATTTT 


ACTCGCATTT 


GGGTGAAAAG 


AAACGTGCTG 


660 


CTATTACACA 


TTTATTAAAT 


CAGGAAATTT 


CTTGTTGATG 


TCCCAGCCAC 


ATTACTTCAC 


720 
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AAACACCCGA ATGTNGAGAT ATTTTTTAGA CGACGAAGCT TGCCCGAAAA ATGTTGCGAA 780 

AATTCATGTC GATGAATGGA TTGATTGCAA TGTCTAATTA AGAAATGCCT CGGGAAAGGT 840 

TCCAATAGAA AGATAAAAAG CATGTGGAAG GATGATCTTT AGTGGCGAAT TACAATTAGC 900 

AATTGATTTA TTAAACAAAG AAGACGCTGC GGCTGACTTA ACTCATGAGA GTTTAAGATT 960 

ATGTAGATAT CGTAGAAATC GGTACGCCAA TCACTCACCA CGTGGTGCTA CTAGTAGTTA 1020 

AACATATGGC AGACAACCCT AGTAATGTCA AGTATTAGCA TACATGCAAA TTATGGATGC 1080 

AGCTGATTAT GTAGTTATCC TACCAATTAA ACTTGACGCT CGTGCC 112 6 

(2) INFORMATION FOR SEQ ID NO: 955: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955: 



AATTGGGCAC 


GGGTANACAG 


TTGTAGCGTT 


ATTAAGNATT 


TTAACTTTTG 


CGAAGTTAGC 


60 


AACAATTTNC 


CAATGATGTT 


ATTTTATCAA 


TGTTACCTCA 


AGCAGCAACT 


ACAANCGATT 


120 


GCGTTACCAG 


TATCAGCTGG 


TATCGGTGGT 


ATAAAAGAAT 


TAACATCATT 


AGCAGNTATT 


180 


TTAAATGGTG 


TCATTATTTA 


TGCCCTAGGT 


AATAAATTCN 


TGAAGCTTTT 


CCGAATTACT 


240 


AACCCTATTG 


CCCGAGGATT 


AGCACTTGGA 


ACNAGTGGTC 


ACACATTAGG 


TGTAGCACCA 


300 


GCCAAAGAAT 


TAGGACCTGT 


AGAAGAATCA 


ATGGCAAGTA 


TAGCTTTAGT 


GTTAGTTGGT 


360 


GTAGTTGTTG 


TAGCAGTTGT 


GCCTGTCTTT 


GTAGCAATAT 


TCTTCTAAAA 


CGAAAAACCT 


420 


AAGCAAGATA 


ATAGCAATTT 


GAGCCATTGT 


TATTATCGTA 


AAAAAACGTC 


TATACTCCAG 


480 


TTTATAACTG 


GGATATAGAC 


GTTTTTATGT 


ATTTATTACT 


TTTTACTAGG 


AATATAAAAC 


540 


TGTGCATGNC 


GATAATGAAA 


TACGATGTCA 


GACGAATCAA 


AGGGTTTGCC 


AGTCATTGTA 


600 


TAAAAAGTCT 


GGTGGTAACG 


TAAACATGGT 


TCACCTGTAG 


ACAATTGTAG 


TAATGAAGCT 


660 


TCACTTGAAG 


TGAGTTTATC 


TACATTAAAG 


AAAATATCTG 


AAAAACCAAT 


ACGAAGTTTC 


720 


ATGTTTGATT 


CTAAATAATC 


GAAGATAGAG 


CCCTTAGCAA 


TATCATCATT 


TAAATATTTC 


780 


ACGATTTCTT 


TATGATAATA 


AGAATATTCG 


ATACATAAAA 


CATCATCGTC 


CACGAATCTT 


840 


AATCGCTCTA 


AATAGTAGAC 


GGTATCACTC 


GTGCC 






875 



(2) INFORMATION FOR SEQ ID NO: 956 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956: 



Lnnl lULfthl 




TP AAATATTA 


CATCTGAAAG 


ACCTATCCCA 

• A u\*v*n 


CCTGCACCAA 


60 


(«AAv» X XAfUjL 






vjx inlAl wll 


ft X t%\3 X X wl> X X 


CAAATGTATA 


X. \J 


TCATAAAGAC 


ACTlaAAbAx I 


'POT* & T A TT 1 A 1*2 


1*1** I* A AAA A*TY* 


ul JAvlx v> Ion 


A A r* A A PPTY: A 
/iftL-/vrll~k_ IvjA 


i on 


AGGCGATCAA 


CGTGTCTATL 


CjU AI_-Lj lvAAL 


A 1 A 1 vjA 1 x x/V 


arr* Ar v TY3A r rn 


A A A WW A A A 
nAAlA. IVftAft 




AGTAAAACAA 


GCTTTTATTA 


ATGCAAATAtj 


Tv/^TV rTV~*rpTv. «nvn 

AGATvjTAATT 


ALGClluL Cvj 


A jv ppfrv X rpTv rn 

AAGCjTGATAT 


~i rt r\ 


TTCAGTTACA 


AATACACCTA 


ATGGTGCTAA 


TGTAAGTACT 


ATTACAGTAA 


TV rpfv rrvn iv xmiv tv 

ATATTAATAA 


"> C ft 


AGGTCGATTA 


% /v^ iv iv Mn^nfn 

ACGAAATCAT 


TCGCGTCTAA 


UV, 1 AAjC 1 AA I 


ATGAATTTCT 






TAATTTCCCA 


CAAGATTATA 


CAGTUAUAT« 


/-i TV /v A X TV^P TV 

UAUoAAlTjv_A 


AAA AHWIWll A 

AAAA1 IvjCAA 


AlJALaALAJAAl- 


ion 


AGATGGTGGT 


TTATCATGGT 


CCGATGACCA 


mjv 7v iv nv mimum > iv 
TAAATL 71 1 lA 


ATTTATCGTT 


ATGATGCTAC 


540 


ATTAGGCACA 


CAAATTACAA 


CTAATGATAT 


TTTAACGATG 


CTAAAAGCGA 


CTACTACAGT 


600 


GCCTGGATTG 


CGTAATAATA 


TTACTGGTAA 


TGAAAAAGCA 


CAAGCAGAAG 


CAGGTGGAAG 


660 


ACCAAACTAT 


AGAACAACTG 


GTTATTCACA 


ATCAAATGCG 


ACAACTGATG 


GTCAACGTCA 


720 


ATTTACGTTG 


AATGGTCAAG 


TGATTCAAAT 


ATTAGACATC 


ATCAACCCTT 


CAAACGGTTA 


780 


TGGTGGGCAA 


CCTGTTACAA 


ATTCAAATAC 


TCGTGCAAAC 


CATAGTAACT 


CAACTGTTGT 


840 


TAACGTAAAC 


GAACCGGCAG 


CTAATGGTGC 


TGGCGCATTT 


ACAATTGACC 


ACGTTGTAAA 


900 


AAGTAATTCT 


ACACATAATG 


CAAGTGATGC 


AGTTTAGAAA 


GCGCAGTTAG 


ANTTNACGCC 


960 


ATATGGTCCC 


NNACAATATG 


TTGAACATNT 


AAATCCAAAT 


TACAGGAAAT 


ACTNCTGACG 


1020 


CTATTAACAT 


TTATTTTGTA 


CCAAGTGACT 


TAGTGAATCC 


AACAATTTCT 


GTAGGTAATT 


1080 


ACACTAATCA 


TCAAGTGTTC 


TCAGGTGAAA 


CATTTACAAA 


TACGATTACA 


GCGAATGATA 


1140 


ACTTTGGTGT 


GCAATCGGTA 


ACTGTACCAA 


ATACATCACA 


AATTACA 




1187 



(2) INFORMATION FOR SEQ ID NO: 957: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957: 

CGATACCGNC AGTTGCCCCA ATACCAATAC TTTGATGTCC GAATAAANCG CTNNCCGAGC 60 

GNNTGTCGGG GATAATTTCT GAGTAAAATA CCCNCATGCT GGGGGGATAN CGANANAAAN 120 

CCCTTCAGGG TGCCCCCATA CCCAGAAGAA GTTAGCCCAC AGCATTGGCA TACCGCCATG 180 

TGCACCTGTG TAGAATGCTG TGTCAAATAT TCTATCAGTT GTCATTAATG CTAACGCTNC 240 

TGTTAAAGGA GGGAAAGCAA GAATAACAAT TAATGTAGTA ATAAATGTTG TTACTGTAAA 300 
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CATTGNCATT 


TGCATAAACT 


TCATAGNTGG 


TGTTTTACAT 


CTTAAAATTG 


TTACACAGAA 


360 


GTTGATACCT 


GTAGCTAAGG 


TACCAAGCCC 


TGAAATTTGT 


ATAGCTATTA 


AGTAATAGTT 


420 


AACACCCGGA 


CCAGGACTGA 


ATTCACCTGC 


TAGTGGCGCA 


TAGTTTGTCC 


AACCAGCTGC 


480 


TGGTGAACCA 


CCAATAATAA 


ATGACAGGTT 


GAATAAAATC 


ATACCTGCAA AGAATAGCCA 


540 


GAAACTTACG 


TTGTTTAATA 


CTGGGAATGC 


AACATCACGT 


GCTCCAATTT 


GTAATGGAAC 


600 


AACGATATTC 


CATAAACCAA 


AGATAAATGG 


CATTGCCATG 


AAGATAATCA 


TGATTACACC 


660 


ATGTGTACTA 


AAAATTTCGC 


TATAGTGGTT 


AGATTCTAAA 


AATTTGTTAT 


CAGGTACTGT 


720 


TAATTGCGCA 


CGAATAAGTA 


ACGCATCAAT 


ACCACCACGG 


ACGGAACATT 


AATACGGCAC 


780 


AGGATTAAAT 


ACATAATCAC 


CGATTTTTCT 


TTATGGGTCT 


TACAGATGGT 


GAACCCCACT 


840 


CTTTGTAAAG 


ATATTTCCAT 


AATTTAAAGT 


AAGTAATTAC 


TGCGATTAAA 


CCAATAACTA 


900 


AGAATGGGGC 


ACCAATTTGT 


GCCATTGTAA 


TCATCCAGTT 


ACCTTTAACT 


AGTAATTGAT 


960 


CCCATGGAAA 


ATTCATTAAT 


GTCCACCTCC 


ATGATCATCA 


TTGTCTTGAT 


CTTGCGCATC 


1020 


TTTTGAAATT 


TTCTTCATTT 


CTTTCGCATT 


TTTCGATTCA 


TCTTTCTTGA 


ACTCATTGTT 


1080 


ATATGGTTCG 


TCATTTCCAA 


GAATCATCAA 


CTTCATACCA 


TGTCGTTTAT 


AGTTCGCATT 


1140 


TGTAATTTGA 


GCTCGTGCC 










1159 



(2) INFORMATION FOR SEQ ID NO: 958: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1090 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958: 



TGTCATTTTG 


ATTAAAGTGG 


CACGAGCGTA 


GTCTCGTTGC 


ACAACAGTCA 


ACGATTTCAA 


60 


TGTTATCTTT 


GCCTCCCAAA 


CCAGCAACAA 


TAGTTTGTGC 


TCTTTCAGTA 


GCCTCAACTT 


120 


GTTGTGCTGC 


AGCTTTATCT 


TCTCGACCAG 


GTGTTTTGAA 


ATTAAATTTC 


GTAATTAAGA 


180 


ATCTGAAAAC 


GATGTAATAC 


ACACAGAACC 


ACACAATTCC 


AATAGGTATG 


ACGTATAGGT 


240 


AGTTTGTTTT 


ACTATTACCT 


TGTAGCACAC 


CAAAGAGTAA 


GAAATCGATA 


AAGCCTCCAC 


300 


TGAAGGTTTG 


ACCAATTGTA 


ATGTTGAAAA 


TGTCTGCCAT 


CATAAATGCT 


AATCCATCAA 


360 


AGAAGGCATG 


GATTACATAA 


AGAATAGGTG 


CGACAAACAA 


GAAACTAAAC 


TCTAAAGGGT 


420 


TCGGGTAATA 


CCTGTTTAAA 


AATGAAGTGA 


GTGCAGCGGG 


ATAACATTAA 


ACCGCCGACA 


480 


ACTTNTATAT 


GTTCAGGTTT 


AGCTGTGTGA 


TAAATTGCAA 


GTGCGGNACC 


ACATAAGACC 


540 


GAACATCATC 


CGATAATAAA 


ACGGCCTGAC 


ATAAAGCGTG 


ACACACCTGA 


ATAATACTTC 


600 


GTCACATCTG 


GATCACCAAG 


TTGAGCAAAG 


AAGATGTTCT 


GCGTACCTTG 


AACTAAGTGC 


660 


CCTTTGACTG 


NCTAACGAGT 


ACCACCAAGT 


GCCGTCTGCC 


AAAACGGTAA 


GTAAAAAATA 


720 


TGGTGTAAAC 


CGAGTGGACC 


TAACAATCTT 


AAGATGAAGC 


CATAAACAAA 


AGTACCGATG 


780 


GCACCTGTTT 


TCGTTACAAA 


TCCACCAACA 


TGATAAATGC 


CGGCTTGTAT 


GCTTGGCCAA 


840 


ATGAAAAACA TCAATACACC 


TAAAAAGATT 


GCGGCAAATG 


CTGTGACAAT 


AGGGACAAAT 


900 
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CTANAGCCAC CAAAGAAACC TAAATACGGT GGTAATACCA CTTTGTGATA TTTGTTGTGA 960 

AGTATTGCGG TCATAATACC TGTGATAATC CCGCCAAAAA CACCGGTTTC AACCGTTTGT 1020 

ATACCGAGCA CCATGCCTTG TCCATTTTGT TGCAAGCTGA TCTTTTGCCA ATGTTGCCCG 1080 

TGATAGTTAA 1090 

(2) INFORMATION FOR SEQ ID NO: 959; 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959: 



AATTTACACT 


AAACTATTCA 


TTCCTAACAC 


TTCANCGAAA 


AGTTTGAAAA 


TCCAANACGT 


60 


TNACCAATTG 


GTGCTAANAC 


AAATGGAATA 


AAGAACACAG 


GATTCAATAC 


TAGTGNTGCA 


120 


CCAAATAAAA 


TTGGTTCGNT 


TACACCAAAG 


GAATGTTGGT 


ACANCTGGAT 


GCACGTCCAA 


180 


TCGCTTTGTT 


TCGTTTAGAT 


TTCGTCATCC 


AACATAAACA 


TGAACGGGNC 


GACCAATGTT 


240 


GCACCCGTAC 


CTCCAAAAGT 


AACGATAAAC 


ATTTGTGTAC 


CTGATGTAAT 


AATTTTATCT 


300 


GCGTGTTCTC 


CAGCTTGAAG 


CAACTTGAAG 


TTCGCTTCGA 


TATTCGCATA 


TGTAATGGCT 


360 


GCAATTGCTG 


GCTCTACTAT 


TGATGGACCA 


TGAATTCCTA 


CAAACCAGAA 


TAATGCAAAG 


420 


GCACCAAAGA 


TAATTGTGAC 


ACCAATCCAT 


CCATCTGCTG 


CTGTAAATAA 


TGGTTCGAAT 


480 


AATTTTAAAA 


TACCTTCCGC 


TACATTTGAT 


TTAAAGCTGT 


TGCGAATGAC 


TAAATCTAAT 


540 


GCATAAAGAA 


TGATGATTAC 


CGCTGAAAAT 


GGAATTAAGT 


CCTTAAATAC 


TTGTGAAATA 


600 


TTCGGCGGTA 


CTTCTTTCGG 


CATTTTAATT 


GTAATATTGC 


GCTTCACACA 


GAAGTTATAA 


660 


ACAATAACAG 


TTACAAATGC 


TGATAAAAAT 


GCTGTTAATA 


AACCTTTAGT 


TCCCATAAAT 


720 


GCACTTAAGA 


ATCCACCATC 




TCAGAAGCTA 


AGAACAAGAA 


TCCACACATC 


780 


GNTGCTAGCA 


TTGTAGAAAT 


AAAGTTAATT 


TGATTTGTAC 


TTTCTAGCTT 


ACGGTTAAAT 


840 


GAATCTTGTT 


AAAGATTTCG 




TGCTACTAAA 


AATGCTACAA 


GCCCCATCGG 


900 


ATAGTTATAT 


GGGTTCATTA 


AAATGGCTTC 


CATGCCTTTA 


TCCCATTTAA 


AACCAAATAT 


960 


ATTGGGCACG 


TATGCAATTA 


ATAGAAAGAT 


ACTTGAGAAT 


TAGATGACAG 


GCATTGCAGA 


1020 


AATAAATCCA 


TCACGGATNG 


CTCTTAAATA 


TTATGTTACG 


GTGATAATTT 


CTCGGAAAAA 


1080 


TGGCTTCCCT 


TTTTCAATTT 


GTGCGATCAA 


TTTTTGCATC 


ATTGTCATCA QCCTCTTTTA 


1140 


TAAAATTCTA 


ATAAATGCTT 


CATTAAATCT 


TTCAGTAAAA 


TTGTTGTCCA 


TTAAATGGTC 


1200 


CTTGAACATG 


CATCATCCGT 


TACACTGTTA 


TGCAATATCA TCACCTTGCG 


CTTCTTTAGC 


1260 


TAACAGCTTC 












1270 



(2) INFORMATION FOR SEQ ID NO: 960: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:960: 



CAACCTGATT 


TTGATGGAAG 


AAATTACCTT 


GATAAATNAA 


AGCCTTCCCN 


NCNANGGAAG 


60 


AAACCAATGA 


TGAAAATAAT 


TGGTTCTGCT 


TCTAAGTCAT GCGAATGATA ATAAATATCG 


120 


ATGATTCTAC 


AAGACTCTAA 


TATTGAAAAT 


GAGGATTATA 


GATATAATCA AAGAAATTGN 


180 


CGACCCAAAA 


TGAATCGGAA 


TGGAATTNCA 


AGTCGGCANC 


CGGACCAACC 


TCAATCAGCT 


240 


CCTAAAGAAC 


AAAATAGCGA 


CTCGAATGAT 


GAGGAAACAG 


TAACGANAAA AGAACGNAAA 


300 


AGTAAAGTAA 


CACAATTAAA 


GCCATTAACA 


CTTGAAGANA 


AGCGGAAGTT 


AAGACGTAAG 


360 


CGACAAAAAC 


GAATCCAATA 


CAGTGTTATT 


ACAATATTAG 


TATTGTTGAT 


TGCTGTTATA 


420 


TTAATTTACA 


TGTTTTCACC 


ACTTAGTAAA 


ATTGCCGCAT 


GTAAATATAA 


ATGGAAATAA 


480 


TCACGTTAGT 


ACTTCAAAGA 


TAAACAAAGT 


TTTAGGTGTT 


ANAAATGATT 


CGAGGATGTA 


540 


TACGTTTAGT 


AAANAAAATG 


CTATTANTGA 


TCTCGAAGAG 


GATCCATTAC 


TCNCAAGTGT 


600 


TGAGATACAC 


AAGCAATTAC 


CANACACATT 


AAACGTAGAT 


ATCACAGAAA 


ATGAAATTAT 


660 


TGCTTTAAGT 


GAAAAAATAA 


NGGTAAATAT 


TTACCTTTAT 


TAAGAANATG 


GTAAATTGCT 


720 


TNANGGTTCA 


AATGATGTCN 


CAATTAATGA 


TGCACCTGTC 


ATGGATGGTT 


TCAAAGGTAC 


780 


ACAAGAAGAT 


GATATGATTA 


AGGCGTTATC 


TGAAATGACA 


CCTGAAGTTA 


GNCGATATAT 


840 


TGCCGAAGTG 


ACATACGCCC 


CAAGTAAAAC 


CAAACATAGC 


AGAATTGAAT 


TGTTTNCGAC 


900 


AGATGGACTT 


CAAGTAATCG 


GTGATATTTC 


GACGATATCT 


AAGAAAATGA 


AATATTATCC 


960 


GCAGATGTCA 


CAATCATTAT 


CAAGGGATAG 


GTCGGGTAAA 


CTAAAAACAA 


GAGGCTATAT 


1020 


TGATTTATCA 


GTCGGTGCTT 


CATTTATCCC 


ATNCCGTGGA 


AACACGTCTA 


GTCAATCAGA 


1080 


AAGCGATAAA 


AATGTGACTA 


AATCATCTCA 


AGAGGAAAAT 


CAANCAANAG 


AAGAATNACA 


1140 


AAGCGTTTTA 


ANCAAAANTA 


NCAAACAATC 


AAGTAAGAAT AATTAAAAAA 


TTTTGATATT 


1200 


GTCTATGTTT 


ATAGTTCACA 


AGCCATTCAA 


CGTATTGTAA 


ACTAAGGATA 


GTGTATTTTT 


1260 


TTAATAGTAA 


TTTGTCAGGA 


GGTGCCGATC 


TATGGAAGAA 


CATTACTACG 


TAAGTATTGA 


1320 


TATTGGATCA 


TCAAGCGTAA 


AAACAATAGT 


AGGCGAGAAA 


TTTCACAATG 


GTATAAATGT 


1380 


GATAGGTACA 


GGACAAACCT 


ACACGAGCGG 


TATAAAAAAT 


GGTTTAATTG 


ATGATTTTGA 


1440 


TATTGCGCTC 


GTGCC 










1455 



(2) INFORMATION FOR SEQ ID NO: 961 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961: 



TCGTCTAAAA 


GTAAANCTCN 


GATGTGTAGG 


TAGTGCGACG 


GCGTAtsAljAA 


ACGTCATTGT 


f n 
OU 


GTGCCCCCCG 


ATTCGTACTA 


ATGCGAGTAT 


TCCATTTGAC 


AAA 1 vJAA 11 A 


1 1 vjAUAVj 1\jA 




TAGCTCATTA 


AGAAGTGAGG 


ATAAAGCGAT 


GATATGAGTA 


CTTTAGTTGG 


X ALAGTGTTG 


i on 
loU 


AGTGGTTTTG 


AATATCGAGC 


ACATAAGGAA 


GAGTTGGATA 


AtTTAACAGA 


AGTTTTAAAA 


240 


GAATATAAAA 


GTAAATATAA 


ATATACTGGT 


TATACTGAAA 


ATGCAATTAT 


GAAAACGCAG 


300 


AATAGTGGTT 


TTAGAAATGA 


GTATTATTAT 


TTGACTGCAA 


TTCCATATAC 


TTTAGATGAA 


360 


TATAAAAGGT 


ATTTTCAACC 


CTTAATCAAA 


GAGGATGATA 


AGTCTTTTAG 


GGACGGTATG 


420 


AGAAACTCAA 


AAAAACAACT 


TAAAGATAAA 


TCACGTCCAT 


ATGTTGTTAC 


AACTTTATTT 


480 


AGTACTAAAG 


ATAATTTTAC 


TAAAGATAAT 


ACAATAGACG 


AGATGATTGA 


TTTTTCTGAA 


540 


GTCTCGTGCC 


TAAAAAAGAA 


AAAGAATATT 


CCTCATGATG 


TGTAATGTTT 


CTCTTCAAAT 


600 


NTCTAATAAA 


TATATAAATA 


CAAAAAGACC 


TAACTATAGT 


AAGAAGGAAG 


TAATAGAAGT 


660 


AGGTGTTTTT 


ANTCATGAAA 


AAGCTAACAC 


GANTTGATTA 


ANCTAATAAC 


AAATAAAAAG 


720 


GGGGGGTTCA 


AAAAAAGGCA 


TCTCAACAAC 


AAGG 






754 



(2) INFORMATION FOR SEQ ID NO: 962: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962: 



CCAATTTTTT 


ATGGATAACT 


TCGGTCTAAA 


AGTTAAAGCT 


ACGAATGTTG 


TAGGTAGTGG 


60 


CGACGGCGTA 


GAAGTATACG 


TGCATTGTGA 


TGACCACGAT 


ATCGTATTTA 


ATGCGAGTAT 


120 


TCCATTTGAC 


AAATCAATTA 


TTGACAGTGC 


ACGAGCTCAT 


TAAGAAGTGA 


GGATAAAGGC 


180 


GATGATATGA 


GTACTTTAGT 


TGGTACAGTG 


TTGAGTGGCT 


TTGAATATCG 


AGCGCAAAAA 


240 


GAAAAGTATG 


ATAATTTATA 


TAAATTTTTA 


AAAGAAAATG 


AAAAGAAATN 


CCANTATACA 


300 


GGATTTACTA 


AAGAGGCAAT 


TAACAAAACA 


CAAAATAGTG 


GATATGAAAA 


TGAATATTTT 


360 


TATATAGNTG 


CTAATATNCC 


GACGCTCCAA 


GAATATAGGA 


AATATTACGA 


ACCCCTAATA 


420 


AAGAAAAATA 


NTCTGAATTT 


TAAAAAAGGT 


ATGAAACAAG 


CAAGGGAAAG 


GAGTTAGGCT 


480 


ATAAAGCTGC 


AATAGAAGNA 


CATACAAACA 


TTGTTTGCGA 


GAAGTTAGTA 


ACTTNNCAAA 


540 


GGACAAAAAA 


ATTAGATGAT 


GTTTAGATNT 


GTCTGAAAGT 


ACGAAAAAGT 


NACACCTTAA 


600 


TTGTGGAAAA 


TACGAAAATT 


TTTT 








624 



(2) INFORMATION FOR SEQ ID NO: 963 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1096 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963: 



TTTTTACAAT 


TATTNACCCC 


ACGTTAACCA 


ATATATTTTT 


AAGTCGTTAT 


TGTAACNGTA 


60 


ATATCCAATA 


GCATCTTACC 


TGCATTTAAT 


TTTACCTATT 


GCGATGCTCA 


TTGCGTTAAC 


120 


CAGTATGTTT 


AATACCAACG 


ACAATCGGTG 


GTTTGTTATC 


GGCGATTGGT 


ATTGCAGGCA 


180 


TGGATCGTGT 


TACTCAATTC 


AATGTATTAG 


CGNAAAGTGG 


GCGTGCTGTA GAAGTATGTG 


240 


GAGATGTCGA 


TGTAATGATT 


TTAGACAAGA 


CAGGGACAAT 


CACTTATGGC 


AATCGAATCG 


300 


CTAGTGAATT 


TTTACCAGTG 


AATCAACAAA 


TGCTGGAAAA 


ATTGATAGTC 


GCTGCTTATA 


360 


TGTCATCAAT 


CTACGATGAT 


ACGCCAGAAG 


GAAAAAGTAT 


TGTACGATTA 


GCGAAGCAGA 


420 


TGTATATAAA 


TGAGCGGCCT 


AAAGACATTG 


ATGGCACATA 


TAAGCCCTTT 


TACAGCTGAA 


480 


ACTAGAATGA 


GCGGAATCAT 


TACAAATGAA 


ATATCGGTAT 


TCANAGGTGC 


ACCTAATAGT 


540 


ATGATAAATC 


TTGTCAAACA 


ACAACCCAAG 


GCAATATTCC 


TCTTAACATT 


GAGTCTCTCT 


600 


GTATGGATGT 


ATCGAGTAAA 


GGTGTCACAC 


C ATTAATAAG 


TCATTGAAAA 


CAATGTCATG 


660 


CTTGGCGTGA 


TTTATTTAAA 


AGATGTGATT 


AAAGATGGTC 


TTGTAGAACG 


ATTTACTGAA 


720 


TTGAGAAAAA 


TGGGGATTGA 


AACAGTTATG 


TGTACAGGTG 


ATAATGCTTT 


AACAGCAGCT 


780 


ACAATTNCAA 


AAGAAGCCGG 


GGTTGATAGA 


TTTGTTGCAG 


AGTGTAAACC 


CGAAGATAAA 


840 


ATTAAAGTCA 


TCAAAGATGA 


ACAAGCGAAA 


GGCCATATTG 


TTGCGATGAC 


AGGAGATGGG 


900 


ACAAATGATG 


CACCTGCGCT 


AGCACAAGCG 


AATATAGGAT 


TAGCTATGAA 


CTCAGGGACA 


960 


ATTAGTGCTA 


NAGAGGCAGC 


TAATCTAATT 


GATTTAGAGT 


CGAATCCTAC 


AAAGTTAATA 


1020 


GAGGTTGTAA 


AAATAGGCAA 


ACAATTATTA 


ATGACGAGAG 


GTGCATTAAC 


GACATTTAGT 


1080 


TTGGCTAATG 


ACGTAG 










1096 



(2) INFORMATION FOR SEQ ID NO: 964: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964: 
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CCACAATAAA 


GTATAAGGCA 


CGAGACTATA 


TAAGCTATTT GAAAATATAA ACCTATCTTT 


fin 


ATGAATGATT 


TTTCTACATT 


TTTTTCCATG 


TCTATTCCCC 


ATTTATTTAA 


TATTTATACT 


x ^ u 


TTACCTTAAT 


ATACCTTATT 


I 1A1 1 1AA1 1 


TTTATATGCA 


AAATACAAAA 


ATGGAGAACT 


180 


TCAATAl 1 1 A 


m a a a a rr» a nv^ A 

TAAAATATvJA 


AMu 1 IV IvL 


ACACTATATT 


GTTTTATTAT 


ATTTTCGCTA 


240 


TCAATACGCT 


AAATCATCAT 


All Iv-CU TCA 


ACATCACAGT 


AAAACTATTG 


CTCCACCAAG 




TATACrrTTT 


CAGTATAGCA 


TvJ I vJA 1 Alio 


GCAAACTTAT 


CTGCTAAAAA 


CTGCTTCCAA 




TTACGCAGTT 


GTTCAACATC 


ATCATCTTGT 


x x nnu X A/\ X \3 




TTGAAGATTA 


a *5 n 


AGACATCGTC 


CTGAAATATT 


AAAGCGTGTC 


APAPPTGPTG 

tU.AV.Li X VJ\— X 


GCACAGTTTC 


CCCTTTATGA 


/on 


ACAACCGCTT 


CAATTTCCTT 


% #t\a a /"»fFV* a "x m 

ATAACTCAAT 


GGPTGATACT 

VJUV> X VJ»»- X Av X 


TCATGAGTAC 


ATCTTGTTGA 




GAAAGACAAG 


GATATGTACC 


TTGfTGCAATT 


rirTTTACAG 

W X V* X v» x nv^nvj 


AACAACAACC 

/lrlU lifiw IVI^ 


ACTATAACTT 


' cnn 

DUU 


GCGACAACCT 


TTTCCCATAC 


1 l\iAAAAl\jl 


GCTTCGCCTA 


AATCTTTCGT 


ATACAAATAT 


DOW 


TGTTCTGTAT 


CACCATGACA 


ACATCGTAAT 


AAATGGCGCT 


TCTTGTCTTG 


TCTCAGTAGT 


720 


CCATGGCAAG 


CGATGTTCTT 


GTTGTAACGT 


TTCCCACCAC 


ACACCAAATG 


GAACTTTATG 


780 


TTGCCATGTA 


CTAATTGAAT 


ATTGTGTTTC 


ATGGATTTCT 


TGCACTGGAA 


CTTTCTTACA 


840 


TCCTAACGCT 


TTCAAACTTG 


TATACCGATG 


CACACCATCT 


ATAACCATAT 


ATCTACCATG 


900 


TTGCATCGCT 


GTCACTAAAA 


TAGGATGACG 


TATAAAATCA 


TCGGAANTC 




949 



(2) INFORMATION FOR SEQ ID NO: 965: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965: 



ACGTCGTTAA 


AACACTATAT 


CCATCAGATG 


GTTGTTCTAT 


AAACTAATTT 


CAACTATCGG 


60 


TGCATTAATT 


AATGGCATCG 


GAGTCTATCC 


TTGTTGTAGT 


ATCGTATACA 


GTCACTTCAA 


120 


ATCTCCACGT 


GAAGCAACTG 


GAGATAACTG 


GGATGGCCTT 


GGCCGTACTC 


TAGAGTGGAC 


180 


TACTGCATCA 


GCAATTCCAC 


CTAAATACAA 


CTTTGCCATT 


ACACCAGATT 


GGAATGACTC 


240 


CGACACATTT 


GTTGATATGA 


AAGAACATGG 


TCGTCATTAC 


TTAGACAACC 


ATAATTACAA 


300 


AGACATTCAT 


ATGCCAAACA 


ATACTCCTGT 


TGGATTCTGG 


ATTGGTATCT 


TTATGACTAT 


360 


CGGCGGTTTC 


TTCTTAATTT 


TCGAAACTGT 


AATTCCAGCA 


TTAATTTGTT 


TATTCGGTAT 


420 


TTTTGGTACT 


ATGATTTATC 


GCAAGTTNCC 


CAAATCCGAT 


CNCGGGATAT 


CATATCCCTC 


480 


CTGCTGAAGT 


TGCTGAAACT 


GAAGCTAAGA 


TTAAGAGAAG 


CGCGTATTAA 


AGAAAGGGAG 


540 


CCTGTAANTC 


ATGAGTCATG 


ATACAAACAC 


TATTGATTCA 


CGCACACATG 


AAGGCGAATT 


600 


AAATAAGCTT 


GGCTTTTGGA 


TTTTCATTAC 


AGCCGAATTT 


GCGTTATTCG 


GTACCCTATT 


660 


TGCGACACTA 


TTAACTTTGC 


AACATGGTGG 


CGATTATGCA 


GGTAAAATGA 


CTACTGAGTT 


720 


ATTTGAATTG 


CCTCTTGTTT 


TAATAATGAC 


GTTTGCATTA 


TTATTCAGTT 


CTTACACATG 


780 


TGGTATTGCT 


ATTTACTACA 


TGCGCCAAGA 


AAAACAAAAG 


TTAATGATGT 


TTTGGATGAT 


840 
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CATTACGTTA 


CTTTTAGGTT 


TAGTCTTTGT 


TGGATTCGAA 


ATTTATGAAT 


TCGCACACQT 


900 


ATGCATCAGA 


AGGCGTTAAC 


CCAACAATTG 


GTTCTTNCTG 


GTCTAGTTTC 


TTTATCCTAT 


960 


TAGGAACGCA 


TGGTTGGCAC 


GTATCGCTAG 


GTATCGTTTG 


GCCCATTTGT 


TTATTAATCC 


1020 


AAATCCAACG 


ACGCGGATTA 


GACAAATATA 


ATGCGCCAAA 


ATTATTTATA 


GTAAGTTTAT 


1080 


ACTGGCACTT 


CTTAGATGTT 


GTGTGGGTTT 


ACATCTTTAC 


TGCCGTATAT 


ATGATAGGGA 


1140 


TGGTGTATAG 


CGGATGAGTA 


CAATAATGAA 


^CATACTGTA 


GGATTTATCG 


CATCTATCGT 


1200 


ATTAACGCTT 


TTAGCAGTTT 


ACGTAACACT 


ATACACGTCA 


TTAACATTCC 


ACGCTCGTGC 


1260 


C 












1261 



(2) INFORMATION FOR SEQ ID NO: 966: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966: 



GTGTTCAAGG 


AGGTAAAAAA 


TAATGGCTGA 


AAATAAAAAC 


AATTTGTCGA 


TTAACGACGA 


60 


TCATTCTAAT 


GCAGCTATGA 


CGCATACCTC 


TGACGCTATC 


GCATCATCTG 


ATTTTATTAT 


120 


TAGAGAATTA 


GACTTGAATC 


AGGAACCTGA 


AATGCAACGA 


GAAAGCAAAA 


ACTTTGGGCA 


180 


AGATGCTTGG 


GCTCAGTTAA 


AACGAAATAA 


GTTAGCTGTT 


GTCGGTATGA 


TAGGTTTAAT 


240 


TATCATTGTA 


ATATTTGCTT 


TTATCGGTCC 


AGTTATAAAT 


AAACATGATT 


ATGCTGAACA 


300 


AAATGTAGAA 


CATAGAAATC 


TTCCGGCAAA 


AATACCTGTA 


TTTAGACAAA 


GTTCCATTTT 


360 


TACCTTTTGG 


ATGGTAAAGA 


TGCAGGATGG 


CAAGGATTGC 


TTATTAAAGC 


AGCAAATGCT 


420 


AAAGAAAATT 


ATTGGGTTGG 


GGTACTGATC 


AGTTGGGTCG 


AGATTTATGG 


ACAAGAACAT 


480 


GGAAAGGTGC 


TCAAATTTCA 


TTGTTTATCG 


GTGTTGTTGC 


AGCGATGTTA 


GATATTTTTA 


540 


TTGGCGTTGT 


ATATGGTGCG 


ATTTCTGGAT 


TCTTCGGTGG ACGTGTCGAT ACGATTATGC 


600 


AACGTATACT 


TGAAGTCATA 


GCATCTATTC 


CGAATTTAAT 


TGTCGTAATT 


TTATTTGTAT 


660 


TAATTTTTGA 


ACCATCCATT 


TGGACAATTA 


TATTGGCTAT 


GTCTATCACA 


GGCTGGTTAG 


720 


GCATGAGCAG 


AGTTGTACGT 


GGAGAATTTT 


TAAAATTGAA 


AAATCAAGAG 


TTTGTCATGG 


780 


CTTCGAAAAC 


ATTGGGGGCT 


TCAAAATTCA 


AATTGATATT 


TAAGCATATT 


TTACCTAATA 


840 


CGTTAGGTGC 


TATCGTGGTT 


ACATCAATGT 


TTACAGTCTC 


GTGCCATTTT 


CTTCGAAGCA 


900 


TTTTTAAGTT 


TCATTGGTAT 


AGGNGTACCT 


GCACCTCAAA 


CATCGTTAGG 


GTCATTAGTA 


960 


AATGATGGGC 


GCTCGTGCC 










979 



(2) INFORMATION FOR SEQ ID NO: 967: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 791 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967: 



GAAGACATGA 


ACGCATACGA 


AGATAGTTAT 


GAAAACCGAA 


CTAAAAAAAC 


AACTGCTGGC 


60 


AGTAGCGATT 


TAAGTATGTA 


ATTAACGAGG 


GCATTAGCCC 


TCTATTTTTG 


TGTACAAAAT 


120 


AACGATAAAC 


GAGGTATTTA 


ATATGACTGA 


AACAACTTTT 


AATCCAATTA 


CATCATTAAC 


180 


AATTAACAAT 


GAAGAAGTGA 


AAGCAAAAGC 


AACATTTATG 


TTCGATAAAA 


CCGCTAAAAA 


240 


ATTTGCAACT 


GAACAAGAAG 


ATAACAAAGG 


TAGAAAACAA 


ACGATTTCAG 


GATTTACTAA 


300 


TGTTTATNAC 


GCTTTATTAG 


AACGTGACAC 


AGTGGCAATC 


GTAGACTTTT 


GGGAATGTGC 


360 


AACAGCTTAT 


CTAGGTAAAA 


GCGCGCCTAA 


AAGAGAAGAT 


ATTGAAGCGG 


AAATCATGGA 


420 


AATCATCGAA 


AGAGAAAACG 


ACACGTTGAT 


CTTTTACAAG 


GTGCGTTGGA 


CGTAATGAAT 


480 


AATAGCGGTT 


TTTTCAAGCA 


GAAATCACGT 


CTATTCTGGA 


CACAGATGAA 


CCAAGCGCCA 


540 


TCGTTAGCCA 


AAGAAGACGA 


GAAAGAGGGC 


GCGAAAGCTG 


GTATCGAGAT 


GATGAAGAAC 


600 


AACTACAAAG 


AAATCATGAC 


CGTAGCACCT 


TATTAGACTA 


TTCGGAAATA 


AGGCAGATGA 


660 


CAAGTCGTTA 


CATAGGTTAT 


ATGAGTAATG 


ACGAGCTAAT 


GAGCATGCTA 


CCTGCCGAAT 


720 


GGAATGACTG 


GATTATTGGC 


GCTAGACAAG 


CATTGATTGA 


CCAAAGGGAC 


NTCGCTCGTC 


780 


CCGCTCGTGC 


C 










791 



(2) INFORMATION FOR SEQ ID NO: 968: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968: 



GGCACGAGCT 


AAAATATATG 


AAGCTAAAGG 


CCGTCCATCT 


GACAATCCGC 


TTATTGTTCA 


60 


TATACACAGT 


AAAGTTCAAT 


TAAAAGATTT 


TACATATACT 


TTGGCACGAG 


CGTAGAAAAG 


120 


TTAATGCAGG 


CATTCTGGCC 


GGGCCCTATT 


TCGTTTATAT 


TGCCGCTAAA 


GCCAGGCTAT 


180 


CTATGTCGAA 


AAGTTTCTGG 


AGGTTTATCA 


TCAGTTGCTG 


TTAGAATGCC 


AAGCCATTCT 


240 


GTAGGTAGAC 


AATT ATT ACA AATCATAAAT GAACCTCTAG 


CTGCTCCAAG 


TGCTAATTTA 


300 


AGTGGTAGAC 


CTTCACCAAC 


AACTTTCAAT 


CATGTATATC 


AAGATTTGAA 


TGCCCGTATA 


360 


GATGGTATTG 


TTCAAGCTGA 


ACAAAGTGAA 


GAAGGATTAG 


AAAGTACGGT 


TTTAGATTGC 


420 


ACATCTTTTC 


CTTATAAAAT 


TGCAAGACCT 


GGTTCTATAA 


CAGCAGCAAT 


GATTACAGAA 


480 
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ATACTTCCGA ATAGTATCGC CCATGCTGAT TATAATGATA CTGAACAGGC AATTGCACCA 540 

GGTATGAAGT ATTAAGCATT ACTCACCCAA TACAACACTT ACAATTATTA CAGATATTGA 600 

GAGCAAAATT GGAAATGACG GTAAAGATTG GTCTTCTATA GCTTTTATTG TGCCGAGTAA 660 

TAAGGTGGCG TTTATACCAA GTGAAGCGCA ATTCATTCAA TTATGTCAGG ATGACAATGA 720 

TGTTAAACAA GCAAGTCATA ATCTTTATGA TGGTGTTACA TTCACTTGAT GAAAATGAAA 780 

ATATTTCAGC GGCGTATATA TACGGCTTTG AGCTCGTGCC 820 



(2) INFORMATION FOR SEQ ID NO:969: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969: 



CTGTGTTCTT 


GGTCTCTTGA 


AAATGCTAAA 


GGCACGAGCG 


TGTTAAGTTT 


CCATTTTCAA 


60 


GCCATTCTTC 


TATTTCTTCT 


TCATATAAAA 


AGTCAGAACT 


ACGGTGTTGA 


TCACCAAAGA 


120 


ACAACCATGT 


TTTTCCAGTC 


ATACCAAGTT 


CTTCACGTTC 


TTGTAAATAT 


GCTCTAAAAG 


180 


GAGCAATTCC 


AGTACCTGGT 


CCAATCATAA 


TAACCGGTAT 


ATCTTGCTTC 


ATCGGAAATT 


240 


TGAAGTTCGG 


ATTTTTCTNC 


AAATAAATTG 


GTACTATATC 


GCCTGGTTTA 


ATTCGCTCAG 


300 


CAAAATGAAC 


CGAGCATACA 


CCTTTTCTCT 


CACGTCCATG 


TGCTTGATAA 


CGAACCGTAC 


360 


CAACGGTAAT 


ATGCACTTCA 


TCTGGCGTTG 


GCATAAAACT 


ACTAGAAATC 


GAATACTCTC 


420 


TTGGTGGTAA 


TTTTCTAAGG 


AATTTGATAC 


ATATTCTCAG 


GTTGTAATTC 


TATAGTTGGG 


480 


AAAATCTGTT 


ATTAAATCTA 


TAAAGTCCCG 


ATTTATAACA 


TATTCACGCG 


CCCATGACTC 


540 


ATCTTGAATA 


CGTTCAGATA 


ATTCTTCATT 


GGCAAAATAG 


ATATCTGCAT 


TTTACAATAA 


600 


TGGCAATGTT 


AATTTAGTAA 


ATTCAANATG 


TGATGTTAGT 


GCNTCNACAA 


TAGGAACTGT 


660 


ATCNACACGA 


ACANTAATTG 


GCACCGGAGA 


TTGGCGGATC 


CCAACCCNAA 


NATGGATATT 


720 


AGTTTTTCAA 


CCAATTCAGG 


GTCGTTTTGC 


GGTAATGCTA 


CTATACAATC 


TCCTGGTTCA 


780 


TATGATTCAC 


TAAAATCATC 


NAGTAAAAAT 


TCTATATGTC 


GTGTTTCTTT 


ATTTGAATCG 


840 


GTACCATTTA 


AATTGATATG 










860 



(2) INFORMATION FOR SEQ ID NO: 970: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970 : 



GGCACGAGCG 


CAAGGAGAAC 


ATTATGAACN 


AATTAAGGAG 


GCAATTATCC 


niunnl J^l_V?v~ 


oU 


TGCTCATTGA 


T ACATOG AA T 

a nwi a VrUnn a 


CAACCATTAT 


CAGTARPATT 




uA luAAul 11 


120 


TAGCTGAAAT 


AACTACGGAT 


TCTAAACAAA 


ATCATTCAGT 


GCAATTAATG 


CCTGCGATTA 


180 


GTCAGCTATT 


TGAGCAGAGC 


AAAATAGCTA 


AACAGCAATT 


AGATGCGATT 


ATTGTTGCTG 


240 


AAGGTCCTGG 


TTCATATACT 


GGCTTACGTA 


TCGGTGTTAC 


TGTTGCTAAA 


ACATTAGCGT 


300 


ATGCGTTAGA 


TGTTAAATTA 


TATGGTGTTT 


CATCATTAAA 


GGCATTGGCA 


GCAACAATCG 


360 


ATCATACTGA 


TAAATTATTG 


GGTTCCTGTG 


TTTGGATGCA 


AAGACGACAA 


GCAGGNTTAT 


420 


TCAGGCATAT 


TCCAGGTGGC 


AAAATGGGCA 


ATTAGTGACT 


TATTTTAGAA 


GGATCAATAT 


480 


ATTGTCGATA 


GAGGAATTGC 


GAGCTTTTTT 


TAGAAGATTT 


AAACCAACAT 


TTATTATATT 


540 


G6 












542 



(2) INFORMATION FOR SEQ ID NO: 971: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971: 



ATGATGTACG 


TGTTAGCAAG 


ATAATAGACA 


TATCTCATTA 


ATACTTGACG 


TATTCAAGGC 


60 


GTTCCCAAAG 


CTTAAGATAA 


CCGGACACTA 


CAATCAAGAT 


TTTGCANCGA 


ATTGAGCCAA 


120 


ACATATGAAT 


TGAGTCNAAG 


AAGAGTTAGA 


TAAGAGTGTC 


AAACTAGTAA 


CTCAACAAAC 


180 


GAAGTTATCT 


AAAGTTATTG 


TAGATACNCA 


AATTAATCCA 


GATTTAATTG 


GTGGATTTAG 


240 


AGTTAAAGTC 


GGCACAACTG 


TATTAGATGG 


TAGTGTTAGA 


AATGATCTTG 


TCCAATNACA 


300 


AAGAAAATTT 


AGAAGAGTTA 


ATTAATTATA 


AAGAGGAGTG 


ACATAGATGG 


CCATAAAAGC 


360 


TGAAGAAATC 


AGTGCATTAC 


TTCGCTCACA 


AATTGAAAAT 


TATGAGTCAG 


AAATGTCCGT 


420 


AACTGATGTA 


GGTACTGTAT 


TACAAATTGG 


TGATGGTATT 


GCATTAATTC 


ACGGATTAAA 


480 


TGACGTTATG 


GCTGGTGAGC 


TAGTGACATT 


CCATAACGGC 


GTACTTGGTT 


TAGCCCAAAA 


540 


CCTTGAAGAG 


TCAAACGTGG 


GTGTGGTTAT 


TTTAGGACCA 


TATACAGGTA 


TTACTGAAGG 


600 


TGACGAACTT 


AANACGTACT 


GGGTCGTNTC 


ATGGGAAGTA 


CCAGTAGGTG 


AAGAACTAAT 


660 


TCGGAAGAGT 


TGTNAATCCA 


TTAGGACAAC 


CTATTGATGG 


ACAAGGACCG 


ATTAACACAA 


720 


CTAAAACACG 


TCCAGTAGAG 


AAAAAAGCTA 


CTGGTGTAAT 


GGATCGTAAA 


TCAGTAGATG 


780 


AGCCATTACA 


AACAGGTATC 


AAAGCAATTG 


ATGCTTTAGT 


ACCAATTGGT 


AGAGGTCAAC 


840 


GTGAGTTAAT 


CATTGGTGAC 


CGTCAAACAG 


GTAAAACAAC 


AATTGCAATT 


GACACAATTT 


900 


TGAACCAAAA 


AGATCAAGGT 


NCGATTTGTA 


TCTATGTTGC 


TATTGGTCAA 


AAAGATTCAA 


960 
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CAGTAAGAGC ANATGTTGNA AAGTTAAGAC AAGCAGNCGC TTTAGACTAC ACTATTGTTG 1020 
TAGCAGCATC AGCTCGTGCC 1040 

(2) INFORMATION FOR SEQ ID NO: 972: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 972: 

CGTGCGAGCT GCTTTTGGTA AAAACGGCGG TAATATGGGT GTATCTGGAT CAGTTGCTTA 60 

TATGTTTGAT CATGTGGCAA CATTTGGTAT TGAAGGAAAG TCTGTTGACG AAATACTTGA 120 

AACATTAATG GAACCAAGAT GTAAGATGNA AATGATGTGA TTGACGATAA TGGATTGACA 180 

ATAGTCTATG CTTGAACCAA GATCCAATTT CCAGTCGTTC CAAGATGCGC TTCGTNCACC 240 

AGGTTGTTGG AAGAATTTAA AAGTTGCTTG AATTTGAAAT TGTTACCCTC CAAACAAGAT 300 

ATTGAACTTT CTGAAGCGGA CCNAAGTAAC ATTTGAAAAA TTAATCGATG CATTAGAAGA 360 

TTTAGAAGAT GTACAAAACG TATTCCATAA TGTGGATTTG AAATAATGAA ATCAGCAGAA 420 

CAATGGATTG ATGAATTGCA ACTTGAATCA CATCCTGAAG GTGGTTTCTA TAGAGAGACA 480 

ATTCGAGAAG TATTGAAAGA TGGACGCAGA GCGCCGTTTA GNAGTATTTA TTTTTTACTT 540 

ACAGATGACA ATATTTCGCA TTTTCATCGA ATTGATGCTG ATGAAGTATG GTACTATCAT 600 

GCTGGTGATT CTCTAACAAT TCATATGATA AATCCGGATG GGGAATATAC GACTGCAACA 660 

TTGGGTACTG ATATCCAAAA TGGAGATGTA TTGCAATATG TAGTGCCTAA AGGAACAATT 720 

TTTGCTTCTT CAATCGAAAT TTCAAATACT TATAGTTTAG TAGGTTGTAT GTGTCAACCG 780 

GCATNTGAGT TTAAGCAGTT TGAATTGTTT AAGCAATCTG AATTAATTAC ACAATATCCG 840 

CATCTTAAAT CAGTGATTGA NAAATATGCT TTAAAATAAA AATGATCAAT GAAGTGGTTT 900 

GAAGGTTGTT AATAAACCTT TGAGTCACTT CATTTTTATA TGTATTCTTG ATTGAATCAG 960 

AATAGATTTG ATG 973 

(2) INFORMATION FOR SEQ ID NO: 973: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973: 



ATAATTGAAA 


TGGAGGCATT 


TATTATTGGA 


ACATTATTGT 


TGAATAATAA 


RAAGTTATTT 
vmnu x x a xxx 




AAATAAGAAG 


ACCTTGGATG 


ATTTGTCTCN 


CCAAAACCAA 


AAAATAATTG 


ACCCCTGCAT 




ATTNCGATGT 


GCTAAGAAGC 


GATAAATGAT 


TTATCAAATG 


ATTCATTATG 


TGTGTTAAAT 


i nn 

lOU 


CATTTAATAT 


ATTAACGGAA 


GGACAAGAAG 


AAACATTCGA 


NCAAAAATTT 


TTGCAACGTA 


o a n 


ATAAGCATTT 


AAAAAACGTT 


GATGGGTTTA 


AAGCATTAGG 


TTTTTAAGAC 


CTAGAACAGC 


inn 
JUU 


TGGT AGAC AC 


TATATCATTA 


TTACGCTGTG 


GGAAAATAGG 


CAAGCATTTT 


ATCATTGGCA 


joU 


AAATTCAGCT 


GAGTACAAGC 


ATACACACAA 


GCATCGTGNC 


ACTTCGAAAG 


GTGCAGATGT 


>t 1 fl 


AAAAATAATT 


AATCGTGAAT 


TATCATACAA 
x x% x w\ x nunn 


TATTCGTATT GAATTAGCTG ATATGGTCTA 


480 


GCCCGAGATA 


TCAACCCTCA 


ATCATTTGAA 


AGTCCTTGAA 


TGCTACGTCA 


TGGCAAAGTG 


540 


TAGTGCCGAG 


GACTTTGGTA 


TTTAACCCTA 


TTTAATTGAA 


AATTCTGTAA 


AGATAGGCGT 


600 


ATGATATATA 


TAATTCATTT 


ATAAAATTAC 


AATGTTCATC 


ACAGGGAAAG 


TGGATGTATG 


C C f\ 

OoO 


AAAGGAAGTT 


AGTTATGCAA 


AAGTCAGCTC 


TATTCGCGCC 


TGAAAAATAT 


AACATTATTT 


ion 


CGGAAATCGA 


AAAATATAGC 


CACATACCAG 


ATAAAAAGGC 


AATTTTATAT 


CATAATACTG 


780 


AAGGCGAAGA 


TATATCAGTC 


ACTTACCAAC 


AGCTCATTGA 


ACAGTCTAAT 


AAAGTAGGTA 


840 


ATGTTTTAGC 


ATCGCACGGT 


TTATCTAAAG 

X X FX X \m* x 


GCGATAAAGT ACTCATTATG ATGCCACGAT 


900 


CTATTGCGAC 


GTATGAATTA 


TATATTGCTYS 

■X A X X V7\_ XV7 


CTTTAAAATT 


AGGTGTTGCA ATCATCCCAT 


o £ n 
960 


GTTCTGAAAT 


GCTAAGAACT 


AAARATTTAP 
*vr»#4VJ/lx X AXIV~ 


AATACCGTAT 


CACGCACGGT 


GAAATAAATG 


1020 






1 luALAu 1 l\j 


AATTTGAAAA 


AATTAAAGAA 


TATGATGCGT 


1080 


TANCTAAATT 


TGTTATTGCC 


GGTCAAAAAG 


ATGGATGGAT 


TTCACTTGAA 


GCTGAAAAAG 


1140 


AAAACGCAAG 


TAACAAACTA 


GAAATGGCAG 


ATACGACGCG 


TGATGATATG 


GCGANTCTGT 


1200 


CATATACATC 


TGGTACAACT 


GGCAATCCTA 


AAGCCCGNTA 


CACATTGTCA 


TGGATGGGGA 


1260 


TTTGCACATT 


TACAAAATGG 


CACCAAAACA 


TTGGTTATGT 


ATTAATGAAG 


ANTGACTTAG 


1320 


TTTGGGCAAC 


AAGCAGCANC 


AGGTTGGCAA 


AAAATGGGTA 


AAAACTTAAC 


AACAAAATCT 


1380 


TTTACGGT 












1388 



(2) INFORMATION FOR SEQ ID NO: 974: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 890 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974: 

CTGCAGATGA TTGAGCCCAA GTTCGTAGTT ACAGCTGTAA TGTTAAATAT TTTTAGTGCG 60 

CTTATCATCG CTAGTGTAAT CAATCCCTAT AAATCTGATG ATAGTGATGT TGAGATTGAT 120 

AANTTAACTA AATCAACGGA AACGAAATCA ATTGAATGGA AAAACAGGAA AACCTAAGAA 180 

AGTTGCCTTC TNCCAAATGA TAGGTGATAG TGCGATGGAT GGATTTAAAA TCGCTGTTGT 240 
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AGTAGCCGTA 


ATG1TGTTAG 


CATTTATTTC 


ATTAATGUAA 


uLAAl lAnin 


IVJ i 1 X \JrO 


300 


TAGTGTTGGN 


TTGNACTNTA 


AACAQCTTAT 


IviOC I A 1 Vj I vj 


1 1 luLnLvnn 






AATGGGGATT 


CCATGGAGCG 


AAGCTGTTCC 


AGCTGGCTCT 


1 1 AAXTjuCvxA 


r* T 2V A A TT & & T 
Linnnl 1/U\1 


^* A V 


TACAAATGAG 


TTTGTAGCAA 


TGCTTGATTT 


TAAAAATGTC 


CTGGGTGATCj 




A Df\ 


AACACAAGGT 


ATCATTTCAG 


TTTACTTAGT 


AAGCTTCGCT 


AATGTTGGTA 






CATCGTAGGT TCAANTCAAG 


GCACTTAGTG 


ATAAACAAGG 


AGAAGAAAGT 


TTCATCCTTT 


oUU 


NCAATGAGNC 


TGTTACNTGG 


CTCAACTCTA 


GCNTCAATCA 


TTACAGGACT 


CAATACAATG 


660 


ACTTAGTATT 


GTNAATTANT 


CGAAGTATCT 


AAATTAAATT 


CATGGCNAAG 


CTAAACCCCA 


720 


TCACCAAGTT 


GGCGCAACAG 


CGCATCATAA 


CTTGGTGATG 


GGATTTTATC 


ATAACAATCT 


780 


ACTTATTCGT 


AGCCGTTTTT 


GANATGTATG 


TTGNTGGTTT 


ATCTTTTTCC 


CCAAATTGTT 


840 


AATCCCGTTA TATLTTTTTT 


ATGTTTTGAA 


GGGACAATGA 


AGCTAAGTAG 




890 



(2) INFORMATION FOR SEQ ID NO: 975 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975: 



GGCACGAGAC 


TTATATTAGA 


CGGCTTAGAA 


GACCCGCATA 


ACTTGGGATC 


AATTTTAAGA 


60 


ACCGCCGATG 


CAACGGGAGT 


TGATGGTGTT 


ATTATTCCTA 


AACGTCGTTC 


AGTTACACTA 


120 


ACGCAAACAG 


TTGCAAAAGC 


CTCAACAGGT 


GCAATTGAAC 


ATGTACCAGT 


TATTCGAGTG 


180 


ACAAATTTAG 


CTAAAACTAT 


CGATGAACTA 


AAAGATAATG 


GCTTTTGGGG 


TAGCTGGTCA 


240 


CTGAAGCTAA 


TAATGCAACA 


GATTATAGAA 


ATCTAGAAGC 


GGACATGTCA 


TTGGCTATTG 


300 


TAATTGGTAG 


CGAAGGACAG 


GGTATGAGTC 


GCCTAGTAAG 


TGATAAATGC 


GATTTTTATA 


360 


TTAAGATTCC 


AATGGTTGGA 


CATGTAAACA 


GTTTGAATGC 


TTCGGTTGCA 


GCAAGTTTAA 


420 


TGATGTACGA 


AGTATTTCGA 


AAAAGACATT 


GATGTTGGAG 


AAATATAATG 


NAAGNCCGTT 


480 


ACTTAATCAT 


TGATGGATAC 


AATATGATAG 


GACCAATCAC 


CACCGCTAAG 


CGCCATTCCN 


540 


NAAGAGAATT 


TAGAAGAAGC 


TAGAATGCAA 


TTAATAGATC 


CAATTCCAAA 


TTATAATGCA 


600 


GTTATTTCAG 


ATGAAATTAT 


TTGTGTTTTC 


GATGCTTATG 


ACCAATCGGG 


TGTTGAAAGA 


660 


GAATACATGT 


ATCATGGCGT 


TAAAACGATT 


TTTACCAAGG 


AAAAAGAAAC 


AGCTGATAGT 


720 


TTCATAGAAC 


GTTATGTTTA TGAACTTTAT 


GACAAGCATA 


CTAAGCATAT 


TGCAGTTGTA 


780 


ACAAGTGATA 


TGAGTGAGCA ACATGCTATC 


TTTGGATCAG 


GTGCATATAG 


AATATCATCT 


840 


CGCGAAATGT 


GGAGAGATTT 


AAAAGAAAAT 


GAAATTGATG 


TGAGTAAATC 


ATTAGATGAT 


900 


ATAAGTGAAA 


ACAAGCCAAG 


AACTC 








925 



(2) INFORMATION FOR SEQ ID NO: 976: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1112 b^se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976: 



CAATATAAAG 


TGGCAAAGGA 


GGTAATTGAG 


ATGGCAACAC 


ATTTAAGCTT 


TAGAACNAGG 


60 


CGNACAAGAG 


TGTANCCCAA 


CATTATTGGG 


TTATGCCGGT 


GTTGGTATCA 


TCATTTGGTA 


120 


TTGTGGCTTC 


GTCTCAAAAC 


TTTAGTATTT 


NAGAAATTAT 


CTNGTTATGT 


CTTGTTATAT 


180 


ATGCCGGTGC 


TGCGCAATTT 


ATTATGTGCG 


CGTTGTTTAT 


AGCAGGTACA 


CCGATATCAG 


240 


CGATTGTACT 


AACTGTATTT 


ATCGTAAATT 


CAAGAATGTT 


CCTTTTAAGT 


ATGTCGCTTG 


300 


CACCAAACTT 


CAAGACATAT 


GGGTTTTGGA 


ACCGTGTTGG 


ATTAGGCACA 


TTAGTAACTG 


360 


ACGAAACGTT 


TGGCGTCGCC 


ATTACACCTT 


ATTTAAAAGG 


AGAAGCTATC 


AATGATCGNT 


420 


GGATGCATGG 


TCTTAATATC 


ACAGCATATT 


TATTTGGGGA 


CAATTTCATG 


TGTAGCTGGG 


480 


GCTTTATCTT 


GGCGAATATA 


TCTCAAATCC 


GCAAACGCTA 


GGGTTAGATT 


TNGGCTATCA 


540 


CGGCTATGTT 


TATCTTCTTG 


GCCATTGCGC 


AATTTGGATC 


AATTAGGCAC 


GAAATCGCGA 


600 


TTAAGAATTT 


ACATAGTACT 


CATTATTGCC 


GCCATAGGAA 


TGATGTTATC 


GCTAAGTATG 


660 


TTTATGCCTT 


CATATCTAGC 


AATATTAATT 


GCAGNCACAA 


TNTCAGCAGC 


GTTAGGAGTG 


720 


ANGATGGAAC 


GATGATAACT 


CATATGAACA 


TGTTAATACT 


TATTTTATTG 


TGTGGTATCG 


780 


TAACGCTATT 


AATTCGAATT 


ATNCCTTTTA 


TCATGATNTC 


AAAAGTGCAA 


TTGCCTGATG 


840 


TTGTGGTTCG 


ATGGCTATCA 


TTTATCCCAN 


TCACACTATT 


TACGGCACTT 


GTCATTGACA 


900 


GCATTATTCA 


ACAGACGCCT 


CATGGTGAGG 


GGTATACATT 


AAACATCCCT 


NACATTATCG 


960 


CGCTCATTCC 


GACGGNTATT 


TTATCTATAA 


TCACGCGGTA 


GTTTAACTAT 


TACAATTATT 


1020 


AGTGGGATTG 


TTATCAAGGC 


AACATTACGA 


TTTTCTTTAA 


AATACTGAAA 


TCATGACAGT 


1080 


ATTTAACTTA 


GGTAACTGGT 


AGTAATCAGA 


AA 






1112 



(2) INFORMATION FOR SEQ ID NO: 977: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9?7: 
CTAGCCAATT AGTAACGTCT AAAAATAACT TACAAAGTTC TGTGAACCAA GTACCATCAA 60 
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CTGCTGGTAT 


GACGCAACAA 


AGTATTGATA 


ACTATAATGC 


GAAGAAGCGT 


GAAGCAGATA 


120 


ACTGAAATAA 


CTGCAGCTCA 


ACGTGTTATT 


GACAATGGCG 


ATGCAACTGC 


ACAACAAATT 


180 


TCAGATGAAA 


AACATCGTGT 


CGATAACGCA 


TTAACAGCAT 


TAAACCAAGC 


GAAACATGAT 


240 


TTAACTGCAG 


ATACACATGC 


CTTAGAGCAA 


GCAGTGCAAC 


AATTGAATCG 


CACAGGTACA 


300 


ACGACTGGTA 


AGAAGCCGGC 


AAGTATTACT 


GCTTACAATA 


ATTCGATTCG 


TGCACTTCAA 


360 


AGTGACTTAA 


CAAGTNCTAA 


AAATAGCGCT 


AATGCTATCA 


TTCAGAAGCC 


AATAAGAACA 


420 


GTGCAAGAGG 


TACATCTGCG 


TTAACAAATG 


TAAATCGTGT 


CAATGAGCGA 


TTAACGCAAG 


480 


CAATTAATCA 


ATTAGTACCT 


TTAGCTGATA 


ATAGTGCTCT 


AAGAATCTGC 


TAAGACGAAA 


540 


CTTGGATGAA 


GAAATCAATA 


AATCAGTAAC 


TTACTGATGG 


TATGACACAA 


TCATCAATCC 


600 


AAGCATATGA 


AAATGCTAAA 


CGTGCAGGTC 


AANCAGAANC 


AACAAATGCA 


CAAAATGTTA 


660 


TTAACAATGG 


TGCACGAGCG 


ACAGACCAAC 


AAATTGCCGC 


AGAAAAAACA 


AAAGTAGAAG 


720 


AAAAATATAA 


TAGCTTAAAA 


CAAGCAATTG 


CTGGATTAAC 


ACCAGACTTG 


GCACCATTAC 


780 


AAACTGCAAA 


AACTCAGTTG 


CAAAATGATA 


TTGATCAGCC 


AACGAGTACG 


ACTGGTATGA 


840 


CAAGCGCATC 


TGTTGCTGCA 


TTTAATGACA 


AACTTTCAGC 


AGCTAGAACT 


AAAATTCAAG 


900 


AAATTGATCG 


CGTACTAGCA 


TCTCATCCAG 


ATGTAGCAAC 


GATTCGTCAA 


AACGTGACAG 


960 


CAGCGAATGC 


TGCTAAAACA 


GCACTTGGAT 


CAAGCGCGCA 


AATGCCTTAA 


CAGGTCGGAT 


1020 


AAAGCACCTT 


TAGAAAATGC 


GAAAAATCAA 


CTACAACATT 


GGTATTGGAT 


ACGCAAACAA 


1080 


GTTCAACTTG 


GTATTGAACA 


AAGACTCTTT 


TTAATTGCAT 


ACAAATGCGA 


AGGTTAACAG 


1140 


GGTGCACCGT 


TATTAGGGTC 


CAACAAATCA 


ATC 
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(2) INFORMATION FOR SEQ ID NO: 978: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978: 



CTAAGACGTC 


AACATCAGTT 


ACAACCCAAA TCCAAGGGTC 


CAANTCNTAA 


TGCGTTTAGT 


60 


TTCCCGGAAA 


ATGACCCAAC 


TTGATAATCA TTGGATGGCC 


ATGTTTNAAG 


ATATGAATTT 


120 


TGATCTAGGT 


ATTATGGCAT 


TGTATGCCAT AGAAAGTGAT 


CCGCTTGCTA 


ATTTNCCGCG 


180 


TAACTCTGAC 


GCTGAAATTG 


CCGTCNTTGA CGAGTCGCAT 


ATAGATGCCT 


ATTAAAAGTT 


240 


GCATATCAGT 


TTAGTCTGCC 


ATTGGGAAAG GACTATGCAG 


ATGCACATGA 


AGAAATGGTA 


300 


AGGGAACATT 


ATCAAAAAGA 


TGTGATTAAA CGCTTAGTAG 


CTTATTTAAA 


TAATGAACCT 


360 


ATTGGCGTTG 


TAGATGTCAT 


TGAAAGTGAA AATT ACATTG 


AATTAGATGG 


ATTTGGTGTA 


420 


TTAGAACAAT 


TTCGACACCA 


AGGAATTGGA TCTACAATTC 


AATCCGTTGA 


TAGGTGAATT 


480 


ACGCCATATC 


AAAAAATCAC 


AAACCAATCA TATTAGTTAC 


AGATGGTGAA 


GATACAGCAA 


540 


AAGATATGTA 


TGCAAAGCAA 


GGTTATGTCT ATCAATCGTN 


TTTGTTATCA 


NATATTAAAA 


600 


GAAGATATTG 


GAAATTANCG 


ATTGATGATA GGTTTGTATT 


AGAATAANAA 


AGCAGTAANC 


660 
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TGGTTTCGAG TAGAAATCAA CTTACTGCTT TTTAAATTGT TTTGAGCTCG TGCC ^ 714 

(2) INFORMATION FOR SEQ ID NO: 979: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 975 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979: 



CATAAAGNAA 


AGCGTAAATA 


CTTTTTTAGC 


CCGAATACAT 


TTGTATTCGG 


TTTTTTTGTT 


60 


GAAATGATTT 


TAAAGTGAAT 


TGATTAAGCG 


TAAAATGTTG 


ATAAAGTAGA 


ATTAGAAAGG 


120 


GGTCATGACG 


TATGGCTTAT 


ATTTCATTAA 


NCTATCATTC 


ACCAACAATT 


GGTATGCATC 


180 


AAAATTTGNC 


AGTCATTTTA 


CCGGAAGATC 


AAAGCTTCTT 


TAATAACGAT 


ATANCTGCTA 


240 


AACCATTAAA 


AACTTTAATG 


TTGTTACATG 


GATTATCAAG 


TGATGAAACG 


ACATATATGA 


300 


GATATACAAG 


CATAGAAAGG 


TATGCGAATG 


AACACAAATT 


AGCTGTGATT 


ATGCCCAATG 


360 


TGGATCATAG 


CGCATATGCT 


AACATGGCAT 


ATGGTCATAG 


CTATTATGAT 


TATATTTTGG 


420 


AAGTATATGA 


TTATGTTCAT 


CAAATATTTC 


CACTTTCCAA 


AAAGCGTGAT 


GACAATTTTA 


480 


TAGCAGGTCA 


CTCTATGGGA 


GGATATGGCA 


CAATTAAATT 


TGCATTAACA 


CAAGGGGATA 


540 


AATTTGCCAA 


AGCTGTACCA 


TTATCTGCTG 


TGTTTGAAGC 


GCAAAATTTA 


ATGGATCTAG 


600 


AGTGGAATGA 


TTTTTCAAAA 


GAGGCCATAA 


TTGGCAATCT 


TTCAAGTTGT 


TAAAGGAACT 


660 


GAACATGATC 


CGAATTATTT 


GCTAGACAAA 


GCTGGAGCTG 


AAGATAAACA 


AACCCCAAAA 


720 


TTGCTCATGT 


AGNTGTGGTA 


GTACACAGAC 


TTGANATATC 


ANNACAACTT 


AGATTTTATC 


780 


GATTATTTAT 


CACGTAAAAA 


TGTTCCTTAT 


CAATTTGAAG 


ATGGACCAGA 


GATCATGATT 


840 


ATGCATATTG 


GGACCAAGCG 


ATTAAGCGTG 


CTATAACATG 


GATGGTGAAT 


GATTAATTAT 


900 


TCCCTGGAAA 


ATATGTGGTT 


GCATTAATAC 


ACAGAGTGAG 


AGATACAACT 


ATTACGCACA 


960 


CTACATTCTG 


ATGTT 










975 



(2) INFORMATION FOR SEQ ID NO: 980: 

(i) SEQUQXTE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980 : 
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ATAGGAAGCA 


TATGNNAGAG 


GAGAGGCCAG 


TTGTTTGTTT 


TGTGCAGCAG 


CCTGCATAGA 


60 


GAGCCTCTAA 


TCACTAAAGG 


TGAAGAGGNA 


CGCAGTTGGA 


TGCTAAGGGA 


CAACTGCATA 


120 


AATCCCTCTA 


ATCGCTAAAG 


CGAAAAAGTG 


GGATTAAAAA 


GGAGATGTGA 


TAATGTGAAG 


180 


AAATCGTTAA 


TTGCTTTTAT 


TTTGATTTTT 


ATGCTTGTCC 


TGAGTGGCTG 


TGGTATGAAA 


240 


GATAATGATA 


AACAAGGTAG 


CGATGATAAT 


GGCTCGTCTA 


AATCGCCGTA 


CCATAGAANT 


300 


GTTTCGTTAA 


TGCCTAGTAA 


TACTGAAATT 


TTATATGAAT 


TAGGATTAGG 


TAAATACATA 


360 


GTTGGTGTGT 


TCAACGGTTG 


ATGATTATCC 


AAAAGATGTG 


AAAGAGGGTA 


AGAAACAATT 


420 


TGATGCTTTG 


AATCTAAGTA 


AAGGAGGAAC 


TTTTAAAGGC 


AAAGCCAGAT 


CTAATTCNTG 


480 


CGCATGAGTC 


NCAAAAGNCA 


ACTGCTAATA 


AAGTATTGTC 


ATCATTAGAG 


AAACAAGGCA 


540 


TCAAAGTAGT 


GTATGTTAAA 


GATGCACAAT 


CAATTGATGA 


AACATACAAC 


ACATTTAANC 


600 


AAATTGGGAA 


ATTAACGCAT 


CATGATAAGC 


AGGCTGAACA 


ACTTGTTGAG 


GAAACTAAAG 


660 


ATAATATCGA 


TAAAGTCATA 


GATTCAATTC 


CTGCTCATCA 


TAAAAAATCA 


AAAGTATTTA 


720 


TTGAGGTNTC 


ATCAAAGCCT 


GAAAGANATA 


CAGCAGGGAA 


GCATACATTT 


TTTAATGATA 


780 


TGTTAGAAAA 


ATTAGAAGCC 


CAAAATGTTT 


ATAGNGACAT 


TAATGGTTGG 


AACC 


834 



(2) INFORMATION FOR SEQ ID NO: 981: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981: 



CTGAAATGGT 


TCAAATNTTG 


GGATGAGAAT 


GGTAACGTCG 


TAAATGAAGA 


CTTAGTACCT 


60 


GATCTTACGG 


ATGAACAATT 


AGTAGAATTA 


ATGGAAAGAA 


TGGTATGGAC 


TCGTATCCTT 


120 


GATCAACGNT 


CTATCTCATT 


AAACAGACAA 


GGACGTTTAG 


GTTTCTATGC 


ACCAACTGCT 


180 


GGTCAAGAAG 


CATCACAATT 


AGCGTCACAA 


TACGCTTTAG 


AAAAAGAAGA 


TTACATTTTA 


240 


CCGGGATACA 


GAGATGTTCC 


TCAAATTATT 


TGGGCATGGT 


TTACCATTAA 


CTGAAGCTTT 


300 


CTTATTCTCA 


AGAGGTCACT 


NCAAAGGAAA 


TCAATTCCCC 


TGAAGGCGGT 


TAATGCATTG 


360 


GAGCCCCACA 


AAATTATTAT 


CGGGNGCACA 


ATACAATCCA 


AAGCTTGCTG 


GGNGTTGGNA 


420 


TTNGNACTTN 


AAAAACGTGG 


TAAAAATGCA 


GTTGCAATCA 


CTTACACTGG 


TGACGGTGGT 


480 


TCTTCACAAG 


GTGATTTCTA 


CGAAGGTATT 


AACTTNGCAG 


CAGCTTATAA 


AGCACCTGCA 


540 


ATTTTCGTTA 


TTCAAAACAA 


TAACTATGCA 


ATTTCAACAC 


CAAGAAGCAA 


GCAAACTGCT 


600 


GCTGAAACAT 


TAGCTCAAAA 


AGCAATTGCT 


GTAGGTATTC 


CTGGTATCCA 


AGTTGATGGT 


660 


ATGGAGGCAC 


GTTAGCTTGT 


ATATCAAGCA 


ACTAAAGAAG 


CACGTGAGGC 


CCGAGCGCAG 


720 


TTGCAGGTGA 


AGGTCCAACA 


TTAATTGAAA 


CTATGACATA 


TCGTTATGGT 


CCTCATACAA 


780 


TGGCTGGTGA 


CGATCCAACT 


CGTTACAGAA 


CTTCAGACGA 


AGATGCTGAA 


TGGGAGAAAA 


840 


AAGACCCATT 


AGTACGTTTC 


CGTAAATTCC 


TTGAAAACAA 


AGGTTTATGG 


AATGAAGACA 


900 
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AAGAAAATGA AGTTATTGAA CGTGCAAAAG CTGATATTAA AGCAGCAATT AAAGAGGCTG 960 

ATAACACTGA AAAACAAACT GTTACTTCTC TAATGGAAAT TATGTATGAA GATATGCCTC 1020 

AAAACTTAGC AGAACAATAT GAAATTTACA AAGAGAAGGA GTCGAAGTAA GCCATGGCAC 1080 

AAATGACAAT GGTTCAAGCG ATTAATGATG CGCTTAAAAC TGAACTTAAA AATGACCAAG 1140 

ATGTTTTAAT TTTTGGCGAA GACGTTGGTG TTAACGGCGG TGTTTCCGTG TTACTGAGGA 1200 

CTACAAAA 1208 

(2) INFORMATION FOR SEQ ID NO: 982: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982: 



CAGTTATTAA ATTAGATGCA 


ACGACAACAA 


GTGTNTTAAT 


TACTTGTGTC 


ATGGCAATAA 


60 


TGATTCCATT 


AGCATTAATG 


TNTGGCAAGT 


TAGCGGATAA 


AATAGGTGAA 


AAGAAAGNAT 


120 


NTCTAATTGG 


CACGAGACTG 


GTGGGCTAAC 


ATTATTTAGT 


ATCATCGCAT 


TTATGTTATT 


180 


ACATTCACAA 


TCATTTGTTG 


TAATAGTAAT 


CGGTATATTT 


ATATTAGGAT 


TTTTCTTATC 


240 


AACTTACGAA 


GCGACAATGC 


CAGGGTCGTT 


ACCAACGATG 


TTTTACAGTC 


ATATAAGATA 


300 


TCGAACTTTA 


TCAGTAACAT 


TTAATATCTC 


TGTTTCGATA 


TTTGGTGGTA 


CGACGCCATT 


360 


AGTTGCAACA 


TGGTTAGTTA 


CGAAAACTGG 


AGATCCATTA 


GCACCTGCGT 


ATTATTTAAC 


420 


AGCAATCAGT 


GTTATTGGCT 


TTTTAGTTAT 


TACATTCTTA 


CATTTAAGTA 


CAGCAGGAAA 


480 


ATCTCTAAAA 


GGTTCGTATC 


CAAATGTAGA 


TAACGAGCAA 


GATAGAGCTT 


ATTATGCAGA 


540 


ACACCCAAAA 


GAAGCATTAT 


GGTGGGTTAA 


AGAACGTAAG 


AATTAGAGAT 


TTCAATAAAA 


600 


AGTATAAATC 


AATCGTATAT 


AAGCACTTTA 


AAGCTAGTAG 


GTTCTGCTAA 


CTTTAAAGTG 


660 


CTTTTTAAAT 


TGAGAACTGT 


AATTAGCCGT 


AATAAAGTTT 


TTGTATATAC 


ATAAACCCCC 


720 


ACTGCAATGA 


TTATCGCAAT 


GGGGGAAAGA 


GGGGACTTAA 


AGCATATTGT 


TTAGCTTTGG 


780 


AATACTTAAA 


ATTCTCTTTG 


GCTATTGAAA 


TTGTTAGGAT 


TGTAAATATT 


GTCTTTAGAG 


840 


TATTTTGGTC 


CCAACGCAAT 


TAATATTTGG 


AGACTCTTAA 


CCCTTCAAAT 


ATTTATTATA 


900 


GGAGGAACAA 


CAAACTTAAA 


TAGGATTGGG 


TGACTTATTT 


GTGTCAGTTA 


TTGCGATTGC 


960 


GATAACTTCT 


TTTCTCTATA 


TACATATAGT 


AACGTCTTAT 


CTAATAAAAA 


ACATGGTCTC 


1020 


GTGCC 












1025 



(2) INFORMATION FOR SEQ ID NO: 983: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983: 



GGAGCTNCGA AGTTGAATGC 


GTCAACTACA 


GNTAAGCCAT 


TCTCTNGAGC 


TNNGAAAGAT 


60 


AATGCTGAGC 


GTAAAGCTAA 


ACGACGCATN 


NTCTTAGGCA 


TTGNGTATGC 


ATAACTTCTT 


120 


GGAGTTGGTC 


CGAATACGAT 


ACCACCGNCA 


CGNCATTGTG 


GAGCACGGAT 


TGTACCTTGA 


180 


CGAGCACGAC 


CTGTNCCTTT 


GTGTTTCCAT 


GGTNNACTTN 


CATTTNTATA 


CCTCCCTATA 


240 


TTATGGAGTG 


ACGATTATTT 


ATAACGACAA 


GTAACTCGTG 


CCGNGTGTTA 


AATCGTACGG 


300 


AGACATCTCA 


ACAGTTACTT 


TGTCGNCAGG 


GTAGAATACG 


AATTGTAATT 


CATTCCTGAT 


360 


TTNACCACTT 


ACGTGAGCTA 


AAATCTCATG 


ANCATTTTCT 


AATTCTACTT 


TAAACATTGC 


420 


GTTCGGTAAA 


GTATCTAATA 


CAGTACCTTC 


TAATTCAATT 


ACATCTTGTT 


TAGCCATTGA 


480 


TTAACTTCCC 


CCTTTTTGCA 


ATAGTAAGGT 


AATCGTCAAT 


AGACAACTTT 


ATTGTTACGA 


540 


ATCTATCAGT GATTAATTTT 


ATAAGTTAAA 


CAAAAATTAC 


GGGAATTAAT 


TATCGGTAAT 


600 


TGCCACTCTC 


ATCTATCTAA 


TATGATTAAA 


TCAAGCCTCA 


CTTAAAATAG 


ACCGCTAAAA 


660 


GTGGATCTCA 


TTACAAATGA 


TCNAAAATAT 


CAATGACATC 


TTTGGGTAAN 


GTCGCTAATA 


720 


TCTTTTGAAC 


CATCAATATT 


TTTCAATACA 


CCTTTTGGAT 


CATAGAAATC 


TAAAATAGGC 


780 


TTAGATTGTT 


TAATATTAAC 


ACTCAAACGG 


CACGAGCTAC 


CGNTTCAGGA 


TTATCATCTT 


840 


CTCGTTGATA 


CANNTNAACA 


CCATCGATAT 


CACAAATACC 


TTCGACCTTC 


GGAGGATTAA 


900 


ATACAAGATG 


ATACGTTGTA 


CCACATGACT 


CACAGATTCG 


ACGACCTGTA 


AGACGGTTCA 


960 


TTAATTCTTC 


TTCCGGAACT 


TCGATATTGA 


TGACAGCATC 


AATGTTTCTG 


TCAAGCTCAG 


1020 


ACATAATATT 


ATTTAATGCC 


TCAGCTTGCT 


CGATTGTTCT 


TGGGAAGCCA TCTAATAAAA 


1080 


AGCCTTGTTN 


GNGCATCCGT 


CCTTNCAGAA 


ATTCCTTTCC 


CTTAACGATA CCTACAGTCA 


1140 


CTTCCATCCA 


GGAACTAATT 


CCGNCACGGT 


CCCATATAAG 


ACTTAGCTTC 


CTTTACCTAA 


1200 


TTCAGTNCCT 


TCCTTCTATN 


GCTTTTCTTG 


AACATGTTCA 


CCAGCTTGAA 


ATGTTGGNGT 


1260 


ATTGNGNAAT 


TCCTTGACAA 


TTNCACTTGC 


TTGAGTTCCT 


TTACCTGCGC 


CAGGGTNAAC 


1320 


CCCATCAAAA 


ATGATATTCA 


TAANGTGNCC 


CTCCCTAAAC 






1360 



(2) INFORMATION FOR SEQ ID NO: 984: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984: 
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T ATrTGNCA LA 




TTTUP A TTP A 
1 X 1ALA1 X l-~r\ 


TATTT a AP AT 
XA1 IWulwll 


AGCAATATTA ATTTATTCTT 


fin 


n rTY* a iY2 a pp 


atty^ocatyia 

nl l\JCI>AiUn 


TPTGTTPAAC 

X V» X O X x wnnx. 


AATAPTPAP/T 


TTATTGCTCG 


TTACAGGCTG 


120 




APAPTPATTA 


CCATAGCCAG 


TACAATTATC 


GGAACTTTTT 


TGAGTATAGG 


180 


TrraAPAfSAA 


TTAATPATAT 

x x fin x w/\ x tx i> 


ATATC3ACAGA 


TGGTAAAGGC 


ATTAAATATG 


AAACGATGAA 


240 




TTAPPAPPGA 


AAGATATCTT 




GTGTTAATAG 


GTTCACTTGG 


300 


LuLlni lAlo 


r; a TYTPTYiP A A 


TTAPAATTGC 
x x Aunn x x vjv 


TAGTGGGATG 


CATGAAATTT 


NACAGCGNAC 


360 


UttUlUlLAi 1 


11.AA 1 uAxTUtA 


ttATY3fifiPf5PT 


X OV. X X VJVJO\»0 


aaapatppha 

nAnLAl\.vuA 


P A aMATATTA 
Uuiuinlnl In 




TGGGAACAAT 


GACTAACATC 


TTATTATTCT 


CATATTTATC 


CGGTGCACTA 


CCTATGTTTT 


480 


TACTCTTTTT 


AAAAAATGCA 


AATACTGTGA 


CTTATACTAT 


TTCGATGAAT 


TGGTCGTTAG 


540 


AACTCGCGCG 


CGCACTAACA 


GGCGGTATAG 


GTATCGCACT 


TACAATTCCT 


ATCACAATAT 


600 


TGTTATGGAA 


ATATTTGAAA 


CATAAGGAGG 


CGAAACAATG 


AATGCTGTAT 


TATATTGGCG 


660 


TATTGCTCTC 


TTTCTGATGA 


TATT 








684 



(2) INFORMATION FOR SEQ ID NO: 985: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985: 



CCCCACNAAG 


TACNCGAAAC 


GCAAACAAAC 


ATCTTAAAAG 


GAGGAACGAA 


CAATGCAAGC 


60 


ATTACAAACA 


TTTAATTGGA 


AAGAGCTACC 


AGTAAGAACA 


GTGGAAATTG 


AAAACGAACC 


120 


TTATTTTGTA 


GGAAAAGATA 


TTGCTGAAAT 


TTTAGGATAT 


GCAAGGGCAG 


ACAATGCCAT 


180 


CAGAAATCAT 


GTTGATAGCG 


AGGACAAGCT 


GACGCACCAA 


TTTAGCGACA 


GCAGGTCAAA 


240 


ACAGAAATGT 


AACGATCAAT 


CAACGAATCA 


GGATTATACA 


GTTTAATCTT 


TTCTAGCAAA 


300 


TTAGAAAATG 


CGAAGCGGTT 


CAAACGTTGG 


GTAACTTCGG 


AAGTTTTACC 


AACATTAAGA 


360 


AAAACTGGTG 


CTTACCAAGT 


ACCTAGCGAC 


CCAATGCAAG 


CATTGAGATT 


AATGTTTGAA 


420 


GCTACAGAAG 


AAACAAAACA 


AGAAATTAAA 


AACGTAAAAG 


ATGATGTTGA 


TNGATTTGAA 


480 


AGAAAATCAA 


AAACTGGATG 


CGGGAGACTA 


CAATTTCTTA 


ACTAGAACTA 


TCAATCAAAG 


540 


AGTAGCTCAC 


ATNCAAAGAC 


TACATGCGAT 


AACAAATCAA 


AAACAACGTA 


GCGAATTATT 


600 


CNGGGATATT 


AATTCAGAAG 


TGAAAAAGAT 


GACTGGTGCA 


AGTTCAAGAA 


CGAATGTTAG 


660 


AC 












662 



(2) INFORMATION FOR SEQ ID NO: 986: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1112 base pairs 

(B) TYPE: nucleic acid 
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{ C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986: 



NTGATTAATT 


ATTTCCTGGA 


AAATATGTGG 


TTGCATTAAA 


TACACAGAGT 


GAGAGATACA 


60 


AACTATTTAC 


GCACGACTAA 


CATTTCTGAG 


TGTTTAAATT 


AAATTTGTAT 


TAATATGATT 


120 


GGCGCAATTT 


GNTGATACGC 


AAAAATGTTT 


CTCGTGAAAC 


TTAGATTTAG 


CTTATAGTTT 


180 


TATCATCATT 


TGTATGACTT 


ACATTATAAA 


TTTTATTATA 


ATGAGGTTAA 


CGCTTTGAAA 


240 


GGAGTCATCA 


TCANGTCGAC 


CAATAAAAAC 


GATTATGAGC 


ATATGTTGGT 


TTTATTGTGC 


300 


ATATAAAACG 


TTTATTACTA 


CCGCTGATGA 


AATTATAGAG 


AAGTATGGTA 


TGAGCCGTCA 


360 


GCATCATCGT 


TTTTTGTTTT 


TTATCAATAA 


ATTACCTGGT 


ATTACTATTA 


AATCATTACT 


420 


AGAAATATTA 


GAAATTTCTA 


AACAAGGATC 


ACATGCAACA 


CTTCAAAAAT 


TAAAAGAGCA 


480 


AGGTCTCATT 


ATTGAAAAAG 


TTTTAGAGAC 


TGATCGACGT 


GTCAAAAAAT 


TATATTCGAC 


540 


GGATAAAGGC 


GATCAACTCA 


TTGCTGAATT 


GAACAAGGCG 


CAAGATGAAT 


TATTGCAAAA 


600 


TATATATCAA 


CAAGTCGGTT 


CGGATTGGTA 


TGATGTGATG 


GAAGCATTGG 


CTAAACGGCG 


660 


ACCTGGCTTT 


GATTTTATTA 


AGCATTTGAA 


AGATGAAAAA 


GAAAGCTAGC 


ATCAGAAATG 


720 


TTAAAAATCT 


TCGCATTCTT 


AAATTTAAAA 


AATATGTCAA 


AAAGTGTATA 


ATAAAAACAT 


780 


ATAATTTAAT 


TGAACTCAGT 


TTCAACACAT 


CTTAGAAAGG 


AGTNTGAATG 


ATGAAAAAAT 


840 


TAGCAGTTAT 


TTTAACATTA 


GTTGGCGGTT 


TATACTACGC 


ATTTAAAAAA 


TACCAAGAAC 


900 


GTNTTAACCA 


AGCACCTAAC 


ATTGAGTACT 


AAATTAACCA 


TAAAAAATTC 


CCGAACACCT 


960 


TGNTATAGTG 


CTCCGGGNAA 


TTTNNNTATG 


CTTNACTTGA 


ATATTTCAAA 


TATTATTTGT 


1020 


GCGCTTTCTT 


GTATTTTCCG 


ATATTACACT 


AATGATCTGA 


TCTTNGGCCC 


CCNAGCGTAG 


1080 


GTATTAACAA 


TCTCGCTGTA 


TTTTCCACCG 


GA 






1112 



(2) INFORMATION FOR SEQ ID NO: 987: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987: 

CTTATGGATT CACATCTAAG TTACATAATA TATATTAATA GGAAGTAATG TGTGTGACCC 60 

CAAAGGTACC CTACATCATA TCATTATCTA ATATCGTCAC ATAACTTACT TATGCTATAA 120 

TCATGTATTA TATTGTTTGG AGTGATTTGA TGAGATTTGT CTTTGATATT GATGGTACGC 180 

TTTGTTTTGA CGGCTGATTA ATTGACCAGA CTATTATTGA TACATTGTNA CAATTACAAC 240 
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ATGATGGTLA 


TGAACTAATA 


1 1 rGCATCAla 




IVuiuAX 1 1 Vj 


1 luCLAul 1 1_ 


inn 


TACCATCAGT 


ATTTCATCCA 


GCACACAl 1 A 


All IxAitk-ViCA 


x T\t/~"TVTV?0'T w TV > 

AWvj 1 ITjKjI IXf 


LlAluAl 1 1L. 


JOU 


ACAGCCNATC 


CAAAGATTTC 


TGTTATCCNN 


KT/">/"» » A T*TV^ AT» 

NL L.AAI IxJA l 


AV-AALs A 1 ACA 


T» TV TV* A f~*(~' A A 

1 A 1\. AUCAl A 




TCTTAAAAAT 


AATTCAAAAG 


TATGAGTTAG 


ATTATATTAT 


ITV"^ TV fTV^ TV fT/V* TV #T» 

TGATGATGAT 


rrv a a m u n iv frv* 

TGGAATTATG 


480 


CTGCACAACT 


TGACGCTGAG 


ik ik ik irMiMii 

AACGCGATTT 


TTGAGCGTTT 


m /"i ik nvws* ik ik m 

AGATCCACAT 


AAGCTGGCCA 


540 


GTTGTATTGA 


n i^imww'iA ik ik ik m 

TGTTGCAAAT 


ATCGACACAC 


/~» m tv rrv* tv 7v iv m 

CAATCAAGAT 


CATTTTATTA 


AATATAGACC 


600 


CGGCACAAAT 


m A /"* A mA 

TACAACTATA 




A A AT»A A Tkrp JV 

1 AAA 1 AAA 1 A 


r*C* A fTV^* A A /"» A A 

CLATLAAGAA 


TTLiGNAATGA 


c r r\ 

ddO 


TTCACCA I Iv- 


AAA iviAvjl A 1 


AALA1 ITjAIA 


1 AAL^Ao^.u\UA 


A 7A A T» 7V *T*T» A A *"P 

AAA1A1 X AA1 


A. A A IT A A r'rrv^ 

AAAIATACTG 


Tj r\ 
/ Z\) 


CATTACAATA 


TATTTTTGAT 


TV A nv^fTVp Jv. 


A. J\rniv rpjv rpA 

AA1AIATAGC 


ATTTGGGTAA 


A ik ik ik m 

TGACCACAAT 


780 


GATATTTTCA 


TGTTACAACA 


TGCTAGTAGG 


TGGCTATATT 


ATAGGACCAT 


CAGAAGCATA 


840 


CACACACGCA 


ATATTGAAAC 


TTGATAAAAT 


CAAATACATC 


AATAATAATA 


CACAAGCTAT 


900 


TTGCAAAGTC 


TTAAAATCAT 


ATAAATAAAA 


GCACCCCTAT 


CAAATGATAA 


TCATTATCAA 


960 


TCGATAGGGG 


CTATTTTAAT 


AAAATTCGTC 


CTCGAACATT 


TCTTCCNCTT 


CATCTAATCC 


1020 


AAATAATTCT 


GCCATTTCTC 


CATGTTCAAT 


TAACATGTTT 


AAATATGCAN 


CGCNCGTGCC 


1080 



(2) INFORMATION FOR SEQ ID NO: 988: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pe.irs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988: 



CCTGCATTGA 


TTATTTTTAT 


TTTCAGTTTT 


AATAAAAATG 


AAGGTGTAAC 


ACCTGCGAGT 


60 


ATGGTATTAA 


TAGGTGTAGG 


TTTACAAACA 


GCATTATATG 


GGTGGCTCAA 


TTACAATTAT 


120 


GTCAAAATTT 


GATGATAACC 


AATCTGATTT 


CATCCCTGCT 


TGGTTTGCAG 


GTAATATTTG 


180 


GGGTGACGAA 


TGCCCATTTG 


TCATTGCATT 


TTTACCGTGG 


GTGTTGATTA 


TTATTCCTTA 


240 


CTTACTATTT 


AAATCGAATA 


CACTAAATAT 


TATTCATACG 


GGTGATAATA 


TTGCACGAGG 


300 


TCTAGGTGTA 


AGGTTAAGCA 


GAGAACGTTT 


AATATTATTC 


TTTATCGCAG 


TGATGTTATC 


360 


ATCTGCTGCT 


GTAGCAGTAG 


CAGGTTCAAT 


TTCGTTTATC 


GGATTAATGG 


GTCCGCATAT 


420 


TGCCAAACGT 


ATCGTTGGAC 


CACGTCACCA 


GTTGTTTTTA 


CCAATTGCCA 


TTTTAGTAGG 


480 


GGCATGTTTA 


CTTGTTATAG 


CTCGTGCCAT 


TGGCCAAAAT 


TTGTATTACA 


ACCAGGTGGG 


540 


GTTCCAGCAG 


GTATTGTCGT 


AGCAATTATT 


GGTGCACCGT 


ATTTCTTATA 


TTTAATGTAC 


600 


AAAACGAAAA 


ATGTATAGTG 


TCAAAGGACA 


CAACTTATTG 


CTATGAAAGG 


CACTTTATTA 


660 


TAAGGCTTTT 


CATAGCATTT 


TTTATTTAAT 


GAGCCACTCA 


AGACTATTTA 


TTTTTTCAAT 


720 


AATGAACCAT 


TAAGTTATCA 


AGAGGATCTT 


ATCaaAAATA 


TATTTGATAA 


CGGTATCAGG 


780 


TTAATTCTTT 


ATGATAGCGC 


ATTCATTTAT 


TCTGTTTTAT 


ACTATGACTG 


ATAATACCAA 


840 


GGAGGTCTCG 


TGCC 










854 
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(2) INFORMATION FOR SEQ ID NO: 989: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989: 



CATCACCCAG 


AGCGAATAAA 


AGAGATTGTA 


GCGGCCGTTA 


AAGAGGNGGG 


TAAATATCGT 


60 


ATTGTAGACC 


ATCAACGTAA 


TGTCACAGTA 


TATGTGATGT 


GTAGGCATTT 


ATGTATCTTT 


120 


AGTTATGGAT 


GNATCTAAAA 


CATCAGNCGC 


TAGTGGTGGA 


GACTGTNGTA 


TTACAATTAT 


180 


CAATANAGAT 


GTAGATAAAG 


AANAAATTTA 


TGATGAATGG 


ACAAAACATG 


GTATTAAACC 


240 


ATTAAAATTT 


AATATTTATC 


ATGGGCAATA 


AATGATTTGA 


TAGGGGAGCA ATCATCTAAT 


300 


TTAATTAGAA 


GATAAATGCT 


CCATTTTTTA 


TTGCCAGCCG 


TGTTACGTAT 


CTTGAAAAAA 


360 


ATATTAAAAA 


GAGTAAAATA 


GATAGAGTTA 


ACGAAAAAAT 


TAATGAAATC 


GACGATATAG 


420 


ATATGATAAA 


GAAAGGTGGG 


TAGCAATATG 


AAAAATACAT 


TCCTTATTTG 


TGATGAATGT 


480 


CAGGCAGTCA 


ATATAAGAAC 


GTTACAAAAG 


AAGTTGGAAA 


AATTAGATCC 


CGATGCTGAA 


540 


ATCGTGATAG 


GTTGTCAATC 


TTATTGTGGA 


CCTGGACGCC 


GAAAAACATT 


CACTTTTGTT 


600 


AATAACCGCC 


CACTGGCTGC 


GCTTACTGAA 


GAAGAAG 






637 



(2) INFORMATION FOR SEQ ID NO: 990: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990: 



60 



TAATTCGCAA TAGGAGTGAT GAATATCATA AATTTTACCC TCCAAATGAA GCTAATGAAG 

TCCTGGACCC GAGTAAGACG CATGTAGCCA AGCTAAAATA ATCCACTCTA CCTTATCTTT 120 

AGTTAATAAT GTTACTAAAT GTTGTTCATA CGCTGCTTTT GAATCAAATT GTTTTGGTTC 180 

ATTAATATAA ACAGGAATAT CGTGCTTGTT TGCTCTATCT ATACAAAACG CATTTTGATG 240 

ATCCGTATAT AGCNCCGTAA CTTCAATATT TTCAAGTTTT CCTGATTCAA CATGCTCAAC 300 

TATATTTTCA AAGTTACTTC CTGAACCTGA TGCAAAAATC GCAATTTTAA CCATTGTTAT 360 

ACCCCCAACA ATTCAATTGC AGTTGACTCA TTTTTCACAA TATGACCAAT TTGATAAGCT 420 
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TCCACATTTT GTTCTGCTAA AATCTTCAAA GCGCGTCGAT GCATCTTTTT CATCAACGAT 480 

AACCGTATAG CCAATACCCA TGTTAAAAAT GTTATACATT TCATTTGTGT CTATATTGCC 540 

TTGTTGTTGT AACCAATCAA ATATTTTTGG CGTTGGAAAT GATGTAGTAT CAATTCTAGC 600 

AGCATATCCG GCTGGCAATG CACGTGGAAT ATTTTCATAA AAACCTCCAC CAGTAATATG 660 

ATTCATTGCC TTAATAGAAA CTTCTTTTTT TAAAGCAAGT ACAGGTNTGA CATATAATTT 720 

AGTTGGCTCT AAAAAGACAT CTATAAATGG ACGATTATCG NAGGGTGATG CCAAATCAAT 780 

GNCTGATTCA NTAATTAATN TGCGCACTAA ACTGTNTCCA TTNGANTGAA TGNCACTTGG 840 

ACGCAAGTCC TATAACAACT TGGCCCTCTT NCAATTCTTG AACCATCTTA CAATAGNCAA 900 

CCTTTTTCAA CTGCTCCAAC AGCAAATCCG GCTACATCAT ATTCACCTTC GTGATACATT 960 



(2) INFORMATION FOR SEQ ID NO: 991: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991: 



GGCACGAGCT 


CGTGCCGAAT 


TCGGCACGAG 


ACTTCAAATC 


AATCGAGTGG 


CATGTCAAGG 


60 


TCATATCAAT 


ATTTTAGAAT 


CTGCGACTAT 


GAGAGAGGAA 


ATAAATGAAA 


TGGCACGAGC 


120 


GACGTATCAT 


CGTTGATATT 


CGTGATAAGC 


AATTACGATA 


TCAAGATATT 


GCAATTTTAT 


180 


ATCGTGACGA 


GTCTTATGCT 


TATTTATTTG 


ATTCCATATT 


ACCGCTTTAT 


AATATTCCTT 


240 


ATAACATTGA 


TACAAAGCGT 


TCGATGACAC 


ATCATCCGGT 


CATGGAAATG 


ATTCGTTCAT 


300 


TGATTGAAGT 


TATTCAATCT 


AATTGGCAAG 


TGAATCCAAT 


GCTACGCTTA 


TTGAAGACTG 


360 


ATGTGTTAAC 


GGCATCATAT 


CTAAAAAGTG 


CATACTTAGT 


TGATTTACTT 


GAAAATTTTG 


420 


TACTTGAACG 


TGGTATATAC 


GGTAAACGTT 


GGTTAGATGA 


TGAGCTATTT 


AATGTCGAAC 


480 


ATTTTAGCAA 


AATGGGGCGT 


AAAGCGCATA 


AACTGCACCG 


AAGATGAACG 


TAACACATTT 


540 


GAACAAGTCG 


TTAAGTTAAA 


GAAAGATGTC 


ATTGATAAAA 


TTTTACATTT 


TTGAAAAGCA 


600 


AATGTCACAA 


GCGGAAACTG 


TAAAAGACTT 


TGCAACTGCT 


TTTTATGAAA 


GTATGGAATA 


660 


TTCGAACTGC 


CAATCATTGA 


TGACAGAGCG 


AGATGACTGA 


TTAATGTATC 


ATGAAAGCGA 


720 


GAATGATCAT 


ATGATGCTAT 


CATCCTGACG 


ACT 






753 



(2) INFORMATION FOR SEQ ID NO: 992: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992: 



CAATCT ACTT 


AATCGTAGCC 


GTTTTTGGAA 


ATTGTATGTT 


GTGGNTTAAT 


CCTNTTTCCA 


60 


CAAAATTTGT 


AATNCCCGTA 


ATAATCTTTT 


TTATTGTTTT 


TGAAGGGGGC 


CNATTGGAAG 


120 


CTAAGTATAA 


TAAACCANAA 


GACACAAGCA 


ACTTGTAAAT 


GTGATGGGTA 


AGATACATAA 


180 


GAAAGGTGAG 


TTACCCTTTG 


CCAACCACCA 


TTATAAGACA 


TAAGCACAAG 


ATGATACCCA 


240 


ATATTAATCC 


ACANATAACA 


CCGAATGTAT 


TCGTACGTTT 


AGTGNAAATA 


CCCANCTGCA 


300 


NATACACCAG 


CTAATGGAAC 


ACCGAATAAT 


CCAGTTACCA 


AACAAGAATA 


AATCCCATAA 


360 


GTCATTTGTA 


TTAGAAGCAA 


TTAAGTATAG 


TGACATTCCN 


CAAACCGAAA 


ATACCTGCAA 


420 


TGATGATAAT 


GAAACGTGCA 


AAGTTAACTT 


CGTGTCGCTC 


GCTATCTTTT 


CCGAAGAAGC 


480 


GTTGCTTAAT 


GTCGATTGAA 


ATACAAGCAG 


ATATAGAATT 


TAAACTAGAT 


GAAATGGTAG 


540 


ACTGTGCAGC 


GGCGAAAATG 


GCTGCAATAA 


GTAATCCTGC 


TACAAATGGT 


GGCATCTCAG 


600 


TCAAAATGAA 


ATATGGCACT 


ACAGATGATG 


TATTGAAGCC 


TTTTGGTAAA 


ACAGCTTCAT 


660 


GTGTATAAAA 


TGAATACAGC 


ATTGTACCCA 


T AC CAT AAAA 


TAAGGGTGCT 


GAAATTAAAG 


720 


CTAGGATACC 


ATTTGTCCAT 


AACGATTTAT 


TTGTTTCTTT 


TAAACTATCA 


GAAGCTTGAT 


780 


AACGCTGCAC 


GACGTCTTGA 


CTCGCTGTGT 


ATTGATACAA 


GTTGTTGAAA 


ATATTTCCTA 


840 


GGAAAATAAT 


TGGAATGGCA 


GCTGCCGCAG 


TATTTAGTTT 


CCAATTGTCT 


GCACTAATTA 


900 


ATTTTTTGTG 


CTCAATCGCA 


TCTGCAAAGA 


CAGTGCCAAA 


ACCGCCTTTA 


AG 


952 



(2) INFORMATION FOR SEQ ID NO: 993: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993: 



TTTTCGCAGG 


ATTCCTTTCA 


AGCCTAAGAA 


TTGGGCCCAN 


TTGGGAAGAA 


ATGNATGGGT 


60 


GCCCGTAAAC 


CTGCCGGACC 


GCATTGAACC 


AAGCATTCCT 


AAAATACCCA 


ACAAATATAG 


120 


CGCCATNNNN 


AAANACANCC 


CTGCTACCCG 


TAAGTTATGA 


ATATCTCATA 


ACATTCAAGC 


180 


TAGCAGTTCT 


TTTTATTTTA 


AAGGTTTAAA 


TAGCAGNCTG 


TACCCACGCA 


TAACTGTAAT 


240 


TAATTGTTCT 


NTGTCATTCT 


TAATTTTTAT 


ATCCCATACA 


TGTGTCGACT 


TACCTCGATG 


300 


AATAATTTCA 


GCTGTCGCAG 


TAACACGACC 


ATCTTTAGCA 


GAATGAATAT 


GGTTAGCATT 


360 


CATCTCTAAA 


CCTAATGGAA 


TAAATTTGGT 


TGTATCAATT 


AAATTAGCAG 


ATCCTAATGA 


420 


ACATGCTGTT 


TCACCTAAAG 


CAATCGAAGC 


CCCACCATGT 


AAATATCCAA 


ATGGTTGTTT 


480 


TACTTTATCA 


GTGACAGGCA 


TAGAAATAAC 


AACTAAACCA 


TCTTCCTGGT 


GATCTATTGA 


540 
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CATCTCAAAT GTCTCTAATA AATGAGTCAT ATTTTCACCT CTTTATGCTT ATAGTACCAT 600 
AGCAGCAATA GTACCGAAAA TCATTAAAGG AATATTATAA AACAAGAAGT TTGGTACACA 660 
TGTATCGCTC GTGCC 675 

(2) INFORMATION FOR SEQ ID NO: 994: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994: 



AGGGGCNGTN 


TTTTTGGAAA 


CCNGGTCACG 


ANCTCACATG 


GGTTAAGNTG 


ATCANCACCT 


60 


TTGGTGTTTT 


AGGNTTAAAA 


ACAATTTACC 


TATTATCGCG 


CCAATTGCAC 


TTATCACGCT 


120 


TTTTAATATA 


TTGATTCATT 


TGGGCTTTGC 


GAGTGCAACA 


AGTTTATCAT 


CAGCGTTAAT 


180 


ACCTGTTTTT 


ATTTCGCTAA 


CCTCTACGTT 


ACACTTAGGA 


GACCAGTCTA 


TAGGATTTGT 


240 


TTTAATTCAA 


CAATTTGTTA 


TTAGTTTTGG 


TTTCTTATTA 


CCTGTTAGTG 


CACCTCAAAA 


300 


TATGTTGGCT 


TATGGCACTG 


GTACTTTTAC 


GGGTAAAGAT 


TTCTTGAAGG 


CAGGTATACC 


360 


ATTGACAATT 


GTAGGGTATA 


TTCTAGTGAT 


AGTTTTTAGT 


ATGACTTATT 


GGAAATGGTT 


420 


AGGTTTGCTT 


TAATTAAAAA 


TATAAATAAG 


AATCTAGGTT 


ATTTTAAAGT 


GACAAAAAGC 


480 


TTAATAAAAT 


AAAAAGATAA 


TTGAAGGGTG 


TTTTGTTTAT 


GGCAATTGCT 


GTGTTATTAA 


540 


ATCGNATGTT 


TCGNATGGGA 


CACNATCCAT 


TATTTGGATA 


TATTTATCCA 


CCAAAAGGAG 


600 


ACCTTGGTGC 


ATGTTATTTT 


ATCATTCCGG 


GAGGGGACAT 


GTCTTCAGCT 


TCTGATTTGT 


660 


TAGCACATTT 




GCC 








683 



(2) INFORMATION FOR SEQ ID NO: 995: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995: 



CTGCATCTGG TGAAGGAAAT CAANTANNTN TTATNNCTGA AGTGATTGCA TGGTGCAACN 
TCAAATATCA CATATGGCTC AGTGGANTAT ATGGATAAAG GCTTAACAGG TCATATCATG 

859 
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CGGCGTGGTA 


TTACTGAAGC 


GGATGCCTCA 


ATTAATTGGG 


CACTAGGTTT 


AATGAATGAG 


180 


GGTAGCCAAA 


TTATTGATAA 


TACAACAAAT 


TTATGTGGTG 


ATCGCACAAC 


AAGNNCACTT 


240 


AAATCAGGAG 


GTGNAGGTAC 


AGGAGAACAA 


AAAATTAATC 


TAACATCTAA 


ANTCGCACAA 


300 


itmnm^s tiff* Ik « 

ATATGGTAAA 


GAAACAGATG 


GTTATNTCCT 


TAAACATGGT 


GTTATGAAAG 


AACATGCATC 


360 


ATCTTGTATT 


TTAATGGGTA 


TCCGGCTACA 


TTANGCATGG 


GGGGAACTAA 


ATCAAGTGCT 


420 


AATCAGGAAT 


CACGTGTTAT 


TNATGTTATC 


TTGAACATGC 


TCGGGGGNGA 


CGCGAATCCT 


480 


ATTTTATTTA 


ATTGANGAAG 


ATGATGTACA 


AGCTGGTCAT 


GCTGCATCAG 


TAGGCCGTGT 


540 


TGGATCCCAG 


ATCAACTTTT 


ACTATTTAAT 


GAGTCGTGGT 


ATTTCTCAAA 


GAGAAGCGGA 


600 


ACGTCTTGTT 


ATACAGGGTT 


TCTTAGATCC 


AGTAGTACGT 


GAATTACCTA 


TCGAAGACGT 


660 


TAAACGTCAA 


TTGAGAGAAG 


TAATTGAACG 


CAAAGTTTCT 


AAATAATATT 


TTGAAAATAA 


720 


AAGTTTGTAA 


TAGATATAGA 


CTGTCGATAT 


TGGTATAAGA 


CTAATACAAC 


GTCAGTATTT 


780 


AAATGATTAG 


GATTTTTATT 


TAAGAAAGGT 


CGTGAATGAA 


GTGGCCGAAC 


ACTCATTTGA 


840 


GGTTNATGAA 


GTAATCAAGG 


ATTTTCCGAT 


ATTAGATCAA 


AAAGTCCATG 


GCAAACGTTT 


900 


AGCATATCTT 


GATTCAACAG 


CGACAAGTCC 


AACGCCTATG 


CCAGTGTTAA 


ATGTT 


955 



(2) INFORMATION FOR SEQ ID NO: 996: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996: 



AAACCCCTTC 


GGATTANAAA 


TTGGANCCCA 


GGATTCTTTT 


GTTTATCACA 


CAAGATTTAA 


60 


AAAAAGATTG 


CACATTGTCT 


TGCTGTCTAA 


NTTAATTACA 


TTAACACCTG 


GTACCTGTAG 


120 


TGCTTGGTGT 


AAGTGATGAT 


CGCACACAAA 


TTTATATTCA 


TGCCATAGAT 


TTTTCANCTA 


180 


AAGAGCAAGA 


AGTAGAAAGT 


ATTAAAACAT 


CATTAGAGAA AATTGTAAGA 


GAGGTGGGCG 


240 


AAATATAATG 


AATCATAATG 


TTATTATCGT 


TATTGCATTA 


ATCATAGTTG 


TCATTTCTAT 


300 


GTTAGCTATG 


CTCATTCGCG 


TTGTGCTAGG 


CCCATCACTT 


GCCGATCGTG 


TTGTCGCATT 


360 


AGATGCGATT 


GGTCTTCAAT 


TAATGGCAGT 


TAGAGCATTA 


TTCAGTATTT 


TATTAAATAT 


420 


TAAATACATG 


ATTGTCGTTA 


TTATGATGAT 


TGGTATATTA 


GCCTTTTTAG 


GTACTGCAGT 


480 


ATTCTCTAAA 


TTTATGGACA 


AAGGTAAGGT 


GATTGAACAT 


GATCAAAATC 


ATACTGATTA 


540 


GTCTTGCACT 


TATCTTTGGT 


ATCATCGGCG 


CTTTAATTAG 


CGCCCTAGCA 


GCTATAGGAT 


600 


TATTGAGACT 


TGAAGATGTA 


TATTCACGTN 


CACATGCTGG 


CGGAAAAGCA 


TCAACATTAG 


660 


GTGCAATGTC 


AATACTATTT 


GGGAC 








685 



(2) INFORMATION FOR SEQ ID NO: 997: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997: 



GGCACGAGCT 


ATTTGGACAT 


TCTTAAGTAC 


ANTCTGGAAC 


ACGATAATTA 


CAATCGCTAC 


60 


TACGATTTGG 


AATTTGTTAG 


TCACTGTAAT 


AACTACAGTA 


TTTACCACAA 


TTATGACTAT 


120 


CGCAATGACA 


ATTTGGAACG 


CTATTTGGAC 


GTTCTNACAA 


ACGTTGTGGA 


ACACTATAGT 


180 


TACTGTGGCA 


ACTAAGGTTT 


GGAACGCTAT 


CACTACAGCT 


AT ATC T ACTG 


CATTACAAGC 


240 


GGCATGGAGT 


TTTATTTCTA 


ATATATGGAA 


TACGATTTGG 


AGTTTCTTAT 


CTAGTATATT 


300 


AACGACAATT 


TGGAATAAAG 


TTGTAAGCAT 


ATTCACACAA 


GTTGTATCAA 


CTATATCAGA 


360 


CAAAATGTCT 


CAAGCTTGGA 


ACTTCATTGT 


CACTAAAGGT 


ATGCAATGGG 


TATCTACTAT 


420 


AACAAGTACG 


CTAATTAACT 


TTGTTAATAG 


AGTTATTCAA 


GGATTCGGTA 


ATGGTTGTTA 


480 


ACANAGTTAG 


TCAAGGTATG 


ACANATGCAG 


TAAATAAAAT 


AAAAAGCTTT 


ATAGGAGATT 


540 




AGGTGCAGAT 


ATGATCCGTG 


GNTTAATTAG 


AGGTATTGGA 


CAAATGGCTG 


600 


GCCAATTAGT 


AGATGCAGCT 


AAAAATGTTG 


CTAAGAAAGC 


TTTAGATGCA 


GCTAAAAGTG 


660 


CTTTGGGTAT 


TCACTCACCN 


TCACGT 








686 



(2) INFORMATION FOR SEQ ID NO: 998: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998: 



ATTAATTTAT 


GGGCCCAAAA 


GACCATGAAT 


ACTTGAAATT 


GAAGTCGCCC 


AAACCCAATT 


60 


GCACCAAAGA 


TCCCAGGATT 


GAAACATCAC 


GGTTTAAGAT 


ATGTCAATTA 


TTATCCACCN 


120 


AACTTACCCC 


CAAACNGCAT 


TAACCACAAC 


ATGATATGGG 


AAATTACTAA 


TAATGGAAGA 


180 


TGGGAATACC 


TAAATACATT 


TGTTCCAAGC 


TAAGAAATTN 


ACTTTTCTTT 


GTCGTTTTGC 


240 


AAGGCGCTCT 


AAGCTTATCC 


AAATTGGGGC 


TAAGCATATT 


ATTACAGGCG 


TATGTGAAAC 


300 


AGACTTTTCA 


GGCTACCCAG 


ACTGTCGCGA 


TAGTTTTATA 


AAATCAATGA ACGTAACATT 


360 


AAGCCTAGCT 


ATGGACAAAG 


ATTTTGTCAT 


TCATACTCCT 


TTAATGTGGT 


TAAACAAAGC 


420 


AGAAACGTGG 


AAATTAAGTG 


ATGAACTCGA 


AGTTTTAGAT 


TATATTCGTA 


CAAAAACATT 


480 


AACATGCTAT 


AACGGTATCA 


TTGGGGATGG 


CTGTGGTGAA 


TGTCCAGCTT GTCATTTACG 


540 
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TCAACGTGGA CTAAATCAAT ATCTTGAAAG TAAAGGAGCG CNTTAAAAGG TTACAACAAA 600 

TCTATCCTAG TACAACGCAT CCATATCAAT TCGAATTAAA TAAAGATTTN AATTTCTCGG 660 

CTGCACATCA CATTCCTTGT GAAGAAGCAG GTATTTGGCA AAATGTCCAT GGTCATACTT 720 

ACTNTGGTTA ATTTAACAAT TGNCGGGTGG ATAAACTAGA TGACACTGGC TTCTTAGGNG 780 

ACTTTTAGCC CATNTGGAAA NGATGATACA CGGTAAATTT GACCATCAAC TGTTAAATAA 840 

CTTACCTGCT TTTAAAAACA AACTCCCTCA ACTGAAATCG GG 882 



(2) INFORMATION FOR SEQ ID NO:999: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999: 



CTAGATTGGT 


GTTATATTNT 


TGATATGATA 


AGAAGAGATG 


TAAGAGTAGG 


GATAAATACA 


60 


ATTGAGGTGA 


ACCCATGTTA 


ACGACAGAAA 


AACTAGTTGA 


AACATTAAAG 


TTAGATTTAA 


120 


TCGCTGGTGA 


AGAAGGACTA 


TCGAAGCCAA 


TTAAAAATGC 


TGATATATCA 


AGACCAGGCT 


180 


TAGAGATGGC 


AGGTTATTTN 


NCACATTATG 


CGTCAGATAG 


AATACAACTA 


TTAGGAACAA 


240 


CGGAACTATC 


GTTTTACAAT 


TTATTACCAG 


ATAAGGATCG 


CGCAGGTCGT 


ATGCGTAAAC 


300 


TATGCAGACC 


AGAAACGCCT 


GCAATTATTG 


TGACACGTGG 


ATTGCAGCCA 


CCAGAAGAAT 


360 


TAGTTGAAGC 


TGCAAAAGAA 


TTAAATACCC 


CACTTATAGT 


TGCTAAAGAT 


GCGACTACAA 


420 


GTTTAATGAG 


TCGCTTAACA 


ACGTTTTTAG 


AGCATGCACT 


TGCAAAGACG 


ACATCTTTAC 


480 


ATGGTGTTTT 


AGTAGATGTT 


TACGGTGTTG 


GGTGTACTTA 


ATTACCGGTG 


ATTCAAGGGA 


540 


ATAGGGTAAA 


AGTGGAGACT 


GCGGTTGGGA 


ATTAGGTTAA 


ACGGTGGGCA 


TTAGATTACG 


600 


TAGCAGATTG 


ATAATGTNGA 


AATACGTCAA 


ATTAATAAAG 


ATGAACTTAA 


TAGGGAAACC 


660 


ACCAAAGGTA 


ATAGAACATC 


TATTTAGAAA 


TACGGTGGAC 


TAGGTATTAT 


CAATGTTATG 


720 


ACTTTATTTG 


GCGCGGGTTC 


AATATTAACT 


GAAAAACGAA 


TTAGATTAAA 


TATTAATTTG 


780 


GAAAACTGGG 


ACAAGCAAAA 


GTTATATGAC 


CGCTCGTGCC 






820 



(2) INFORMATION FOR SEQ ID NO: 1000: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000: 



GGCACGAGCG 






/IVj 1 1 nAU 1 1 


A /~* JV T» JV rp/r»x.T A A 


GAGATTCATC 


60 




A Af2T* A hTPfT 1 


/"* 7A JA /"^fM^OTV* 

LAAu I Not 4\~ 


LA1AL I\, IvMj 


TGCCATGTTT 


GCAATTGTNG 


120 




TAGTGGNAAA 


TGTTGTACAC 


CTGGACCAAA 


GAANTCCACA 


AATTTACCAA 


180 


AAACACCTTT 


TCTACGTAGC 


TCTTGAGTTA 


CTCTTAACGC 


ATAAATCAGT 


TGCTGTTGCG 


240 


CCTTGTGGTA 


ATGAATNTAC 


TAGTCGTACA 


CCAATAACCT 


CTGGAATTGG 


GAAATAAGAA 


300 


GGTTGTCCAA 


GCATTCCAGC 


TTCAGNTTCA 


ATACCACCAA 


CACCCCATCC 


TAGTACGCNA 


360 


ANNCCATGTA 


TCATTGTTGT 


ATGTTGAATC 


AGTNCCAACT 


AAATGTTATC 


TGGAAATGCA 


420 


GTTTTTTCAC 


CATCTACATC 


CACGAACATG 


TTACAACACT 


GTGCTAAATA 


TTCCTAAGTT 


480 


AACTTGGTGA 


AGTATTCCAA 


TTGCAGGAAG 


AACTGCATCG 


GTAATTATCA AATGCTTTCC 


540 


GTTGCCCAAT 


TTAA 










554 



(2) INFORMATION FOR SEQ ID NO:1001; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001: 



ACTTTACNTC 


CAATTGGTAC 


CAACATGGTC 


ATGGAGCTAT 


GAAACAAGCT 


AATGGTCCAA 


60 


ACCTGCTGGT 


TTCTCCAAAC 


GTTGCAGGTT 


GCAAGACTAC 


ACTACTTCAA 


TACACCCAAG 


120 


GTTCAAGATG 


TACAATCAGT 


AAGCTACAAT 


GCACAATCAA 


GTAACTCANA 


CGTTGAAGCT 


180 


GTTTCAAGCT 


CCAACTTACC 


ATAACTACAG 


CACTTCAACT 


ACTTCAAGTT 


CAGTGAGATT 


240 


AAGCAATGGT 


AATACTGCAG 


GTGCTACTGG 


TTCATCAGCA 


GCTCAAATCA TGGCTCAACG 


300 


TACTGGTGTT 


TCAGCTTCTA 


CATGGGCTGC 


AATCATCGCT 


CGTGAATCAA 


ATGGTCAAGT 


360 


AAATGCTTAC 


AACCCATCAG 


GTGCTTCAGG 


TTTATTCCAA 


ACTATGCCAG 


GTTGGGGTCC 


420 


GACAAACACT 


GGTGACCAAC 


AAATCAACGC 


AGNTGTTAAA 


GCATACAAAG 


CACAAGGTTT 


480 


AGGTGCTTGG 


GGATTCTAAT 


TCCTTCATAA 


TANATAAAAG 


TAATGTTTAG 


ACATTAAATA 


540 


TATAGCTCTT 


CCCATAGAAA 


TTGGAGAGAG 


TTTTNTTATT 


AATTAAAAAT 


ACATTTCTTA 


600 


AAAAGTGGAT 


ATTGANGTTA 


GGGAAATTTA 


TTGGGACTTT 


ATTGTATTAC 


TGCTAAAAAT 


660 


ACTTGGTCTT 


TATTAANGGT 


GCAANGTGGG 


GAATAAAACT 


TCATGTGAGG 


CTGGTNCTGN 


720 


GTACAACTAG 


ATACNAACTA 


AACGCTCGTG 


CC 






752 



(2) INFORMATION FOR SEQ ID NO: 1002: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002: 



ANCTACCTTG 


TCACTTTTNT 


AGAAAACATA 


TTAGTAATAA 


GTTCTTCATA 


TGTATGTGTA 


60 


TTCATATTCA 


TCAATGCGCT 


AGTAGCACCT 


ATTGCAACCA 


TGTTTGGCAC 


GAATGCTCGT 


120 


ACCTAATTCT 


TTAGCGGTTG 


CTGTAAAAGG 


TAATTCAATA 


AGCTGTGCAT 


GACATCCTTC 


180 


TGGTTTCACA 


GGTTTAGCCT 


TGGCATCAGC 


TAAAATAATA 


CTGTCTTCTC 


TCATTTCATG 


240 


ATGGTTAACA 


TCAATTGTTT 


CTTGGTCAAA 


TGCAATCAAA 


ATATCTAAAT 


CATCACTAAT 


300 


TGCATGAACA 


GGCGTCGTAG 


AAACTCTAAT 


TTTATTATTC 


GTATGTCCAC 


CTTTGATACG 


360 


ACTTGAAAAA 


TGTCTATATC 


CATATAAATA 


ATATCCTTTT 


CTATTCATAG 


CCGTAGCGAA 


420 


NATTTCCCCA 


GTTGATTCAA 


TACCTTCGCC 


TTGTTGACCG 


CCAACTTTCC 


ACGATAATTG 


480 


TGGTTTCATA 


GTTATATGCC 


TCCTATGGAT 


ATATTTCATA 


TTAAATGATA 


ACAAAAAGAA 


540 


CCACCGATTT 


ACTATGTAAA 


CAGTGGTACT 


ATATTAAATT 


CATACTAATG 


GACGAATAGG 


600 


GTCCAATTCC 


GAATGTTTTA 


CTTACGTTAA 


TTTTTAAAAT 


GTTTGNAAAT 


GGATGGTCAT 


660 


CATTTATCAA 


TATACGTTCG 


ATGTGCTTTG 


TTTTACCTTC 


TTTGTCTAAA 


TCAATAACAA 


720 


CACCAGATAA 


TACACTTCTA 


CCTTCAG 








747 



(2) INFORMATION FOR SEQ ID NO: 1003: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003: 

AATTTTGTTG ACCCTAAAAC CAAGACAATG CGTAATGCTG CTTGATTAAT ATTCATTTCG 60 

GCACGAGTAC TAATATTATC CAAAATGATG AATTAGGTCG TACCATTTTA AATGAACNTG 120 

GTAAAAAAGC GGAACAAGGT GTAGAAGTTA AAATTCTTTA TGATGACATG GGTTCTCGTG 180 

GACTGCGTAA AAAAGGCTTA CGCCCGTTTC GCAATAAAGG TGGACATGCT GAAGCATTTT 240 

TCCCATCAAA ATTACCTTTA ATTAACTTGC GTATGAACAA TCGAAACCAT CGAAAAATTG 300 

TTGTAATAGA TGGGCAAATT GGATATGTTG GTGGTTTTAA TGTTGGTGAT GAGTACTTAG 360 

GTAAATCAAA AAAATTCGGC TATTGGCGAG ATACGCATTT ACGAATTGTC GGGGATGCAG 420 

TGAATGCATT GCAATTACGA TTTATTCTAG ATTGGAATTC CACAAGCCAC ACGTGACCAC 480 
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ATCTCCTATG ATGATCGTTA TTTCCCANAT GTNAATTCTG GTGGAACAAN TGGCGTTCAA 540 

ATAGCTTCTA GTGGTCCTGA CGAAGAATGG NAACAGATTA AATACGGCCA TNTGAAAATG 600 

ATTTCATCTG CTAAAAAATC CATTTATATT CAATCTCCCT ATNTCATACC TGATCAAGCC 660 

CTTTTAGATT CT 672 

(2) INFORMATION FOR SEQ ID NO: 1004: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004: 



GAGGCAACCC 


GACCTATATT 


CAAAGAACCC 


TCCAGGACCA 


GCTTGTGTAA 


ATCTTGGAAA 


60 


TGTTATCGGA 


TGGATAATAA 


AACCTTCTTG 


ACCTTTGTCG 


TCNCGGGGAA 


ATATCAAGCC 


120 


ATTTGTTAAA 


CCNAAGATAA 


TTCCAACAAC 


GCAATACGTG 


CACCCAGGTG 


GCTCAATTCA 


180 


ATTTTGNCCC 


GGAATACCCA 


TGGCCTGTTT 


TTCTTTAATT 


AACCACCACT 


TGTCACTCAT 


240 


TTCGGAAGTA 


TAAATCGTTA 


CCCTTCACGG 


AGTACCGTTC 


AACCTACACC 


GGCGGAATAC 


300 


AAGAAATACC 


CACCGTGCTC 


TTTGAGCGAT 


GTTGTTAATT 


AATTCTTGGA 


TTAATACTGT 


360 


TTTACCTACA 


CCGGCACCAC 


CGAACAATCC 


GATTTTACCA 


CCTTTAATAT 


AAGGTGCTAG 


420 


TAAATCTACT 


ACTTTAATAC 


CTGTTTCTAA 


AATTTGAACT 


TCTGTTGAAA 


GTTCATCGAA 


480 


TGCTGGTGCT 


TGACGATGGA 


TAGGATCGCG 


GCGAAC AG AA 


TCACTAATTT 


CTTCTTTAAG 


540 


GTCAATTGTT 


TCACCTAGTA 


CATTAAATAC 


ACGACCTAAT 


GTTTCGTCAC 


CAACAGGGAC 


600 


ACTAATTTCT 


TTGCCTGTAT 


CTTTTACATC 


CATGCCTCTT 


TGGACACCAT 


CAGNTGAATC 


660 


CATCGCAATT 


GTNCGANCAN 


CGTCGTCACC 


TAATTGCAGC 


GCAACTTCTA 


ATGTTAGTTG 


720 


TATTGTACCT 


TCTTCTTTAG 


GCACATCAAT 


AACCAAGGCG 


TTATTAATTT 


TAGGAACTTC 


780 


GTTATGTTCA 


NATCGAACAG 


CAATTACAGG 


ACCCATAACT 


TGAGTTACAC 


GGCCAATTCC 


840 


CATGCTATTT 


TCCTCCTTTT 


AAATATTATT 


CCAAGCGCTT 


GCGGAACCCA 


CCAACAATTT 


900 


CAGTAATTTG 


TTGCGTTAAT 


TTCTGCTTGT 


CTCGCTCTGT 


TATATTCCAA 


ATGATAATCA 


960 


TCAAATAAGT 


TCAGTTGCAT 


TATCAGTGGC 


ATTTTCATCG 


CAGTCCATAC 


T 


1011 



(2) INFORMATION FOR SEQ ID NO: 1005: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005: 



AATGTATTGT 


AGAAGCTGAG 


CACTTAAAAA 


GAGAGCGCGT 


AGAAGCTAAG 


TATAGAAAAA 


60 


TGATNTTGGA 


GGCTAAAAAA 


TAATGTTAAA 


TCCACCTTTA 


ANCCAATTAA CGTCACAAAT 


120 


TAAATCAAAG 


TATTTAATTG 


CAACAACTGC 


AGCGAAAAGA 


GCGCGTGAAA 


TTGATGAACA 


180 


ACCTGAAACT 


GAATTATTAA 


GTGAATATCA 


TTCATTTANA 


CCAGTTGGTA 


GAGCGTTAGA 


240 


AGAAATTGCT 


GACGGTAAAA 


TTCGCCCTGT 


TATTTCAAGT 


GATTATTATG 


GTAAAGAATA 


300 


GTTTCGACCA 


TTAAAAATAT 


GTGAAGCTAG 


ACGTACATCA 


AATACGTCAA 


GGCATATCCA 


360 


AATCAAACGC 


ACCTATGTAG 


GTGTGTTTNN 


NTTAGNCTTA 


TCCAATGATT 


GATGTTATAA 


420 


TAATACTAAA 


TTGGCTATCT 


ATAAAAAAGT 


AATGAGCATT 


TGTGCGCATA 


TGATGATGTA 


480 


AAGCGTAAAT 


GTCATAGTGA 


TATTTAAGGG 


AGATTGAATA 


TGAAGAAAAT 


ATTATTAGCC 


540 


GTTACAGGTG 


GCATTGCGGC 


ATATAAAGCA 


ATTGATTTGA 


CAAGTAAGTT 


AACACAATCT 


600 


GGGTATGAAG 


TTCGC 










615 



(2) INFORMATION FOR SEQ ID NO: 1006: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006: 



TTTGTCTTTG 


GAATGGGCTC 


GTTTTTTTTA 


ACCTAATAAG 


AAATGATAGG 


GCATTTGAGA 


60 


TTGGAAGGNC 


ATTTTGGCTT 


TGTGCAAATA 


ATACAATAAG 


CTAAATGTCT 


m-rrriirm 1 


120 


TTGTGAAAAT 


ATGATGGATG 


GCTTGTGTGG 


GCAAGTTTGC 


TAATTTAATA 


AGATATGCAT 


180 


TTTTCAATTT 


AGGAGTTGGC 


CATGCATCTA 


CACTTTATAA 


TGGTGAGAGC 


GTGGTGAGGT 


240 


ATTGTTAATC 


ACGCAATTGT 


AGCGAGGAGT 


TATTGCTACA 


TATGTCGTTA 


TGGCCTATTG 


300 


ATTTTCTAAA 


ATAGCTGTAT 


CAGATCATGT 


GACNAAATAA 


AAATAATTTG 


TTGAAAGCCT 


360 


TTACATAACT 


TGTCTAGACA 


AGTTATACTC 


GTTTTAAGAC 


ATTAAGGGAG 


TGAAATATAT 


420 


GGCTGTAAAA 


AGAGAAGATG 


TAAAAGCCAT 


CGTAAGNCGC 


TATTGGGGGA 


NAAGAAAATC 


480 


NTGAAGCTGC 


ANCGCATTGT 


GTAACNCGAT 


TACGTTTNGT 


GCTTAANGAT 


GAAANCANAG 


540 


TTGATAAAGA 


CGCATTAAGG 


AACAACGCGT 


TGGTCAAGGG 


GCAGTTCAAA 


GCAGACCATC 


600 


AATATCAAAT 


TGTCATTGGT 


CCAGGANCAG 


TCGATGAAGT 


GTATAAGCAG 


TTTATTGATG 


660 


AAACAGGTGC 


TCAAGAAGCT 


TCGAAAGATG 


AAGCGAAACA 


AGCAGCTGCG 


AAAAAAGGGA 


720 


ATCCAGTACA 


ACGTTTGATC 


AAATTGTTAG 


GGGAGATTTT 


TATACCAATA 


TTACCTGCGA 


780 


TTGTGACAAC 




ATGGGGATTC 


AATAATTTAC 


TTACAATGAA AGGTTTATTT 


840 


GGTCCCAAAA 


GCACTTATTG 


AGATGTATCC 


CGCAAATTGC 


TGATATTTCA 


AACATCCATT 


900 
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AATGTGATTG CGAGTTACCG CATTTATTTC CTTACCANCA TTAATTGGTT GGAGTAATAT 960 

GCTGTGTATT TGGTGGTAGT CCGATCCTAG GCATAGTCTT AGGTTTGATT TTAATGCATC 1020 

CGCAATTAGT ATCTCAGTAT GATTTGGCAA AAGGGAATAT TCCGACGTGG AACTTATTTG 1080 

GCTTAGAGAT TAAGCAGTTG AATTACCAAG GTCAAGTGTT GCCTGTTTTA ATTGCAGCTT 1140 

ATGTTCTAGC TAAAATTTGA AAAAGGATTA AATAAAGTCG TTCACGATTC GATAAAAATG 1200 

TTGGTCGTTG GACCGTAACG CTTTTAGTTA CTGGATTTTT AGCATTTATT ATCATTGGAC 1260 

CAGTTGCATT ATTGATTGGT ACAGGTATTA CTTCTGGTGT TACATTTATA TTCC 1314 

(2) INFORMATION FOR SEQ ID NO: 1007; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1007: 



CTATAATTGT 


TAACGTTGTT 


AATAATGTAA 


AAAGTGCAAT 


TTCATTTGGT 


GTTTTATTTC 


60 


TTTCCGCCCC 


TTCAACTAAG 


GCAATCATTT 


TATCTAAAAA 


AGATGTAGCC 


CGCTTCCACT 


120 


CTCAAAGACG 


TATTTCTNAC 


CCAATCAAGA 


TGTTACCAAG 


TGTACCGCCA 


ATGACTCCAT 


180 


CAAAATCGNC 


ACCTGATTCT 


TTTATCACAG 


GTGCAGACTC 


ACCAGAAATT 


GNAGATTCAT 


240 


CANCGGTTGC 


TAATCCATTT 


ATTACAACGC 


CATCAGCAGG 


GATTGTTTCT 


CCATTTTCNA 


300 


CCCGAATATT 


GTGTCCGGCT 


TTTAACTCTG 


TGGCGTACAC 


TATCCGATAC 


GCACCATTTT 


360 


CTTCTATCAA 


TCGAGNAGNT 


AAATGTGATT 


GTGCTTGTCT 


TAAACTATCA 


GCTTGCGCTT 


420 


TTCCACGACC 


TACAGCAAAG 


GCTTCTGAAA 


AATTAGCAAA 


CAATATAGTT 


ATTAATAATA 


480 


TGATAAAAAT 


TGTAATCAAA 


TAACCTCGNT 


CGNCAGACAG 


CTAGNTCCAA 


ATATGTCAGG 


540 


AAAACATATT 


AATATCAACG 


GTTAAAACCA 


TTCCANCCTC 


AACGACAAAC 


ATTATCGGAT 


600 


TTNNNTATTA 


ATTGGNNTAG 


GATTCAAGCT 


TATAAAAACT 


CAATNTACAA 


AGCTTCTTAT 


660 


CAACATTGTG 


TGATTGAAAT 


ATTTATTNAC 


ATGATGNATT 


TTTTCCATCC 


CTTTACTTTA 


720 


TTTTAAAGTT 


AAAAATTCAC 


CAATAGGACC 


AAGTAATAGC 


ACTGGAATAA 


ATGTCAAACC 


780 


ACTTAGTAAA 


ACGATAAATA 


CGATTAGTGA 


TACGCCAAAA 


TAAGGTTTAT 


CAATCGCTAT 


840 


TGTATTTTTA 


TCTTGATGGT 


ATGATTTTTT 


ATTCACTAAA 


CTTGATGCAA 


TCATTAATTG 


900 


CAAAAATAAT 


TGGTATATTA 


CGAGAAAGCA 


ACATAATGAC 


TTCCTGTAGA 


GATATTCCAG 


960 


AATGTGTATC 


ATCTTACAGT 


CCTTCAAACC 


CTGATCCATT 


GTTCGGAGGA 


G 


1011 



(2) INFORMATION FOR SEQ ID NO: 1008: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1118 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008: 



GGCACGAGAC 


TCAAANCACT 


GAAGCATTAA 


CAAAATAATA 


CTATATTACT 


GTCTAATCAT 


60 


AGACATGTTG 


TATTTAACTA 


ACAGTTCATT 


AAAGTAGAAT 


TTATTTCACT 


TTCAATGAAC 


120 


TGTTTTTTAT 


TTACGTTTGA 


CTAATTTACA 


ACCTTTTCAA 


TAGTAGTTTT 


CATGCCACGA 


180 


GCTATCCTAA 


CCCACAGATT 


AGTGATTTCT 


ATACAATTCC 


TCTTTTGTCT 


TTACATTTTC 


240 


TTAAAATATT 


TGCGATGTTG 


AGTATAAATT 


TTTGTTTTCT 


TCCTACCTTT 


TTCGTTATGA 


300 


TTAAAGTTAT 


AAATATTATT 


ATGTACAACG 


ATTCAATCGC 


TCTATTTTTC 


AACTTTCAAC 


360 


ATATTATTAA 


TTCGGAAAGG 


ACCACTTTAA 


AATTTAACNG 


GCCACAACAA 


ATCAAATCAA 


420 


TTAATCACTT 


TTTCCAAAAT 


AATCATATAA 


GGAGGTTCTT 


TTCATTATGA 


ATATCATTGA 


480 


GCAAAAATTT 


TATGACAGTA 


AAGCTTTTTT 


CAATACACAA 


CAAACTAAAG 


TTATTAGTTT 


540 


TAGAAAAGAT 


CAATTAAAGA 


AGTTAAGCAA 


AGCTATTAAA 


TCATACGAGA 


GCGATATTTT 


600 


AGAAGCACTA 


TATACAGATT 


TAGGAAAAAA 


TAAAGGCACG 


AAGCTTATGC 


TACTGAAATT 


660 


GGCATAACTT 


TGAAAAGTAT 


CAAAAATGCC 


CGNAAGGAAC 


TTAAAAACTG 


GACTAAAACA 


720 


AAAAATGTAG 


ACACACCTTT 


ATATTTATTT 


CCAACAAAAA 


GCTATATCAA 


AAAAGAACCT 


780 


TATGGAACAG 


TTTTGATCAT 


TGCACCATTT 


AACTATCCTT 


TTCAACTAGT 


ATTCGAACCT 


840 


TTAATCGGTG 


CTATTGCAGC 


AGGTAATACA 


GCAATTATTA 


AACCATCTGA 


GTTGACACCA 


900 


AATGTTGCAC 


GAGTGATTAA 


ACGATTAATC 


AATGAAACAT 


TTGATGCAAA 


TTACATTGAA 


960 


GTTATTGAGG 


GAGGAATTGA 


AGAAACGCAA 


ACGTTAATTC 


ACTTACCTTT 


TGACTATGTC 


1020 


TCTTACAGGA 


GTGAAATTGT 


AGGCAAATCG 


TTTATCAAGC 


TGCAGCGAAA 


TTTAGTCCTG 


1080 


TGACATAGAA 


TGGTGGGGAA 


ATCTCCAGTC 


ATCGNGGG 






1118 



(2) INFORMATION FOR SEQ ID NO: 1009: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009: 

AAAGAGTTCG CCTTAAAAGG TAACGTCTTA GATTTAGCAA TTGCTGTTGT GATGGGTGCA 60 

GCTCACACAA GATTATATCT NCATTAGTAG AAAATATCAT CATGNCATTA ATTGGTAAAA 120 

TTCNCGGATC AGTTGATTGT GCTAAAGAAT GGACANTCTG GGGTATTAAA TACGGTTTAT 180 

TTATCCAATC TGTTATCGAC TTTATTATCA TCGCGTGTGC TTTATTCATC TGTGTTAAGA 240 
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TTGCAAATAC ATTAATGAAG AAAGAAGAAG CCGAAGAAGA 


AGCAGTTGTG 


GAAGAAAATG 


300 


TTGTGTTATT AACTGAAATC 


AGAGATTTAT 


TGCGTGAGAA 


AAAATAATTC 


TAAATAAAAA 


360 


GAGATGGGCT AAGGTTGCTC 


TATACATTCC 


AAAGCACCAT 


CTCTATTTNN 


TATGGTNNNA 


420 


ATCTCTTGTG ANTCTTGGCG 


AACTCTGATA 


TTGGATCTTG 


ATTAPACTTC 


TAAAAPTAAA 


480 


GGTNTTCTTA TTATCTCTTA 


AATTCGCTAC 


AAACCATGTG 


AAATCTATGA 


TTGGAATCCC 


540 


AACCATTCTA CCAGGAAAAT 


TATTTGATTA 


AAATTGATGA 


TTGGCATCCT 


AATAAAGGGT 


600 


CTTTAACGCT NTTGCAAGCG 


GCCAATTTAC 


TTGATCTAAT 


GTACNACGAA 


TCCTTCATCA 


660 


ATAAATATTG ATTCTAGTGA 


AATACC 








6B6 



(2) INFORMATION FOR SEQ ID NO: 1010: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010: 



GGCACGAGCT 


TTTAAAGAAG 


ATGGACGTTT 


TATTCCAACA 


ATTGCATTTT 


ATAGTCCGAA 


60 


TGCATTAGGC 


GCTATCCAAC 


TTAAAAGCCA 


CCTACCATTT 


CTCGATAACT 


TACAGTTTTT 


120 


ACAAAAATAT 


ATATCATGAA 


TTATCAANCG 


GATTATTTGG 


ATGTCAGGGA 


TGTAGATGCG 


180 


CCCTCATATT 


GGTACAAAAA 


ATATAAATTA 


TCAGCATGAT 


TTGGACGCTT 


TAATTCAAAA 


240 


ATTGTAAGCT 


GTTAGGAGGT 


CCACAAATGG 


TAGAACAAAT 


AAAAGATAAA 


CTAGGACGTC 


300 


CCATCCCTGA 


CTTACGGTTA 


TCTGTGACAG 


ATCGGTGTAA 


CTTTTAGGTG 


TGATTATTGC 


360 


ATGCCCTAAA 


GAGGTATTTT 


GGAGATGATT 


TCCGTATTTT 


TACCCTAAAN 


ATGAACTTGT 


420 


TAACGTTTGA 


TGAAATGGCT 


AGAATCGCTA 


AGGTATATGC 


AGAATTAGGT 


GTAAAAAAAA 


480 


TACGCATTAC 


AGGTGGAGAA 


CCATTGATGC 


GACGCGATTT 


AGAGCCATGT 


ACTTATAGCT 


540 


AAATTAAATC 


AAATCGATGG 


TATTGAAGAT 


ATTGGTTTGA 


CTACAAATGG 


TTTGTTATTA 


600 


AAAAAGCATG 


GACAAAAGTT 


ATATGATGCT 


GGGCTACGCA 


GAATTAATGT 


CAGTTTGGAT 


660 


GCTATTGATG 


ATACGCTATT 


TCAATCAATC 


AATAATCGTA 


ATATTAAAGC 


GACTACGATT 


720 


TTAGAACAAA 


TTGATTACGC 


GGCACGAGCT 


CGTGCC 






756 



(2) INFORMATION FOR SEQ ID NO: 1011: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011: 



GGCACGAGCG 


CACGATGGGC 


ACCGTGAGTT 


ATACCTATGT 


AACTGACATA 


CCTAGTAGTG 


60 


CTTATGCTTA 


TCTAATGCCT 


TATCGTAAGT 


TAAGTTGGGG 


TAAGCGATAT 


GTTTAATCCG 


120 


TTTGATGAGT 


TTCCGCACAC 


AATTGAAATT 


GGAGAGGTTG 


AAGTTGCAGG 


AACATTTCCT 


180 


AAAGAATACG 


AGCGTTTTAA 


AAGTAACGAA 


ACAATTAAAG 


GATTTATGGA 


TACGCCTACA 


240 


TCAAGCGAGA 


CACTCAAATT 


TCATCAAATG 


AGCAAAGACT 


TCGACCGTAA 


CCTATATACG 


300 


CCGTATCACA 


TACCAATAAC 


AAACAAAACT 


TTATTTAATT 


ACGAGGGTAA 


AACGTACGAA 


360 


GTTGTAGTGC 


GAACCGGTCG 


ACCAAGGCGG 


ACAACATGAA 


ATCAATTTAA 


CTAGATTGAG 


420 


GTGCGATCTA 


TTGGCAAAGG 


TTTC 








444 



(2) INFORMATION FOR SEQ ID NO: 1012: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1012: 



CAAATCATCA 


TATGCGCACA 


ACTTGCTACT 


GTACTCATAG 


CATTTACATA 


AATAGGGTTT 


60 


CACGAGCGTC 


ATCACTTTAT 


CTATTTCGTC 


GTCGTCCTTT 


GTATTAATCA 


TCTCTCGACT 


120 


TGTTACAATA 


CCTACTAATT 


TATACGACTC 


ATTGACTACC 


GGAAATCTTG 


TATGGCCAGT 


180 


TCGATTCGCC 


ATACGCTTAT 


AATCTGCTAT 


TTTCATCGTA 


TCAAACAGCA 


CAGACAAATC 


240 


ATCTAATGGC 


GTCATTATAT 


CTTGAACTAT 


TAAGATATCT 


TTTCGTATTT 


TCTTGATTAA 


300 


AAAGTGCTTT 


GTTGATAATA 


TTTGCAACTA 


GGAATGTATC 


ATAACTCGGA 


TGATAGAACA 


360 


GGGTAAATCA 


TGTTCAATTC 


GGCAAAATTA 


ATAACTTTAT 


TAGGATGGGC 


TTAAATCCAC 


420 


CAGGTAATTA 


ATATAGGCCG 


NACCTCTTTN 


NTAANGCTNC 


AACCTGGCAC 


ATCTTCGCGA 


480 


TTTCCGACAA 


TCAATAATGT 


CTTTGGACCA 


ATATATTTTA 


AAATATCTTT 


GAGTTCCATG 


540 


GCTCCAATGG 


CAAATTTAGA 


TACCATCTTA 


GTGATACCTN 


TATGGCAACC 


TAACACTTGG 


600 


CCATCAATAA 


TATTGAAAAT 


TTCATTAAAA 


GTTAAATGTT 


CAATTTCATA ACGATTACGT 


660 


TTTTCGATTC 


GAACCGTACC 


AACACGATCT 


ATCGTTGCGA 


CCATGCCCAT 


TTTATCAGCA 


720 


TCTTTAATTG 


CACGATATGC 


TGTCCCTTCA 


GATACGTTTA 


AAAATTTAGC 


GATTTTACGC 


780 


ACCGAAATTT 


TAGAGCCTAT 


AGATAACGAT 


TCAATATAAT 


CTAAAATTTG 


TTCATGTTTT 


840 


GTCATTCTTT 


ACCTCTTCCT 


TTTCGAACAG 


TATTAACTAC 


ATTATAACTT 


TATTTTGGAT 


900 


AAAAAGCATT 


GAAGTGAAAT 


GAAATAATGA 


TCGTTTCACC 


TATTTTATTT 


TTTGAAAATA 


960 


TACAACAANA 


CACAAAGGAT 


CACAAAATCT 


TTAATTTTAA 


ATGGAAAAAT 


CCATTATTAT 


1020 


TTATTAGAAT 


TGTAAGTGAG 


GAGGGATTGT 


ACTAAT 






1056 
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(2) INFORMATION FOR SEQ ID NO: 1013: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013: 



ATGCCTCNAA 


ATGTGCACTT 


GGGGTGNNCC 


CGATGCTAAA 


TGNAATAGAC 


AAGATGAGCG 


60 


GGATCGTTAG 


CNTTACTACC 


CAAAGATAAT 


CCCTTCAGTG 


AAAGATGATA 


GTGCGAAANC 


120 


NACACCANTC 


TGTACATAAT 


CAGTACCAGC 


GTGAANAAGT 


GCAATATGTG 


GACTAATGAC 


180 


CATATATGAC 


CCGNNTTGTT 


CNGNACATTG 


TTTTAANNNG 


ANTTGCANCT 


GTATAATCTG 


240 


ANTAGACAAT 


GCCATCATTG 


ANTAAAGGNT 


GCACAGCCAC 


TNCAANTNCG 


GANTCAANTG 


300 


ATAATGGNTG 


TTCANTTATA 


NTGATNCGAT 


GTTCAGGACA 


ATAAATCTGC 


GAGTGACTTG 


360 


CCATCAGTTG 


CCATTTTCAT 


GACTCGNTGT 


TCTCTTGAGT 


CATTGATAAT 


TTGATTCAAT 


420 


TTTTGACGAG 


NTTGTTGATT 


GATAAATGGA 


TCGACATGAA 


TAACTGGTAC 


AGCTGATATT 


480 


TCACAAGGTA 


CTCGTTGAAA 


TGACATAANC 


AATGTTATCT 


TGCAATAATC 


GACTTTCTTC 


540 


CAATTGATAA 


ATGGAATAGG 


CATCCCAAAT 


GTGAAACTCA 


GGATACAGAT 


GATTTAGTTT 


600 


TGATTTTAAA 


AGTTGCGACG 


TGCCTATACC 


AGAACCACAT 


AGTAAGACAA 


CCTTAATCAT 


660 


TGATTGTTTA 


TGTGTTGCAA 


CACGCTCTAG 








690 



(2) INFORMATION FOR SEQ ID NO: 1014 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1019 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014: 

GGCACGAGCT GATACATTAG ATATCACACC AAAATTAATG TTACAAGGTT CTGTTGACCT 60 

ATTAAATGAA GTTGCAACTT CTAAAATCAC AGGTGAAGAA GAAATTTATT CACATACAGA 120 

TTTATATGAT TTTAAAGCGA ACGTTGAAGG CGCACAAAAA ATTTATGACT TATTTAAACC 180 

TATTTTAGAG AAAAAAGATA AAAAATTAAG TGATGATATT CAAATGAACT TCGATAAAGT 240 

GAATCAATTA TTGGATAAAT ATAAAGATAA CAACGGCGGT TATGAGTCAT TTGAAAAAGT 300 
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ATCGAAGAAA 


GACCGTAAAG 


CATTTGCGGA 


TCCTGTTAAT 


GCATTAGGGA 


GAGCCACTAA 


360 


GTAAAATGGC 


CTGTGATTAC 


TTGAATGACA 


AATTATTGAA 


CAAGTTAACG 


GATAGTACNC 


420 


AATTTTCAAG 


ACGTACATTT 


TTGNAAATGT 


TAGGTATTGG 


CGGGTGCCGG 


TGTTCCAATT 


480 


GGCGCAAGTG 


GTGTTGGTAG 


CATGTGGTCT 


TTCAAATCAA 


TGTTCAATAC 


ACCAGAAGAT 


540 


CCGGAAAAAG 


ATGCGTATGA 


ATTTTATGGT 


AAAGTNCAAC 


CAGGCATTAC 


CACACCCACC 


600 


CAAAAAACAT 


GCAATTTCGT 


TGCGTTAGAT 


TTGAAGTCAA 


AAGATAGAGA 


TGCAATTAAG 


660 


GCAATGTTTA 


AAAAGTGGAC 


GGTTATGGCT 


GATCGTATGA 


TGGATGGTGA 


TACAGTTGGC 


720 


AAGACGAGTA 


ACAATCCTTT 


AATGCCACCA 


GTAGATACCG 


GTGAATCGAT 


AGGATTAGGT 


780 


GCAAGCAAGT 


TAACGATTAC 


CTTTGGGATT 


AGTAAGTCTT 


TGATGAAGAA 


AAATGGGTTA 


840 


TCTAAGTAAA 


ATTCCCGATG 


CCTTTAAAGA 


TTTACCGCAT 


TTTCCGAATG 


ATCAGTTAAT 


900 


AGACGATTAC 


AGCGATGGTG 


ATATTATGAT 


TCAAGCATGC 


TCAAATGATT 


CGCAAGTATC 


960 


CTTTCATGCG 


GTTCATAATT 


TAGTTCGTCC 


ATTTCGAGAT 


ATTGTTAAGG 


TCTCGTGCC 


1019 



(2) INFORMATION FOR SEQ ID NO: 1015: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015: 



NANACTGGTA 


ATTTACTTGT 


CCACTTGANA 


AGAATCCCCT 


ACCTCAATTT 


TAATTGAAGC 


60 


CGAAGAAAGT 


AAAATACCTG 


AAATCAAAAA 


TATGCTTACT 


CATTTTTATG 


CCGNTCATAT 


120 


TGAGCATCGA 


CGCTGGGGCG 


CACCANTCCC 


TGNCATTGAA 


ATTGTAAAAC 


TTGGTATTAA 


180 


TAAAGCAAGA 


GGCATTGAGC 


AAGTTAGACA 


ATTTTTANAT 


ATTGACCGAA 


ATAATATTAT 


240 


TGCATTCGGT 


GATGAAGATA ATGATATTGA 


AATGATTGAG 


TACGCCCGCC 


ATGGTGTTGC 


300 


TATGGAAAAT 


GGTGTGCAAG 


AACTTAAAGA 


TGTAGCGAAC AATATTACAT 


TCAACAATAA 


360 


TGAAGATGGC 


ATTGGGTCGA 


TATTTGAATG 


ATTTCTTTAA 


TTTAAATATT 


AGATATTACT 


420 


GNTAATTTAT 


AACTAATCAT 


TNTATAATAT 


TTCAAAACAA 


TAGGAGGTAA 


GTTACGATGC 


480 


CCAAAATAGT 


CGTAGTCGGA 


GCAGTCGCTG 


GCGGTGCAAC 


ATGTGCCAGC 


CAAATTCGAC 


540 


GTTTAGATAA 


AGAAAGTGAC 


ATTATTATTT 


TTGAAAAAGA 


TCGTGATATG 


AGCTCGTGCC 


600 



(2) INFORMATION FOR SEQ ID NO: 1016: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016: 



CTCNGCCCTG 


CCATGCCAAT 


CNCGTCCCNT 


ACAAATATTT 


TTAATTATTT 


TAAAAAATGA 


60 


TGCNCTAAAT 


TAGGCACGAG 


CTTAGNAGNT 


CTATTGTCAG 


CGTCATATGT 


TGGATTCATC 


120 


TCAGCAATAC 


TAACTGAAGA 


CACCTTATCA 


CTTGGAATAA 


TACGTTTTGC 


TAATTCAAGA 


180 


ACAGTATGTG 


GATACAAACC 


TAACACTGNC 


GGCGCACTTA 


CCCCAGGCGC 


AAACGCACTA 


240 


TCAATGACAT 


CCATACAAAT 


CGTAAACATA 


ATGACATCAT 


GTTCATGTAC 


AAAACGTTCA 


300 


ATCATATCTT 


TAATTGTTGG 


TGATACGTGA 


CTCAATAATT 


CATCTGCAAA 


GACATAATCA 


360 


ATCTTTTTCT 


CTTTAGCATA 


ATCAAATNAN 


CTTTGCGTAT 


TACCACCTTG 


AGCAAATNCC 


.420 


AAGCACTAAA 


TAATCTGTGC 


TTTCATCTTC 


TTCTAAAATT 


TGGCTAAAGC 


TCGTTCCAGA 


480 


TGTAGATTGT 


TGT 










493 



(2) INFORMATION FOR SEQ ID NO: 1017: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017: 



GGGATTCCGT 


GGAGGCATCA 


TGTGGATTGA 


ATCTAAGTCA 


TTTACTAAAA 


CATTCGGAAT 


60 


GGGTATTCTT 


GATGCTCGTA 


TGCCTCCAAA 


CTCATTTCCT 


TAAACCTCTT 


GTAAAAACTG 


120 


TGTCCACATT 


TTATCAGGTG 


ACGGTTGCCC 


ACTATATACC 


AACGTATTAG 


CAAACATGCC 


180 


TAGCATTTGC 


TCCGTGCCTT 


TATGCATACG 


CGCACTCATC 


ACACTACCGA 


CAACAACATC 


240 


ATCTTTTCGA 


GCATATCTGC 


TTAACAACGT 


CATGACCACA 


CTCATAAAGA 


ACATAAAATC 


300 


AGTAATTTGA 


TGCTTTTCTA 


CATACTTTTG 


AAGTAGCTGT 


CTCATTTGTT 


GATTCATTGT 


360 


AAATGACATC 


ATTGCTCCAT 


TTGTCGTTTT 


AATATTTGGT 


CTAACATAGT 


CTGTCGGTAA 


420 


GCTTAAAATA 


GGTACTTCAT 


CTTCTGAATT 


GAGATAACCA 


ACATTGTCTA 


TGTTTCCGTC 


480 


ATATCACGAA 


TGCGACATCC 


ACCCACTATA 


GTCTCTATAT 


TGCAATTTAA 


GTGGTAACAA 


540 


TAATTTATGT 


TGATAAAGTG 


CGTTAAGATC 


ATTCATTAAT 


TGTATATTAC 


TCATACCGTC 


600 


CATTAATGAT 


ATGATGTGTA 


TCTATAAACA 


GTATGCA 






637 



(2) INFORMATION FOR SEQ ID NO: 1018: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1032 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1018: 



GGGAGATAAA 


TATTGCCAAT 


CAGGACAGAC 


CATGAAATCT 


ATGTCAGAGC 


AAATGTATAA 


60 


AATAGACAAG 


TTTCCCTCAA 


GATACGAATC 


ACCATCATAC 


AATGTTTGGT 


GGTACATTGA 


120 


TGGCTAATAT 


TGATGAAATT 


GCAGCAATCA 


CAGCTATGAA 


ACAATGCTGG 


TGCGCAAGTA 


180 


GTTACCGCAT 


CTACAGACTC 


AGTAGATTTC 


TTAAAGCCGA 


TTAAAACAGG 


GGACATATTA 


240 


CAATACGTAG 


CGATGGTTTC 


ATACGCTGGG 


ACTAGTTCAA 


TGGAAGTGGT 


CGNTCACATT 


300 


AGANTTGTCG 


ATGTATTTAA 


TAACAAGCAT 


GATTTAGCTC 


CATTAAGTTA 


CTTAACATTT 


360 


GTTGCGTTAG 


ATGATGAAGG 


CACACCTAAA 


CATGTACCTG 


GTGTATATCC 


AGAAGATGAT 


420 


GTTGTAAAAT 


GGTTTTATGA 


TACAGCACCA 


CAACGTGTTG 


AACGAAGAAA 


AGCACGTCGC 


480 


ATTGAAAGTA 


AACAAACAAN 


TGAATATTTA 


GCTCAAGTGC 


AACATATTAG 


AGACTAAAAA 


540 


TATAAAGACA 


GATTGTTAAG 


GATGAATTGA 


ACTGTTATGT 


TCGAATATTC 


CACCTACAAT 


600 


CTGTCTTTTT 


GATTATGCAC 


GTTTATGCTT 


TTAGTTAAAA 


TTTAACCTTT 


TTATTATAGA 


660 


CTATTAGTCC 


GATTGCCTNA 


NTGACCTTNA 


TTATNAANGT 


AGAGTACTGN 


CAGAAATTAA 


720 


TTGTTTTTTA 


TTNAATAATA 


GCATNAAAAT 


ATGCGGAGAC 


TACTTATCGA 


TAAAGTGCTT 


780 


CTCCGCATAA 


TTAAGAGTTT 


TAAAAATTGN 


NTAATNTGTG 


GGTGNANGNG 


CATATGACGC 


840 


TCATTATCTT 


TAAANTCCAT 


NTTCGGATAG 


TATCGTGCCA 


TATTTGTAAC 


AAGAACATGT 


900 


GGTCCTAAAC 


ANCTAAACTC 


GGNACAATGA 


CAATNCANAT 


GATTNAGCAA 


GATCAGAGGA 


960 


TANCCATTTA 


TCAAATACAT 


CTAGTTAATT 


AATCTTTGNG 


TATATTCCGA 


AATAGTACCT 


1020 


AGTATCATCT 


CC 










1032 



(2) INFORMATION FOR SEQ ID NO: 1019: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 1019: 

CCATCGAAGA GTATCGACAC CATTTAGAGC ATTTAGACCG CATAAGAATT AGCTTTTTCA 60 

CATATCATAG GTCTGATGCA GACATTGAAA TTGTTGAAGA AGACTTTGAT TTATAAGTTT 120 

AAAGCAATTC CTGTAATTAA TACCACAATG AGATAAAGTA ATATTGAATC TCGAAATTAA 180 

ATTTAAAAAA GCAGTAAGAT TATTTTCAAT GAAGAAAATC TCTTACTGCT TTTCTATGTT 240 
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ATGACCCCAC 


CATCPTGTTA 

\~t\ X V— V- X X x*» 


CAAATCTTCC 


CTACAAGATT 


GTNCTGAAGT 


CAAACAACTA 


300 


ATA fTTTATA 

£X i. «VV_> X X XnXn 


x x x x vn** a 


GCTATATAAA 


TGCTCGCAAG 


CAACTTGGTT 


TGGAATCAAG 


360 


GCACGATGGG 


TNPPrfVTTTC 


AACAAATATT 


GTCCCATTTT 


CAATTACGAC 


AANTTNGGCA 


420 


— rl x vt x i\j l Jt\ 


ATAfypf3fiACA 

A X A\J XUUAUl 


AAGPGATGCG 


CTACGATAAG 


TGTCGGTCGA 


1V*1 X XAAXV^A 


O v 


AUnL A J. A— 1 AA 


X\jl_ 1 X V_ X XAjA 


ATTANTnfi AT 

A X X AH X VJV3A X 


TPAPTTTT'TA 

X IvA w 1 1 IV. 1 A 


AATPAAfSTfif* 
aa x unnkj x vj\_ 


A\_ X X o 1 IVA_ 1 


j4U 


TCATC CAAG A 


TAAGAATTGG 


CGGATTATTT 


AAAAATATTC 


TAGCAATCGA 


TAATCTTTGT 


600 


TTTTGACCAC 


CTGATAATTT 


AACACCTCGT 


TCACCTACTT 


CAGTGTCATA 


TCCCTGTGGC 


660 


AAGTTCATAA 


TAAAGTCATG 


TGCATTAGCC 


ATTTTCGCCG 


CTTCAACTAC 


TTCTTCATCT 


720 


GTTGCTGNTG 


GACGACCANG 


TAAAATATTT 


TCCTTAACTG 


TATCAGAGAA 


TAAAATATTA 


780 


TCCTGTTGGC 


ACCCAATCCT 


ATTTGATTTC 


TTAAACTTCC 


CGTTAAAAAA 


TCTNTAATAT 


840 



(2) INFORMATION FOR SEQ ID NO: 1020: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020: 



AATTTTAAAG 


ACCCCNCGCA 


TAAATANCCA 


TCCCACCTAC 


TTATCCAAAA 


GTTANGTGGA 


60 




ATTAAAATTA 


ATATTAGTGT 


AANCCAATCA 


AAGATTTAAT 


CNAATATGCC 


120 


CCTGCTCAAA 


ACATTTCCTC 


ATTTAATTTG 


CTTTACTTTC 


AATTTAATAT 


CATTATCCAC 


180 


AACACTTGGC 


GTGTCATCGT 


TATTATTTCG 


CATCTTTGAC 


ACGTTTATCA 


TCATTAGGAN 


240 


TCGGCACCGA 


ATAAAATTGC 


GATAAATGCC 


ATGATTCCCA 


TTAATACGTT 


AACCCAAAGT 


300 


GCAATCATCG 


CACCTGTATG 


AATGCTCGTT 


GCAGCAACTG 


CACCAACATA 


TACAGCACCA 


360 


CTAATTGCGA 


CACCGAATGC 


GCCACCAAGT 


GATGAAGCCA 


TTTTATAAAT 


ACCTGAAGCA 


420 


ACGCCAACTT 


TATCTAACGG 


TGCATTCGAA 


ATAGCTGTAT 


CTGTAGAAGG 


TGTTGCATAA 


480 


ATACCTAAGC 


CTAGTCCGAA 


ACATAAATAT 


CCTACGACAC 


AACTGATAAC 


ATAAAATATG 


540 


CCTGGTAAGA 


ATACTANTGA 


AATAAGTGCA 


ATNCCAATGA 


CCACAATGNA 


TGTACCTNAT 


600 


AACATTGGTC 


GCTTAGAACC 


CANTTNTGGT 


NATAATAATT 


TTTCACCAAC 


TCGAATCATC 


660 


AATAACAACA 


TGATTAAATA 


AGTAANTGAT 


NAGTATCCTG 


CCTGCCATNC 


TGTATAACCT 


720 


AAACCTTGTT 


GCACGCATGT 


ATTCGCTACA 


ATTNATGTAC 


CTACAACNCC 


GTTG 


774 



(2) INFORMATION FOR SEQ ID NO: 1021: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021: 



/"■TV* TV A OWT»T« X 

l_ IV- I\xATTt-A 


ACAAAATGAT 


TATTCTTTAC 


GTAAACTACC 


TTTTTTTATT 


TGAGATGAAG 


60 


CATATGCTTT 


TAATAATATT 


GTCCCAATAA 


TACCAACTGA 


AATAATATTT 


AATACTGCAG 


120 


AGATAACACC 


TTGTGTATAA 


ACCTTGTTAG 


CCGGTTCGTT 


ATAAATCAAA 


ATATCTAATG 


180 


TTGGTGCAAT 


AAGTGCCCAG 


CAAATAATAT 


TCGCAATAAT 


TTGACCGATA 


TTAAAATAAA 


240 


TCATCGATTT 


CCTAGAAAAT 


AGGCATGAAG 


AAAGATTTAA 


TTTTAGGGCC 


AATCCATCCA 


300 


TATTAAACAG 


GCGATAATTC 


CTGAACAAAT 


AACCCAACTC 


CACCAAGCAC 


TACCCGTATG 


360 


TCGGGGAAAT 


CTTTAATAGC 


GTGNCCAACT 


AATCCAGGCA 


TTAAACCAGC 


AAAAGGCCCA 


420 


AATATTGCAG 


ATATTAATGC 


TAAAAATGCA 


TAAGATGTTT 


CTATATTCGT 


ATTAGGAAAA 


480 


CCTGTTGGTA 


TTACAACAAA 


ACGCCCTAAA 


ATCACAAATA 


CCGCNGCTCC 


TATACCAATC 


540 


GCAACAACAG 


TTTTAACTGA 


AATATCNTGT 


TTTTTCATCT 


TCATTACTCC 


TTACATAAAA 


600 


AATTCATTAA 


ATTGATGGTG 


CTTTAGATAA 


ATGAATCGTC 


CAATCATTTC 


CAGTACCAAT 


660 


ATGATATAAA 


TCTGAAAATG 


AGTCTCGATT 


GACTGCTACA 


CCAATATTTA 


CTAGCGAGTT 


720 


AACATACACA 


AGAGGTTCAC 


CCACATTAAC 


ATCTGCAAAC 


GATCGCTCGT 


GCC 


773 



(2) INFORMATION FOR SEQ ID NO: 1022: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022: 



CCTGCAAATG 


TTACACCTGT 


TCTCGCCAGG 


ACGGTTGACT 


GAGAAACTTG 


TTGTACGTTT 


60 


TAAAGACTTT 


TCAGAAAGTG 


AAAGTCTAGG 


TGACATATCA 


GAAATTTCCT 


GTTAATAATT 


120 


CTTTGAGTTC 


CTTAGATTTT 


TCATCTGANC 


CAAGGCTGCC 


AACGAATTCA 


ACGTTGCCCT 


180 


CCATAAGTTC 


TAATAATTGT 


TTAAGTTGTT 


GTTTTAAATC 


AGCATTAAGC 


ATGGNTGTAA 


240 


TGCCTCCTAA 


GATTTTACCT 


ACTAAATCTA 


AACCAGGTTG 


CAATGTTTTA 


CCGCCTTCTT 


300 


CCCATTTAGC 


TGGGCATACT 


TCNCCAGGGT 


TTTTACGAAC 


ATATTGAGCT 


GCTTTGATTT 


360 


TGTGAGCTAA 


TGTACTAGCG 


TCACGGCCAA 


TTCCGTCAGC 


GTTAATTTCA 


GATGCTTGTA 


420 


CAACACCGTC 


TGGGTCGATA 


ATGAATGTAC 


CACGTTGAGC 


TAAACCAGTA 


GCTTCATGCC 


480 


GAANACATCA 


AAATTACGAG 


TGATTGTTTG 


TGATGGGTCA 


CCAATCATAG 


TGTAAGTGAT 


540 


TTTGCTAATT 


GCATCTGAAT 


GGTCATGCCA 


TGCTTTGTGT 


ACGAAGTGAG 


TATCAGTTGA 


600 


TACTGAGAAT 


ACATTTACGC 


CTAATTTTTG 


TAATTCTTCA 


TATTGGTTTT 


GTAAGTCTTC 


660 
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TAATTCAGTT GGACAAACGA ATGAGAAGTC AGCAGGATAG AAGCATACTA CGCTCCAAGA 720 
ACCTTTTAAA TCTTCTTGTG TAACTTCTTT AAATNGATCT TTTTTTGGAT CGAAAGCTCG 780 
TGCC 784 



(2) INFORMATION FOR SEQ ID NO: 1023: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023: 



CCGACTTCTT 


CTAAAGACCC 


TTCAATCTTA 


TCTTTATTTG 


TCCATTTAGC 


TACAACATCT 


60 


TTCGTAGGAT 


TCATACCATA 


TGAACGTCCT 


TCAAAAATTG 


CGCCTCTTGA 


GTCACACATA 


120 


ACCATATTTC 


TTACACCATA 


CGCGTATAGT 


AATTTAACAA 


TGGNTATTCC 


TGCTGCACCA 


180 


GCACCATTGA 


GTACAACTTN 


NATTTTAGCA 


ATATCTTTGG 


TAACAACTCT 


CAATGCATTT 


240 


ACCAAACCTG 


CCATTGTTAC 


AATTGCTGTA 


CCATGTTGAT 


CGTCATGGAA 


TACCGGAATA 


300 


TTAGTTTCTT 


TTTTCAATCG 


GTACTTCAAT 


TTCAGAACAA 


CGTGGTGCCG 


AAATATCCTC 


360 


TAAATTAATA 


CCACCATAAT 


TAGGTTCTAA 


CGACTTAACT 


GTTTTAATGA 


TTTCTTCGGT 


420 


ATCAGTTGTA 


TTTAACGCAA 


TAGGCACCCC 


ATTGATACCA 


GCGAAGCTTT 


TGAATAATAC 


480 


TGCTTTACCT 


TCCATTACAG 


GAATACTTGC 


TTCAGGTCCA 


ATGTTACCTA 


AACCTAATAC 


540 


CGCTGTTCCA 


TCAGTAATAA 


CTGCAACTGT 


ATTTCCTTTA 


ATTGTGTAAT 


CATATACTTT 


600 


TCTTTTATCT 


TCATAAATAT 


CTTTACACGG 


TTCAACAACG 


CCAGGTGAGT 


ATGCTAAACT 


660 


TAATTCCTCT 


TTATTAGTAA 


CTTTTACATT 


TGGTTTAACT 


TCTAATTTAC 


CTTGATTACG 


720 


TTTGTGCATT 


TCCAATGCTT 


CATCTCTTAA 


TGACATGAAA 


TCAGCCCCTA 


ATTCAATATT 


780 


TATTTTTAAA 


AAATAACTTG 


GATAAAACGC 


ATTACATTAT 


AAAAGTAAAA 


ATATTGGGTA 


840 


AATCTGAATG 


AGTAAGAATT 


TATGGTTTTG 


GATTATGTAA 


CACAAATAGC 


GATAAACGAT 


900 


AATAAAATAA 


TATTTATAAA 


GATACATTAA 


ACCCATACTT 


ATCTTAAAGA 


TATTACCCTT 


960 


TAATTATTAT 


AATGGATAGC 


AAAAACAATA 


TATCAAAAAG 


TTATTATTTT 


CCGCACGATA 


1020 


TATCCGACAA 


AAATTCTTTT 


ACTCAATTTA 


TNGTATACTG 


GTTNGGGGGC 


TAATTATTCT 


1080 


TNATTGGATT 


TAACAATAAT 


TGTAAAGTGA 


AACTCCATAA 


AAATAATAAG 


CATTAAAAAC 


1140 


TTATATTNAA 


CGCAAACTGA 


TGGG 








1164 



(2) INFORMATION FOR SEQ ID NO: 1024: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1018 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: lin ar 
(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024: 



CTCAGTTTGT 


TGTTTAATTT 


CTTCACCATA 


ATAATATTCA 


TTCGCATTTA 


CTTTATCACT 


60 


TTTAGGATGA 


ATAAGCACGA 


CAATATTTTC 


ATCATTTTCT 


GTAAAAGGTA 


AACTTTTTCG 


120 


CTTACTTCTA 


TAATCTAATA 


TTTGCTGTTC 


ATTCATCGCA 


ATATCTTGAA 


TAATTAATTG 


180 


CGGTGATTGA 


TTACCATTCC 


ATTCGTTGAT 


TTGAACAGAT 


CCTAATATAT 


TAATTGGCTG 


240 


TTCATCTTGT 


AACTCAGGTT 


CTAAGTGTCC 


ATTTGGCCAA AATAGCACGA 


GCGGNGATAT 


300 


TACTTTCACC 


AAGTGTCAAT 


TTTAGATGAT 


TTTNTGGTTG 


ACCGATCGCC 


TTAACTGAAG 


360 


AAATCTGATA 


AATCATCCAT 


TTCAAAAATA 


GGTCTTAGAA AAATCTTGTN CCGAAGGGTC 


420 


TTAAACGATT 


CAATATCACG 


AATATTTTTA 


ATCGTTATAT 


CATTTTCTGT 


TAATAATACG 


480 


TCTACTGGCT 


TTACGGGATC 


TAACGAAGTT 


GTTTTAGATA 


ATTCTTTCAT 


CCATTTATTT 


540 


AAACCTTCAG 


CTAACGATTC 


TATATTTACA 


ATATCCATCG 


GACATACCTG 


NAGNCATATG 


600 


ATGGCCGNCC 


AAATTTAGCG 


ATTAACTCTT 


GATGGTGCTG 


ATAGTATTTC 


AAACATCGAC 


660 


ACTTGCATCA 


ATTGCATCTT 


GGCGGAACCT 


TTTGGCATGA 


TTTTGCTCCC 


TATCAATATT 


720 


TAAAATTAAT 


GGTTGGCAAA 


GCAAATGTTT 


CGACAATTTG 


CGAAGCAACA 


ATACCTAAGA 


780 


CACCTACATG 


CCAATTTTCT 


TTTGCTTAAA 


AGTAAAAATA AATCTCCCCT 


TTTAACTTCC 


840 


GGTTCTGCAA 


TAGCCATTGC 


TTCTTCTGTG 


ATAGTTGCTA 


CAATATCTTT 


TCTTTCACGG 


900 


TTAAAATGTT 


CAACTTGTTC 


TGCTAAAAAT 


GCAGCTTCTT 


CTTCGTCGTC 


AGTCATCAAC 


960 


AATTCGCAAG 


CTAATGATGC 


GTCATCTAAA 


CGACCTACAG 


CATTAAGTCT 


AGGTCCAG 


1018 



(2) INFORMATION FOR SEQ ID NO: 1025: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025: 

CCCAACCGTG CATGTGTGCC GTCGCATCTT GTTCCAACTT TCAATATAGG CCACGTTTGA 60 

CCAATCGGTA CGAGTGCACC AAAGTGCAGT TCAATAGTAC CATGCTCTTT TCCATGCTTC 120 

TATATGCGCA CGTGTTACGA TGAATCATTG TTGTTCGCTA CTCTGTTGAN AAAGTCCTAA 180 

AAACCAATTA NCCGGCNAAT ACCCCAATTC CTTTTAAATC CAATTGTCCT TTCTAAATAA 240 

CGGTTTGTTC TCCGACCAAA GTGGATTGGT TGCATCCAAT GTGAGTACCA TATTGCGTTA 300 

CAATATTCCA ACGTTGCACA TAGAAACCAT GATCTTTAAC CGTGAATAAA GTTGAAACTT 360 

CACCTTTTTC AAACTCACTA AAACGTGGTA TTTCTGGATC AAATGTATGC GTTAAATCAA 420 
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CCCAAGTTGC TTGTTTTAAA GTATTTAATT GTTGCCATAA AGGATATTGT GTCATAAAAT 480 

CACCCGTTTT TAGTTTATTA TATGATAAAT GCTGCGATTA TTCTTGGCGT TTAGCTTTAA 540 

CAGCATTCAC AAGTACAGTC AATGCATCTT TAACTTCTTC TTCTTTTCGC TCGTGCCAAC 600 

CACAGTCAGG GTTTACCCAG NATAATTGAG CGGCCGATTT GTTGTTGTGA ACGATTGATT 660 

GCTGTAGTAA TTTCTTCT 678 

(2) INFORMATION FOR SEQ ID NO:1026: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026: 



CTGAAGATGA 


TGAAATGATT 


TTTAATATTC 


CGGCACGAGA 


CTATTGAGTA 


TAAATATGGA 


60 


CAAGCGATTT 


ATGGAGTCGG 


AGCAGGTATT 


ACAATTGATT 


CTAAGCCAAA 


AGATGAAGTA 


120 


AATGAATTTT 


ACGCAAAAAC 


CAAGATTTTG 


GAGATGTTAT 


AATGCAATTA 


TTTGAAACAA 


180 


TGANAATTGA 


TAATGGACAT 


ATCCCTAGAC 


TTACTTATCA 


TACTAATCGC 


ATAAAATGTT 


240 


CTTCTGAGCG 


ATTAAACTTT 


AAATTTGATG 


AACATGCATG 


GCGAAATGAA 


TTAAACGATG 


300 


TAACAACAAA 


GTATCACAGT 


GGTCAATATA 


GACTTAAAAT 


CGTATTAAAT GCTGAAAGCA 


360 


AATTTGAAAC 


GATAGTGTCA 


CCTTTACCTG 


AGAAAAGTAG 


TTTTACAGCA 


AAATTTCAAG 


420 


TGTTGCCCCA 


AAGTTAGTTA 


ATCCAACTTT 


TATANNGAAA 


TAAAACGACN 


GAACGAAAGC 


480 


ATTTAGCACA 


CAATCNATGA 


AACGATTTAA 


TATTGCTACT 


TCCAGAGACG 


GCAGTCCCTT 


540 


GATTTGA 












547 



(2) INFORMATION FOR SEQ ID NO: 1027: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027: 

GCCTGTGGTT CCAATCATTT CANTTATCGG TTGTCCTTAC NTTTTAATAT TTAAGTTANA 60 
AAAAGGAGGA CGCTCCAATA TGATTAGTTC CAAATAATAA ACGCAGACAA TTGATAGCAC 120 
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TGCCTGTTTT TAAGCATTCT ACTATTTCTA GGTTGTACTT GGAGTATTAC CTCAGGTGAA 180 

TACAACATAC CTGTTGAAAG ATTTTTCAAA ACTTTAATTG GACAAGGTGA TGCCATTGAT 240 

GAGTTAATCT TATTAGATTT CAGGTTACCT CGGATGATGA TTACTATTTT GGCTGGCGCA 300 

GCGCTTAGTA TTAGTGGTGC AATAGTGCAA AGTGTCACAA AAAATCCAAT AGCTGAACCA 360 

GGTATATTAG GTATTAACGC AGGTGGCGGA TTTGCAATCG CATTATTTAT TGCAATTGGT 420 

AAAATTAATG CTGACAACTT TGTTTAGGTC TCGTGCC 457 

(2) INFORMATION FOR SEQ ID NO: 1028: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028: 



CTGCCATTGG 


ATACTTGTGG 


TGCCATGTCC 


CCGGGTATGA 


TAGGCTATTG 


GTTGGAAACT 


60 


GAAATCAATC 


GCATTTTAAC 


TGAAATGAAT 


AGTGATAGAA 


CTGTAGGCAC 


AATCGTTACA 


120 


ACGTGTGGAA 


GTAGATAAAG 


ATGATCCACG 


GATTCAATAA ACCCAACCAA ACCAATTGGT 


180 


CCTTTTTATN 


CGAAAGAAGA 


AGTTGGAAGA 


ATTACAAAAA 


GAACAGCCAG 


ACTCAGTCTT 


240 


TAAAGAAGAT 


GCAGGACGTG 


GTTATAGAAA 


AGTAGTTGCG 


TCACCACTAC 


CTCAATCTAT 


300 


ACTAGAACAC 


CAGTTAATTC 


GAACTTTAGC 


AGACGGTAAA 


AATATTGTCA 


TTGCATGCGG 


360 


TGGTGGCGGT 


ATTCCAGTTA 


TAAAAAAAGA 


AAATACCTAT 


GGAAGNTGTT 


GAAGCGGTTA 


420 


TAGATAAAGA 


TTTTGCTAGT 


GAGAAATTAN 


CAACGCTGAT 


TGAAGCAGAT 


ACCTTAATGA 


480 


TTCTTACGAA 


TGTAGAAAAT 


GTATTTATTA 


ACTTTAATGA 


ACCTAATCAA 


CAACAAATCG 


540 


ATGATATTGA 


TGTAGCAACA 


CTGAAAAAAT 


ACGCTCGTGC 


C 




581 



(2) INFORMATION FOR SEQ ID NO: 1029: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029: 
CTACATTATT ACTTACGCTT AAAATGCTTT AAATTTAAGA AATGATATAA GTTAGGTGCC 60 
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CAGGTACTAA 


AGTTTAGTAG 


GAATCCATCA 


TGTCCAACAT 


TATCAGGCAC 


GAAGAAATGA 


120 


CGATGATATT 


TAAAACGTTC 


ACCTAATGCA 


CGAACTTGAT 


CGTCCGGATA 


TAGCAAATCA 


180 


TCTATGAACC 


CCATCGNTAA 


GACTTTCGTT 


TCTAAATTTT 


TAAAAATATG 


CGTTACGTCT 


240 


GTGCGACCTC 


GGTCAATGTT 


GTGACTATCC 


AATACATCTA 


GCAGTGTCAG 


ATAACNATTC 


300 


AAATCAAAAT 


GTTCTTTAAA 


TTTATTACCT 


TGATGTTGTT 


GGTATGCGAC 


TNCTTCATCC 


360 


GGCGTGAAAC 


GTTCATCATA 


ACTTTTTGAT 


GATCGATATG 


TCAAAAAACC 


TAATTGGCGT 


420 


GCAATACTTA 


GACCTTCCTT 


ACCACCAAGA 


TGAATGGCTT 


GCCTTGCAAT 


TTCATTGAAA 


480 


GCTCTACTAT 


AAGATGATGT 


TCGACTTGTT 


GCAGCAAGGA 


TAATGGCTTT 


NTCTACTTCA 


540 


AACTGTTGAT 


TGTAAAGTAG 


CTCCATTGAT 


TGCATACCTC 


CAAGGACTTC 


CCCCTATNAA 


600 


AATATTAATC 


TTATCATAAA 


CAAGGGCTTG 


TATACCTCGC 


ATCATGCGCT 


CTGACTATAT 


660 


CTCTTAATGT 


TAATlTm A 


uuAAAAluAu 


ubl\.ol 1 1AA 


AUuluAAv.1 I 


r* a nrrrz & & a ci 

unuV. Lwnnftw 




GACTACCAAT 


AACATCAAAT 


GTTAAAAATT 


GATAATCGNG 


AATGGGTATT 


TTTCCCCCCA 


780 


TCANTAATTT 


CTCGTCACCA 


ACCCGGATAA 


TCANCTTGTT 


CCATATTGTT 


AANTGATTGG 


840 


CAGGNTAATG 


CATGACAAAC 


TACAACTAAT 


GGTTGTCCAT 


GATAACCAAC 


ATGNTCATAT 


900 


CTCAAACGCA 


AGTATCTATG 


ATTCCCCAGG 


TCT 






933 



(2) INFORMATION FOR SEQ ID NO: 1030: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1030: 



GGCACGAGCG 


ACAGGTTTTT 


TAGGTGCTTA 


TCTGATTGAA 


GCACTACAAG 


GCATACAGTC 


60 


ATCGCATTTA 


TTGTTTCATA 


CGTGCTGATA 


ATGAGGAAAT 


AGCATGGTAT 


AAGTTGATGA 


120 


CGAATTTAAA 


TGATTATTTT 


TCAGAAGAGA 


CGGTTGAAAT 


GATGTTATCA 


AACATTGAAG 


180 


TCATTGTTGG 


TGATTTCGAG 


TGTATGGATG 


ATGTTGTTTT 


ACCTGAAAAC 


ATGGATACNA 


240 


TTATTCATGC 


ANGTGCTCGT 


ACGGATCACT 


TTGGTGATGA 


TGATGAATTT 


GAAAAAGTAA 


300 


ATGTTCAAGG 


TACTGTTGAT 


GTCATACNTG 


TGGCACAACA 


ACATCATGCA 


AGGTTAATAT 


360 


ATGTGTCTAC 


GATAAGTGTG 


GGAACTTATT 


TTGATATAGA 


CACAGAAGAT 


GTGACATTTT 


420 


CAGAAGCGGA 


TGTCTATAAA 


GGGCAACTAC 


TAACATCACC 


ATATACACGG 


AGCAAATTTT 


480 


ATAGTGAATT 


AAATATTAGA 


AGCTGTAAAT 


AATGGCTTAG 


ATGTCGATTG 


TACTGTTGGT 


540 


ATTTGAC 












547 



(2) INFORMATION FOR SEQ ID NO: 1031: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 476 base pairs 
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<B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 1031 : 



AAGTATTAAC 


AAGACCATGA 


AATTATGGAT 


GCAGCTGATT 


ATTGGAAGTT 


AACCCAACCA 


60 


ATTAAATTTG 


GCACGAGCGG 


ATGTAATTAC 


CAAATACCTA 


GGTGTTGCAA 


GAAGATGCAT 


120 


CNAATTAAAG 


CAGCTATTGA 


AGAAGCTCAT 


AAAANTAATA 


AACAATTACT 


AGTTGATATG 


180 


ATTGCTGTTC 


AAGATTTAAG 


AAAAACGTGC 


AAAACAACTA 


GATGAAATGG 


GTGCTGATTA 


240 


TATTGCAGTA 


CACACTGGTT 


ATGATTTACA 


AGCAGAAGGG 


CNATCACCAT 


TAGAAAGTTT 


300 


AAGAACCGTT 


AAATCTGTTA 


TTAAAAATTC 


TAAAGTTGCA 


GTAGCAGGTG 


GAANTAAACC 


360 


AGATACAATT 


AAAGATATTG 


TCGCTGAANG 


TCCTGATCTT 


GTTATTGTTG 


GTGGCGGAAT 


420 


CGCAAATGCA 


GATGATCCAG 


TAGAATGCTG 


CAAAACAATG 


TCGCGCGTGC 


CNTCGT 


476 



(2) INFORMATION FOR SEQ ID NO: 1032: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032: 



AGCCGCTGGT 


GTCAATTTGT 


GAGATTATGA 


ATGATGACGG 


CACCGATGGC 


GAAAGGACCA 


60 


AGATTTACCA 


AAATTTTTAA 


AAGAAAAACA 


TCAATTAAAG 


ATGATTACGA 


TTGATGATTT 


120 


AATTGAATAT 


CGTAAAAAAT 


TAGAACCAGA 


AATTGAATTT 


AAGCCCAAAA 


GTGAAAATGC 


180 


CTACAGATTT 


CGGAACATTT 


GATATGTATG 


GTTTTAAAGC 


GACATACACA 


GATGAAGAGA 


240 


TAGTTGTACT 


GACAAAAGGT 


GCAATTCGAC 


AACATGAAAA 


TGTACGCTTA 


CATTCTGCGT 


300 


GCCTTACAGG 


CGATATTTTC 


CATAGTCAAC 


GTTGTGATTG 


TGGTGCTCAA 


CTTGAATCGT 


360 


CTATGAAGTA 


TATCAATGAA 


CATGGTGGCA 


TGATTATTTA 


TCTACCTCAA 


GAAGGTCGTG 


420 


GCATAGGATT 


GTTAAACAAA 


TTACGCTCGT 


GCCACGAGGA 


T 




461 



(2) INFORMATION FOR SEQ ID NO: 1033: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 : 



AAATTTCCAA 


GGCCAATTGC 


CATACCATTT 


ATTGTTCAGT 


ACATATTTAA 


ATAAAATTCC 


60 


TTTGTAGTAA 


TTCTGTTTCT 


AATGTTTCTT 


ATCGCTTGTA 


GCAAAGTCGT 


TTTTATAAAT 


120 


TTCGATTGGA 


GGTAAACCTT 


GAGATTGAGG 


TGTGTCTCTA 


ATCAATACGT 


ATGAAATTGC 


180 


GCCAATGATA 


AGTGATAAGA 


GTGCAGGGTA 


AATGAATACA 


CCTTCGAAAC 


CTTTTAAATA 


240 


ACCAAAGTTG 


ATAAATGCTG 


TTGTTGTAAT 


ACCCCAAGCA 


GCAATAGGTG 


CCATAATACC 


300 


TCCACCAACA 


TTATGCGCAA 


CGTTCCAAAG 


GGCAGTCTTA 


CTTCCGCTCG 


TGCACTTACA 


360 


CTAAACCAGT 


GAACGAGAAC 


ACGGCCTGAA 


GGTGGCCAGC 


CCATACCTTG 


AAACCATCCA 


420 


TTTAAGAATA 


ATAGGACAAA 


CATAATACCG 


ATACCTGATG 


TAAAGAACGG 


TACAAATCCC 


480 


ATTAACAAAT 


TGACGATAGC 


AGTGAGTGCT 


AATCCAAGAA 


CTAAGAATAT 


CCGAGCATTG 


540 


CTCCGATCAC 


TTACAGTACC 


CATAAAGAAC 


TTACTAAATC 


CATATGCGAT 


GGAAACAGCA 


600 


GAAAGTGCAA 


AACCTAGTTC 


CGCTTTTGTA 


AAACCTTGCT 


CTTGCAATGC 


TGGCATCGCT 


660 


AATGAAAAGT 


TTTTACGTAA 


TAAATAGTCT 


CGTGCC 






696 



(2) INFORMATION FOR SEQ ID NO:1034: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1034: 



ATCTGAGAGT 


TNCAATTTCA 


CCAATGATGC 


AGTACGGGTG 


TGTCTCTGTA 


ATCGACAATA 


60 


TTGCGGTAAT 


TATTAATATC 


TGAAACCGTT 


GCCTTACCCT 


TATCAATTAA ACGCTCAAGT 


120 


GTAGATGTTT 


CGATTACCTT 


TTGCAAATAT 


CCNTTCTAAA 


CGTTTGAGTT 


CTNTTTGTAT 


180 


TTTTGATTAT 


TAGGTTCTGA 


ATTTAATAAA 


AATAACAATT 


CTCCATAGTA 


TTGGTTTAAT 


240 


ACTGGGCGAT 


AATTTGTATC 


GTTTTTUTCT 


GATTTAACTT 


TCGTTTTTAA 


AATGCTTGTA 


300 


TCACTTTTTG 


AACAATGAAA 


ATTTTGGCAA 


GATTGATAAG 


TCATACCTTT 


AAAAGTAGTA 


360 


TCTTCGGCAG 


ATGGTGTAAT 


TTAAAGGTAA 


AAACAATTTG 


CGCTAAGATT 


AAACTAATTA 


420 


ATACCATGAA 


AGATGCAATA 


AAACAATAAA 


TCATTACGGT 


ATTCGAATGC 


TTGTCCTTTT 


480 


GTAATGATAA 


ATGGTAATGT 


AAGTGCCATT 


GAAAGTGAAA 


ATTGTACTCG 


TGAATACCAC 


540 


ACATCGTCAT 


AATAAATGCG 


TAACGCACAC 


GAGAAGGTGG 


TGTTTCATGT 


GAATCATGTT 


600 


CTTCGTCTAG 


ATAAGATTGT 


ATATTTTTCG 


GGAAATAAAA 


ATCTTTGTAC 


CAGAAATAAA 


660 
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CCCAAACAAA TCGACATGCA TAAATTGCAA TAGCGATTAA TATAGTGATC ACTATTAAAA 720 

TGAAAAGTTG TCAGGCTCTG TTTGGAAAAT ATCTATGACT ACAGTAGGAA TCATAAAACC 780 

TAATACAACA AATACAAAAC CATTTAAGGC ATAACTAAAT GTGTTCCAAA TTTGATGGTA 840 

ATTCATTTGT AGTTCCGTTT GTGCTCTAAT TAAACGGTCG CTCGTGCC 888 

<2) INFORMATION FOR SEQ ID NO: 1035: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 1035: 



CTAGTTTTAA 


CACTGGAACA 


TTGAGCGTTT 


TGCTACTTGT 


AAATACTAAT 


GGAATCAGTA 


60 


AGACTAAACC 


TACTTCAAAG 


AACAATGCAA 


TACCGACGAT 


AAATGCTGCA 


ACCAAGCATT 


120 


GCCCATTGTA 


CATGTTTTTG 


ACCAAATTTT 


TGANTCAACG 


TGTCTGCAAT 


TCGAGTTGCA 


1.80 


CCACCACCAT 


CAGCAAGTAA 


TTTCCCAAGT 


ATGGCACCTA 


AACCGAATAT 


TAGTGCAATG 


240 


TGNCCGAGCG 


TNCTGCCCAT 


CCCTTTCTCA 


ATCGTCTCCA 


TAATTTTAGT 


CAATGGTATA 


300 


CCTAGCATTA 


ACGCTGTAAT 


CATCGATGTG 


ATAATTAATG 


AAATAAATGT 


TATTTAATTT 


360 


AAACCCAATA 


ATTAATACTA 


ATAAAATAAC 


GATACCTAAA 


ACAACACTGA 


TTAACGGCCA 


420 


TATTTCGTTA 


AACATGACAT 


TCCCCTCTTT 


CTCTTTTCAA 


TAGAATGTAA 


CACCGTCGTC 


480 


GCGCACGTAC 


CTTCCAGATA 


ATATATAGTA 


GTCATGGCAC 


AACAACTTTA 


TCTAAGATGC 


540 


TTGAGTGACG 


TATTATTGTG 


TTT 








563 



(2) INFORMATION FOR SEQ ID NO: 1036: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036: 

CCGATATCAA ATATATGTNG AAATAGCTCT AATTTTTCAT TACTTTTCTT ATCACTACTT 60 
AATTCTTTGG CCTGTTTAAT AGGATCTCAC AACATGTGAT TGATGATACT TTGTGTATGT 120 
TTAGATATAA TTTTACGTTC TCTTTCGCTT AACCCTGGCA ATTTACGATC AATACTGTCC 180 

884 



WO 97/30070 



PCT/DS97/02318 



ATCGTTTCTG CTTGAATTGC CATAGCTTTC ACACGTAAAG CTCTAATCAC TGGAACAACA 240 

CCCAACATAC TAATCCACTC ATTGTGTGCA TGTATTTCTG CAGGAATTTG TTCCGAAATT 300 

GTTGCAGCCG CTAATTGTCG CTCACGTAAG TNTGCATCAA CTTAAACCTT TTAAGTCATC 360 

AACAACATAA TTAAAGATGT TTGGTGATGG CACTTAATAC AGGTTCCAAT ATCTCGAGGA 420 

ACTGCAATAT CAATCAATAC TAGTGATCTT GCTTTCTATT CTCTGCAACT CCTTCTATCA 480 

ATTCATTTGG TATGATATAG ATGG 504 



(2) INFORMATION FOR SEQ ID NO: 1037: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037: 



CATCCAAAGA 


CATNTAGTAC 


TNGATGACTT 


NCGNTTAATA 


TGTAAANCAC 


GAGCGTGCAA 


60 


TTTGCTCAGT 


TACNTCTGAC 


ACTGCTGCTG 


NCCCCANAGA 


TTTATAACCA 


TGGAATATCC 


120 


CTCTCCATNT 


CGCTAATACA 


GGAATAAAGA 


TAACAACCAT 


ACTAATGATT 


NNTATAATCC 


180 


ANGTAATATC 


ATCGACTGAC 


CANCCGGTTT 


TATCATGAAT 


GTTTTCTAGC 


TNATNGTAAT 


240 


TCAGAAATAT 


AAGGTGCTAA 


GNAATNCAGT 


CACGTGCCAA 


GAAACCTAAA 


ACACCGGTAA 


300 


TACTCATTAC 


AATAAAACTC 


GATTTATAAA ATTTCTGACT 


TACTTTATAT 


GCCCCAATAG 


360 


CATTATATCT 


CGCAACATAT 


TTCGAAGCTG 


CTTAATGGTA 


CACCTGCTGT 


CG 


412 



(2) INFORMATION FOR SEQ ID NO: 1038: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1038: 

GGCACGAGCT CATAATATCA AAAAAGGTGC AGAAGAAATG GTTGAAGCGG CAGGCGATAA 60 

AATCAAAGAT GGTGCATCTT GGTTANGCGA TAAAATCGGC GATGTGTGGG ATTATGTACA 120 

ACATCCAGGG AAACTAGTAA ATAAAGTAAT GTCAGGTTTA AATATTAATT TTGGAGGCGG 180 

AGCTAACGCT ACAGTAAAAA TTGCTAAAGG CACGAGCGTA CTCATTGCTC AAAAAGAAAT 240 
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TAGTAGACAA AGTAAAATCG TGGTTTGAAG ATTTTGGTGG CGGAGGCGAT GGAAGCTATC 300 

TATTTGACCA TCCAATTTGG CAAAGGTTTG GGAGTTACAC AGGTGGACTT AACTTTAATG 360 

GCGGTCCTCA CTATGGTATC GACTTTGGTA TGCCTACAGG AACGAACATT TATGCTGTTA 420 

AAGGCGGTAT AGCTGATAAA GTATGGACTG ATTACGGTGG CGGTAATCCT ATACAAATTA 480 

AAACCGGTGC CTAACGAATG GAACTGGTAT ATGCATTTAT CTAAGCAATT AGCAAGACAA 540 

GGCCAACGTA TTAAAGCTGG TCAACTGATA GGGAAATCAG GTGCTACAGG TAATTTCGTT 600 

AGAGGAGCAC ACTTACATTT CCAATTGATG CC 632 

(2) INFORMATION FOR SEQ ID NO: 1039: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039: 



GAATTCGGCA 


CGAGCTAGCC 


CAGCCATTAG 


AGCCATTAGC 


TCAGTTGGTA 


GAGCATCTGA 


60 


CTTTTAATCA 


GAGGGTCAGA 


GGTTCGAATC 


CTCTATGGCT 


CACTACTTGC 


ACTTTCCATT 


120 


TGTGGGAAGT 


GCTTTTTTTT 


ATGAAGTAAT 


AAGATAAACA 


GATTATATGT 


GTAGCTATTG 


180 


CTTGATAATG 


GTAAGCGCAT 


ACATTATTCT 


TTAATTACAT 


AGAGCAAAGG 


GGGACGCTTA 


240 


TGACAAAGAC 


AAAAGCAATT 


GATATTATAG 


GTGCACCATC 


AACATTTGGA 


CAAAGAAAAT 


300 


TAGGTGTTGA 


TTTAGGACCA 


ACAGCAATTA 


GATATGCTGG 


ATTAATCTCA 


AGATTAANGC 


360 


AATTAGCACC 


TTGATGTATT 


AGACAAGGGG 


GGATATTAAG 


GNACCTGCTG 


TGAACATTGA 


420 


AAATTTCATA 


G 










431 



(2) INFORMATION FOR SEQ ID NO: 1040: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040: 

CTAGGCACAA TAAGCTATGT AAAATATATA GCCTAGTTAA GTATGCAAAG GGAGCGTTAG 60 
ATTTATGCAG ATAAAATTAC CAAAACCTTT CTTTTTTGAA GAAGGCAAAC GTGCCGTGTT 120 
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ATTATTACAT GGATTTACAG GCAATTCGTC TGATGTACGT CAATTAGGTC GATTTTTACA_ 180 

AAAAAAGGGA TATACTTCGT ATGCACCACA ATATGAAGGT CACGCGGCAC CACCAGAGGA 240 

AATACTGAAA TCTAGTCCTT TCGNTTGGTT TAAAGATGCG TTAGATGGNT ATGATTATCT 300 

NGTTGAACAA GGTTATGATG AAATTGTTGG GTGCTGGTCT ATC ATT ANNA GGGTGGGGAT 360 

TTTGCGTTTT AAAATTAAAG CTTAAATAGA AATGTTAAGG GTATTGTAAC CATGTGTGCA 420 

CCAATGGGTG GCAAAACTGA AAGTGCCA 448 

(2) INFORMATION FOR SEQ ID NO: 1041: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041: 



CTAGATGATA 


CTGAACCAAA 


AATTTATGCA 


GTTCGTGGTC 


CATCTGGCAT 


TGGTAAAACT 


60 


ACTGTTTTAA 


ATATGATTGC 


CGGATTACGT 


AAAGCAGATG 


AAGCTATTAT 


CGAAGTGAAT 


120 


GGGCAATTGC 


TTACTGATAC 


GACAAAAAAC 


GTGAATGTTA 


AAATNCAACA 


ACGACGTNTT 


180 


GGATATCTGT 


TTCAAGACTA 


CCAATTGTTT 


CCTAATATGA 


CGGTTTATAA 


AAATATTACT 


240 


TTTATGGCCT 


GAANCCCATC 


TAAACNACAT 


CGATCAATTA 


ATTCAAACTT 


TAAACNATTA 


300 


ATCATNTGAT 


GAAACAATAT 


CCTATGACAT 


TGTCAGGTGG 


AGAGGCACAA 


CGTGTAGCAC 


360 


TTGCACGTGC 


ACTTAGCACG 


AAACCAGATT 


TAATTTTATT 


AGATGAACCT 


TTTTCTAGTT 


420 


TGGATGATAC 


TACACAAGAT 


GAGAGTATTA 


CATTAGTTAA 


ACGTATTTTC 


ANCGANTGGC 


480 


ANATACCAAT 


CATATTTGTG 


ACACATTCAN 


ACTATGAAGC 


AGANCACATG 


GCTCATGAAA 


540 


TTATTACAAT 


TGGGTAATCA 


TTTATTTGCC 


ATTAAAGAGT 


TTAGAACGTA 


TTTAAAATTG 


600 


TAGAAGTGAA 


TGCTTCTATC 


AGCATTTTAA 


TGATGTTTTA 


AACTCTTTTT 


TAGGGGCAGT 


660 


TTTTTTGAGA 


GACATTGACG 


CGCTCGTGCC 








690 



(2) INFORMATION FOR SEQ ID NO: 1042: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042: 
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rrpaTATA'PP 

UUUAl nlAi X 


AOX 1AAAA11. 


AAub x\_A\j*jA 


lAAiuALAAl 


bLTil'lAAAG 


TTTTAGCTAC 


60 


ATOO AAT AC A 


XAX lAlAAAA 


ATA A A AO A 
AlnnAALALn 


X X lAvjAALuu 


AblAIALAAT 


GATTCTAGTC 


120 


ATAGATAATA 


AluAi XVAX X 


TAP AT ATA AT 
1 ALA1 AX AAX 


X lAAlAuALi 


A I ATI AAtaAC 


TCAAACGAAA 


180 


OTAAPARTTP 


AAuX iui 1\jLj 


XAX XotAX AAX 


CVdnTfl AT AO 
I- lu\, XuAX Av? 


AAbAL GTCAT 


TAATATGAAG 


240 


C1AAAAGCAAT 


1 \J X X A X X X V~VJ 




GTA ATPPPJT2A 
O X AA X LIAjvA 


luAi X AX\ATT 


ATCTTGAATG 


300 


AAGTGTTAGA 


A P A A TTT A A T 
nwini x X.AAX 


PAGGCACGAG 


TAPPTATAPT 
x rtV— a a x nv. x 


AGGTfiTATOT 


TTAGGATTTC 


360 


A ATHTA TPfiP 


u 1 AX XXX 


gggtsgagaat 


ATOA ATOAOO 
A X A_AA IVAtAJ 




TGTNCACGGA 


420 




Abu X ALui. l_~A 


T AO/** A ATT* A A 
X ALLnA X\? AA 


vjla X A X X XX X\_ 


aAWJaCTGCC 


TCAAAATTTC 


480 


AATGTAATGC 


GTTATCATTC 


ATTAATTGCT 


GACGGAGCGA 


CTTTTCCAAA 


TTGCTTAAAG 


540 


ATTACAGCAA 


AAAACGATGA 


AGCGATTATT 


ATGGCATTTG 


AGCATATTAG 


ATTTCCGGTT 


600 


TTTGGTGTGC 


AATATCATCC 


TGAATCTATT 


TTGAGTGAAT 


ACGGTTATCG 


ACAAGTTTGA 


660 


ATTATTTTTA 


TCGAAGGTAG 


GTTGATTACT 


GTGAGAAATA 


GAATATAATT 


ATCGCTACTA 


720 


TTTAACTGAA 


AATGAATATA 


AGCAATACCA 


TATTCAATTA 


AAGGATTTAT 


CAAGAAGTAT 


780 


GTTGCTACTA 


AGTTGGCTGA 


TGTGGGAGAA 


GTGATACACT 


TTGCACAAGC 


GCCA 


834 



(2) INFORMATION FOR SEQ ID NO: 1043: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043: 



GGCACGAGCG 


CTTGTTGTGC 


ATCGACTAAG 


TTCATTTCAA TTTCTLTAGG 


ATTTTTAAAA 


60 


CTTTCTTTAA 


CAGCGTCTTT 


AGTTATTTCG 


TTGAAAACAA 


GCGTGCCGAT 


TTTCTTTAGA 


120 


ATCTTCAAGC 


TCTAAAATTT 


TTGATAAATG 


CCAAGCAATT 


GCTTCACCTT 


CACGGTCGGG 


180 


GTCACTTGCG 


AGAAAGACGT 


TTTTCGCTTT 


TTTTGCATGT 


TTTTTCAATT 


CTTTTACAAC 


240 


AGGACCTTTT 


CCGCGTATTG 


TTATATATTT 


TGGGTCGGAA 


TTATCTTCAG 


TGTCGACACC 


300 


CATTTGACTT 


CTTGGTAAGT 


CTCTTGACGT 


GTCCCATTGA 


AGCTATAACT 


TTATATTTCT 


360 


TACCCTAAAT 


ACTTTTCAAT 


GGTTTTTGGC 


TTTTGGCAGG 


GCCAATCAAC 


AATGAATAAA 


420 


TTATCTGCCA 


ATGTCATTTC 


CCCCTCGCTA 


ATCTAATTAC 


TAAAGATAAT 


GATAAACGCT 


480 


TAATTTTGCT 


TTTGTCAACG 


TTTTATATCT 


AAACAAATCA 


TTTCCCTGTC 


TATTTTTATA 


540 


GATTTATAAA 


ATAATACAAG 


GTACAAAGAA 


TATGAGCACT 


CGTATTTAGT 


TT 


592 



(2) INFORMATION FOR SEQ ID NO:1044: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 803 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044: 



V. l_ l_ W./\v_~/\. 1 Art 


1 Inunlni 1 




AAGAAATNPT 


Afi A ATTAfyT^F 


CAGAAGC'TAN 


60 


ATCNGGACCA 


AGGTTGNACA 


ATTGTCATGG 


TTCTTCATGA 


TATCANCCAA 


GCGATTCGTT 


120 


TCTCAGATCA 


TCTTATTGCG 


ATGAAAGAAG 


GGGATATCAT 


CGCTACAGGT 


TCAACAGAAG 


180 


ACGTATTAAC 


ACAGGAAATA 


TTAGAAAAAG 


TTTTTAATAT 


TGATGTTGTT 


TTAAGAAAAG 


240 


ATCCTAAAAC 


TGGAAAACCT 


TTACTGGTAA 


CTTATGACTT 


ATGTCGCACG 


AGCTTATTCT 


300 


TAATTAAGTA 


AGTTAATATG 


ATAAAAAGGA 


CAATTAACAT 


GACAAATAGA 


GAGAACCCAA 


360 


CGCCATTGAA 


GTTTTTNTCC 


TATNTTAGAG 


GTTTAAGTAT 


GATACTACTA 


ATCACACTAT 


420 


TTATTTCTAC 


ATTAATAGGT 


GACGCCAAAA 


TTCAAGCCTC 


TACAATTATA 


GAGGCTATTT 


480 


TTAATTATAA 


TCCTAGCAAT 


CAACAGCAAA 


ACATCATCAA 


TGAGATTAGG 


ATTCCCAGAA 


540 


ATATAGCAGC 


AGTAATTGTA 


GGTATGGCGC 


TTGCAGTTTC 


TGGTGCGATT 


ATACAAGGTG 


600 


TTACTCGTAA 


TGGTCTTGCT 


GATCCGGCGC 


TCATAGGTTT 


AAATTCAGGT 


GCTTCATTTG 


660 


CTTTAGCATT 


AACATATGCA 


GTTTTACCAA 


ACACTTCATT 


TTTAATATTG 


ATGTTTGCTG 


720 


GATTTTAGGT 


GCTATTCTAG 


GAAGTGCTAT 


GTATAATGAT 


AGGCCGATCT 


AGACGTGATG 


780 


GATTTAATCC 


GATGCGTATA 


TTT 








803 



(2) INFORMATION FOR SEQ ID NO: 1045: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045: 



GGCACGAGCT 


AATGCGAAGC 


GTGATCAAAT 


CANTTCAAAT 


CCAGATTTAA 


CACCTGAGCA 


60 


AAAAGCAAAA 


GCGCTCAAAG 


AAATTGACGA 


AGCTGAAAAA 


CGAGCACTAC 


AAAACGTTGA 


120 


GAATGCTCAA 


ACTATAGATC 


AATTAAATCG 


AGGATTAAAC 


TTAGGTTTAG 


ATGACATTAG 


180 


AAATACCACA 


TGTATGGGAG 


GTTGATGAAC 


ACACCTGCTG 


TAAATGANAT 


TTTTGAAGCA 


240 


ACACCTGAGC 


AAATCCTAGT 


TAATGGTGAA 


CTCATTGTAC 


ATCGTGATGA 


CATCATTACA 


300 


GAACAAGATA 


TTCTTGCACA 


CATAAACCTA 


ATTGATCAGC 


TTTCAGCAGA 


AGTTATTGAT 


360 


ACACCATCAA 


CTGCAACGAT 


TTCTGATAGC 


TTAACAGCAA AAGTTGAAGT 


TACATTGCTT 


420 


GAGGGATCAA 


AGTGATTGGT 


AATGTTCCTG 


TAAAAGTTGT 


GAAAAGATTG 


TCAGTAGTCA 


480 
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ACAACAGGCA ATGATCAATC GAAATGCGGA CACAAAGTTG GTGAATCAAT ATAGTGTGAC 540 
ATTACACTGG ACAAAAGGAG CTGCAATGAG AGTTAAT 577 

(2) INFORMATION FOR SEQ ID NO: 1046: 

J (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 1046: 



TTATTGGACC 


CGGAAAATAT 


TACCCAGGTT 


AAACCCTATT 


CAGTNCAGCT 


ATTTCCCTAT 


60 


CCCTATGGTC 


ATGGGTGTGG 


AGGTGTTCCC 


GCACCATGAT 


GTCAAACAAT 


ANCCACATAT 


120 


CCCTGCTACA 


GAAATTATGA 


GACATTGNCC 


GACACATCCG 


AACAATCAAC 


ATAAAGGTTA 


180 


TGCTGGACAT 


CACTTATTCA 


TTAAGCTTGN 


CAATTATCAT 


CAAACGGATA 


TTTCCAGAAG 


240 


CATATGTACC 


ATTTTTACAA 


AAATAACAAT 


CCTAATATTC 


GTAGACGGCA 


CTGGTAAATC 


300 


CTTAGAAGAT 


CTTAAAGGTA 


TTATCGATCA 


AGGTCAACCT 


GTGTTAATAT 


ACCATACTAA 


360 


TCTTGGCTCT 


AAACCATTAC 


TACGTGTTTT 


CCGCTTTGAT 


AACAAGCCAG 


CGAAACAAGT 


420 


TTCAAATATT 


CACGTAACGG 


TACTCATTGG 


GTACGATGAT 


TACTATTACT 


ATTATATCGA 


480 


TCCACTTTGG 


AGTCATATTC 


GTCGTGGCCT 


AGTTTTACCT 


GCCATCATTC 


CAAATCGCAA 


540 


ACAAATTATT 


AAAATTCGTA 


AAGAGAAAAT 


GGAATACAGC 


TTTAATTCAC 


CAGGAAGAAA 


600 


ATGCATTTAT 


GTGCAACCAC 


ATTCATATAC 


AATTGAAAAT 


CAACAACAAA 


ATAAACACAC 


660 


GTAAGTAAAC 


ACATTACTCG 


CTCGTGCC 








688 



(2) INFORMATION FOR SEQ ID NO: 1047: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047: 

CCGTATCGTC AGCATTTAAT TCCTGAATTT GATGAGTGAA ACAATTGCTA GTCAACATAA 60 

TGCCTATCCA ACTGTAATTA GTGGTGCTGG ACCAACTATT TTAATATTTA GTCGTAAAGA 120 

AATAGTGGGA ATTGGTTCGC TCCTTTAAAT AGTCAGGTAG TATCATGCCA TTCGGAATTG 180 

890 



WO 97/30070 



PCT/US97/02J18 





nln l\,nVi Xuu 


lUi InnAUUA 


CCGAATTGTA TACCAATAGA CGCTTTATAT 


240 




Inl Innnlut 


nun/V X f\\3£\\J-t\ 


GGAGATTTAA 


TGCGATATGA 


CAAATTATAA 


300 


nul IVtIaJuI 1 


1 InuAlnluu 


a nv2 a p ap & tt 

niuALiiv.nl 1 


GCTAAATTCA 


GATAATGTGA 


TATCAGAAGA 


JUU 


AAL. 1 \jL-AAA1 


T A 111 AACAIj 


LAA1 lUiAbA 


TGAAGGTTAT 


TATGTTGTTT 


TAGCATCTGG 


A 9 A 


i AliALL 1 AU 1 


IjAAljO 1 A 1 \»A 


rnrapr»» APTP.P 
1 RLAAL lot 


TAGAGATTTA 

x nuvivsn x x x 


AAATTACCTG 


AA PA TP AT AH 


>! Q A 


CTAGATTATG 


CCGCTAGTGA 


CAACGGTAGT 


AAAACGATTA 


ACATGACTAA 


TGAAGAAGTA 


540 


GAAGTAA6TA 


AATCGATTGG 


TAAGCAAGAT 


TTCGATGAAA 


TTGTAGATTA 


TTGTCGAGAC 


600 


AGAGGCTTTT 


TCGTTCTTAC 


ATATCATGAT 


GGTCAAATTA 


TTTACGACAG 


CGAACATGAG 


660 


TATATGAATA 


TTGAAGCAGA 


ATTAACAGGT 


TTACCGATGA 


AACGTGTTGA 


TGATATCAAA 


720 


GCGTATATTC 


AAGGCGATGT 


CTCGTGCC 








748 



(2) INFORMATION FOR SEQ ID NO: 104 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1048: 



TTTGTAAACA 


ATAACTTTAG 


GTAANAAACC 


TTGTTCTAAA 


TTGTACCCGT 


CNCCCNACGC 


60 


TCTTTGTCTG 


ANCCATACAA 


GNNACGTAAT 


TGGTGTGGTT 


TACCTGGTTT 


CCTCGTTCGT 


120 


AAACGCCATC 


CTTCATCATA 


ATCTTCATTA 


AAAACAATTT 


TGCGAATCTC 


TTTAGGGAAC 


180 


GCATGCATTG 


GTAATGAGAT 


TTTACCTTCT 


CCATCTGTAT 


TCCCTGATTC 


AATTGGACCG 


240 


CCATCAGCAT 


AAGCTGTTTT 


TAGTTCTCCA 


CCAGTTTCAA 


CACCAATTTT 


TTGTAATCCT 


300 


CTTGTTTTNG 


TAATATCACT 


ATATTTTAAT 


TCCGCGCCTT 


CTTTCGTTAA 


TTTAGCGAAA 


360 


CCTAAACCAG 


TAATGTTAAT 


ATACGCCTTT 


GGCGCACTTG 


CATGTTTTAC 


TGCCATTTAA 


420 


TTTTCCTCCT 


TATAAAAAAT 


GCCCTCGTAA 


ACGCTCGTGC 


C 




461 



(2) INFORMATION FOR SEQ ID NO: 1049: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S i ngl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049: 
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WfA ATTA A AA 


lAAluLLlAL 


ATTACTTTCT 


TV IV IV A f+ *k m/"*s^m 

AAAAGATCCT 


60 


uniAAinl 


nnunC 1AI»1 1 


blni 1 1AL-AA 


T T*C A IWP A T» A TV* 


AAA A TV* A AAU 

AAAAl\jAAArf 


iv 3V iv mm il rrw"^/"* tv 

AAAl lATCCA 


120 




Af Af rY!5/"**P A a 


AAtAALtuAI in 


LuAL I AAu 1\- 


AATTTTATAA 


AGAGTTAATG 


180 




univjl 1 iuun 


A TiTTiPR T.p»P 

AA 1 AuAALAL 


wuj A 1 TiL. 1 A 


TVTV^A AfV*fIV^*Tt 

AITjAALHjTLjT 


ATTATTGTTA 


240 


11 A Af"* A AfV TV 
AAAC AAl?l_A 1 


/TUP A /VmnV^ TV 


lAAiuLl 1 AC 


fV*A TV /"» 7V A /~»/TP 

UvJAAVj AAC L I 


TTATTGATTT 


TAAAGCGGAT 


300 




AAIvivjAAvrviA 


AUA1 lATGAA 


GCAATCATTG 


AATTATTAAC 


ATAGGTAGAT 


360 


GAAGGAAGAT 


TTGGATCCTA 


CATTCATGTG 


GCATTTAGCG 


TTCGCCTATG 


GTCAAGAAGA 


420 


GCGAGATAAA 


GAAGCTCAAC 


ATTTCTTTGA 


ATTAGCATAT 


CCAACGATGA 


AAACAAATAT 


480 


TGATTTTATG 


AGTGATTATT 


ATTTCT ATTT 


AATTGAAATT 


GGTCAAAAGG 


AACAAGCAAT 


540 


TACTATTTTA 


AATCAATTGT 


TGGAACTAGA 


GCCAAGTAAC 


GAAAATTGGC 


ATGACGAATT 


600 


ATCAAGATTG 


CAATCTTAAA 


ATGGGGTATT 


TGATGTCAGA 


GACACTCAAT 


CAAATTAAAG 


660 


AAAGTTTTAT 


TGAATATTTA 


CTTTTTCAAT 


ATCGATTTAA 


GTCGCTCGTG 


CC 


712 



(2) INFORMATION FOR SEQ ID NO: 1050: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1050: 



CCCAANATCT 


AAAATATTTG 


ATTCCATCTT 


TGCAGTGATT 


TTTNACCAAT 


NTCCNANCCA 


60 


TTTTTCAATA 


TGGTTGATNC 


TTGAGTCAAT 


TGATCGTACA 


CTAGGCTCAA 


TTTCTGCATA 


120 


TTGCGACATA 


ATTAATACGC 


AACCTGACCA 


TATGATTCTA 


ATGGTGCATC 


TTGATATAAC 


180 


TTTTCAATAT 


TATAATAATT 


TCTTTCGTCT 


TTATGGAAAA 


CATGTACCAC 


AACATCGAGC 


240 


TAAGTCAATT 


AATATCCAAC 


GCGCTTCATT 


GTATCCTTCC 


ATACGTTTTA 


CTTCTATATT 


300 


TTGTTCATTG 


GCTACTTCTT 


TCACCGCTCT 


AGCAATCGCT 


TGAACTNGTC 


GTTCATTATT 


360 


TCCGTGCGTT 


ACAACAAAAT 


AATCNGTCAT 


ATCGCTGATA 


CCTTTCATTT 


CTAAAGAAAT 


420 


CGTATCTTCG 


CCTTTTTTAT 


TNTCAATGNC 


ATCCACAGCA 


ATGGCTAATA 


ATTCTTGTGA 


480 


ATTCATTTAA 


TCATCCTTTA 


TTCTTTCGTC 


ACTATAGTTA 


TAATAATTTA 


AACAGTCAAT 


540 


CGTCTTATTA 


TATACCGTAA 


TATCTTTCTG 


TTATTAAAAA 


TAGTACTGTG 


CGTTTTAGAA 


600 


ATTTCATAAA 


TTGTCTTATC 


CTAAACTACC 


TTGATTGTAT 


GCCATATCTC 


GAAATATCAT 


660 


CAACTCCTGG 


GATTGTCTCC 










680 



(2) INFORMATION FOR SEQ ID NO: 1051: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: lin ar 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051: 



AA i\iAAu A lo 




a nnr* a c & r*crz 

AuVjuAl~Al~l~U 








ou 


CTGANTGAAG 


CTAGTGTTGG 


TGAAGAAGAT 


GTAGTAACAA 


CTTCAAGATT 


TCCCTGGANC 


120 


NACTTTAGAT 


ATGATAGATA 


TTCCTTTAGA 


TGAATCATCA 


TTTATGTATG 


ATACACCAGG 


180 


TATTATTCAA 


GATCATCACA 


TGACGCATTT 


AGTTAGTGTG 


AAAGAATTGN 


AAATTATTAT 


240 


GCCTAAGAAA 


GAAATNAAAC 


AACGCGTATA 


TCAATTAAAT 


GAGGCGCAGA 


CATTATTCTT 


300 


CGGCGGGCTA 


CGAGCGCATA 


GATTATGTAT 


CAGGTGGTAA 


ACGTCCGTTA 


GTTTGTTTCT 


360 


TTTCTAATGA 


CTTGAATATA 


CATCGTACTA 


AANCGGAGAA 


GGCTAAGGAT 


TTATGGCGTA 


420 


ATCAACTTGG 


CGATTTATTA 


ACGCCACCTG 


GAAATCCACA 


AAATTTTGAT 


CTTAATGAGT 


480 


TAAAGGCTGT 


TAGACTTGAA 


ACAGGCAAAG 


AGAAACGTGA 


TGTTATGATC 


TCTGGTCTAG 


540 


GCTTTATAAC 


TATAGGACCA 


GGAGCTAAAG 


TAATCGTTCG 


TGTTCCTAAA 


AATGTTGATG 


600 


TTGTATTAAG 


AAATTCTATT 


TTATAAGGTG 


ATTTAAAAAA 


TGAAATTTGC 


AGTTATCGGA 


660 


AATCCTATTT 


CACATTCCTT 


GTTCGCCCCG 


TTATGCCTAG 


AGCAAATTTT 


AATTCCTTTA 


720 


GGAATTAGAA 


TGAAACTTTA 


TGAAGCTTTT 


AAATATTCCA 


ATTGAAGATT 


TTCATTTAAT 


780 


TAAAGAAATT 


ATTCCGAAAA 


AAAGAATTAG 


ATGGCTTTAA 


TATCACAATC 


C 


831 



(2) INFORMATION FOR SEQ ID NO: 1052: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 1052: 






GGCACGAGCG 


ATAGTCGCTA CTACTTCCAC 


ATGTGTCATA 


GTAAATCCCT 


CATTTCAGAT 


60 


ACTCATTATT 


TTTATTGTAA ACAACAACTT 


TTTATCCTTC 


AATCAAACAA 


TGCTATAAAT 


120 


GAAAATGATT 


AAGGTATTTT CGAATGGGTA 


AAGCATAAAT 


GTATTTTAAA 


TTAGGAGGTT 


180 


ATAAGAATGG 


CAATACATTA TGAAACTAAA 


GCGACGAATG 


TTGGCGGACG 


TAAAGGACAT 


240 


GTTTATACTG 


ATGATCGAGC GTTAGATATT 


GATATCGTTC 


CGCCTGCTCA 


AGCAGATGGT 


300 


AAAGCTACTA 


ACCCCGAACA ATTATTTGCA 


GCAGGTTATG 


CATCTTGCTT 


CAACGGTGCT 


360 


TTCGACCTAA 


TTTTAAAGCA AAACAAGTGC 


GTGATGCTCA 


TCCAGAAGGT 


AACACTAACA 


420 


GTGAGACTAG 


AAGATGATCA GACTCAGAAA 


GTCCTAATTA NGTGGTTCAA 


TTGATGCGAC 


480 
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AATTAAAATG T 491 
(2) INFORMATION FOR SEQ ID NO: 1053: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053: 



GGCACGAGAC 


ACCTGCATAT 


ATATATGCAA 


TACCAGTTGC 


ATCCTTAGCA 


TTATTAATCG 


60 


CAATCACATT 


GTTTGTTAGA 


AAAAAATCTA 


AAGGCAATGT 


GGAATAATGA 


GAAATGTTAA 


120 


ACAAATTGCT 


ACAAAATCTA 


TTATAGCTAT 


TATTAGCTTA 


TGTATACTTA 


CATATACAAC 


180 


AATGATTGGT 


AGCGTGTTGG 


CTGATGAGAT 


AAAATATCCA 


TCAGCCAAAT 


TTAATCAACC 


240 


TGAAGCAAAA 


GATAAAACAG 


AATTAACTAC 


ATCAATTTTT 


GATGAAAAGA 


TAAAAGAGAA 


300 


TAAAGCGTTA 


GAGTTACTAA 


TTTTTAATCA 


AGAAAATAAA 


AATGTAACTG 


AGGAACAACA 


360 


ACTAGTTGAC 


GAAAAGGCGC 


AATTGATTTC 


AGATATGACT 


GGTAAAATTT 


ACTTGCAAGT 


420 


AAAGCTAAAA 


GGTCAAATAG 


ATAAAGAACA 


ACTTGTTTTT 


CAAAATGACA 


AAAATGAAGA 


480 


ATTTCCTTTT 


GTTATAAAAG 


ATGAAAAGGA 


TGACCCATAG 


TAAGAATTTT 


AATTGAACAG 


540 


CATATGGATA 


AA 










552 



(2) INFORMATION FOR SEQ ID NO: 1054 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054: 



GGCACGAGAC TGTTTCGAAA TTGTGTAAAT CCCTTGATGT AACTTTGAAC GAATTTTTTG 60 

GAATCAATGA AAAAGAGGGG GTCAAATAAA TGTCTAGAAC AAAATTGCAA GACTTCCCAT 120 

CAAAAGAAAA TACAGTTACA GAACCGAAGC AAGTTGTAGT AAATCCGTTG TTTGCGAAAC 180 

CTAATGCACT AGCTGGTATT TTTGGAATTT CATACAGTTC GGTGAATCGC ATTTTAAAAG 240 

AATGGGAAAA AGATTCTAAA GGTGTTGATG ATTTATATTA TTCACTATCA TCAACATTGA 300 

TTGTTATCAG TATTCCGCGA TTCNAGGAGT ACATGAAGGT GCGTCCTAAA AAATGGATGT 360 
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TTGAGGTAAA GCAATGAAAA TATACTTAAC NTTATATCTG GTTTAGTTTC _ 410 

(2) INFORMATION FOR SEQ ID NO: 1055: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055: 



GGCACGAGCG 


AGCGTAAACA 


TCTGTAATAA 


TTGGCATGTT 


TATAATCTCC 


TTTAAATAGT 


60 


TATTCTATAA 


TTATTATAGC 


TTTATTCTTT 


CATTTAACAA 


AATGATATTA 


CTATTATTTA 


120 


CAAAAAATAA 


GTGTCATTAA 


GCGTTTACTT 


AACATCTTTT 


ACAATATTAG 


TGTTTAATTA 


180 


AAGATTCACC 


TGTCATGTCT 


ACAGGTTGTT 


CTACATTTAA 


TAAATCTAAT 


AATGTAGGTG 


240 


CTAAGTCACC 


TAAGCGACCA 


GTTTCTCGAA 


GTGTAACGCC 


TTCTTTTGTT 


ACAATCACTG 


300 


GTACTGGGTT 


CGTTGTATGC 


GTAGTCATTG 


GTTGATCATC 


ATCCGTCAAT 


ACTTGATCAG 


360 


AGTTACCATG 


GTCAGCAGTA ATAATTGCAT 


AACCGTCCAT 


GTCTAAAATC 


TTATCAACGA 


420 


CTTCACCTAA 


ACATTCATCA 


ACCGCTTCGA 


TTGGTTTGAT 


TGTCGGCTCA 


AGCATACCAC 


480 


TATGTCCACC 


ATATCAGGTT 


AGCAAAGTTT 


AAATATTAAG 


TCCAAGTCAC 


TNTATTTAAC 


540 


TCTCTAATAA 


TGCATCTT 










558 



(2) INFORMATION FOR SEQ ID NO: 1056: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056: 

GGCACGAGCT AGAGGATCAA GAGAATTTAA ATGGCGATTG CCNNACAAGA TGACNATNCG 60 

AATCAGTNCC ATTGGTGGCN TANGTCTGGG ATTTGGTGCG TCANTNGCTG GTGGTTGTTC 120 

TATCGGTAAC GGTTTGGTTG AAACGGCAAC GATGACTTGG CAAGGATGGA TTGCGCTAGC 180 

ATCCATGATA GNTGGTGTAT GGACAATGAG TCATTTTNTC NTTGTTCGTC CAATGAAAAA 240 

NGTACAACAA CNATCCGCAA TAGGTTAAAC AGCAAACGCN AATAGTATAG AAGATTATTA 300 

TGCCCATGAT GTTGATCAAA TAATAAGTGA TTG 333 
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(2) INFORMATION FOR SEQ ID NO: 1057: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 : 



GGCACGAGCT 


CGTGCCGAAT 


TCGGCACGAG 


ACATTACATC 


ATTAATTTCA 


ACCATCTTCT 


60 


TATTGATTAT 


GGTTCCATTC 


TTCTTAATTT 


ATATGTTAAA 


AGACCATGAA 


AAATTTATCC 


120 


CAGCTGTTGC 


TAAATTTTTC 


AAAGGTGAAC 


GTAAAGTATT 


TTTCGTAGAT 


TTATTAACTG 


180 


ATTTAAACTT 


TACATTAAAA 


TCTTACATTC 


AAGGTCAAGT 


AACTGTCAGT 


GTTATTTTAG 


240 


GTATATTCTT 


ATATATCGGT 


TATTCAATTA 


TTGACTTACC 


ATATATTCCT 


TTATTAGTTC 


300 


TATTTGCTGG 


TGTCGCAAAC 


TTAATTCCTT 


TCTTAGGTTC 


ATGGTTATCA 


TTTGCACCTG 


360 


CAGCAAATAC 


TCCGGTATTA 


ATAGATAGTC 


CTACTACATT 


TATATGGGTT 


TGTATCATTA 


420 


CATTAATTGC 


ACACCAACTT 


GAAGGTAATA 


TCATTACGCC 


AAATGTTAAT 


GGGTAAATCT 


480 


TTAAGTATCC 


ATCCTTTAAC 


AATTATCGTT 


GTTA 






514 



(2) INFORMATION FOR SEQ ID NO: 1058: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058: 



GGCACGAGCG 


CCCGTCTACA 


TCGTTCAAAG 


CCCATTGAGT 


ATAAGACAGT 


CCATGAGGTG 


60 


TTGCGGCCTC 


GTCAATGGCT 


TCTACAACGT 


TCTCAAGAGG 


TACGTATTTT 


GATTTAAAAA 


120 


ATGGATTGTT 


TTTATCTTTG 


AGCGGTTGTT 


TTACTTCCTT 


GCGAAATGCA ACCATAGCTT 


180 


TATTTATTTC 


AACAACTGTT 


TCTGATTTAT 


TCATCACTTA 


ATCACCAGAC 


TTTCTGTTAC 


240 


CTTTAATTCA 


ACGCCAGGAA 


TATCTTTCCC 


AGCTTTCAAA 


TCATCGATTA 


GTTGCTTAGA 


300 


ATTAAGTTTC 


GGGGCTTGTG 


ATAGCCAATA 


ATCCTTTGGA 


ATAAGTTTTT 


CATCGATAAT 


360 


ATTTTTACTA 


GCTCCGTTTT 


TGCGTTTAAA 


AATATGATTA 


GTAGCTGTGC 


GGTAACTATC 


420 


TACTTCTTGT 


GTTTCTAACA 


TTTCTTTTAA 


GTAATCTCTT 


AATCGATCAG 


TTAAATTTTG 


480 
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rrmvr i Ti ' tttaaatttt gaagtcgtta tctcttatct atgacatcta tgtcactaag_ 540 

TTCACGTCTC CATGACATGT TATCTACTTG ACATCATTCT GCTGATAATC TA 592 
(2) INFORMATION FOR SEQ ID NO: 1059: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1059: 



GGCACGAGCT 


ATAAATGATG 


AAGTTGATTT 


AGATGAAAAT 


TAATCTTGGA 


TATTTAATAC 


60 


TTTGTTGATT 


TCATCAAGAT 


CAACATGGTA 


CATTGGATTG 


CCATTCAACA 


AAATAAACGG 


120 


AGTTGAAAAA 


GCATCAAAAT 


CTATCATTTC 


GTTTCGATAT 


TGTTGATTGT 


TGATATTTCT 


180 


CTCTTCAAAA 


TCAATTTGAT 


GCTCATTTAG 


ATAATTTTTT 


ACAAATGTAC 


AAGGTGGACA 


240 


ATCATTCTGC 


GTATAAACGA 


TTATTTCTGA 


CATGTTATCA 


TGCTCCTTCG 


TGCTTTATTC 


300 


TAATGTAATA 


TACACAATTA 


TACATTGAAA 


TTCCAAATTT 


GTGAACACAT 


TGTGAACTGA 


360 


CATAAATTAT 


ACACAATTTC 


CATAGCGCCA 


TAATGTTTTT 


TAAAAATACA 


CATTATAGCA 


420 


ATTATAATTA 


GTTCCATGTG 


TTGGTCCATC 


AAAAAAGAAA 


AGGTGATGTT 


TTTTAATGGA 


480 


TACAGTTGAA 


TCCGTCCGTT 


TTGAC 








505 



(2) INFORMATION FOR SEQ ID NO: 1060: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060: 

ATCCTTTATT TNTAAGGCGT TTCATCAAGC TAACACTTCA TTATCTTTAG TCGCTTTAAT 60 

GCCCTTCTAT TAAACTCGGA ACTAAAAATG GATTGNCTTT TCAACCACCC GANCACTAAA 120 

ACATTGTCNT TTTTGATGGC CNATTAAGAC ACAATTTCCC GCTGTTTGCT TCAAAGTAGC 180 

TTGCTTCTTG NATTNATTTT CAATATCTTT CTTGTTAAAA ACAAGANTGT TGCACAGTTT 240 

GATTGGCATC TTTATTTAGA NCAATGGCAT CTGCTTGCCA CTTATCAATG CCTTCTTTAT 300 

TCATATTGAT AAGACCATTC GCCAATCCAG ATAATAAAAA TAGCAAGTAA CTAATCATCG 360 
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1 1 AACACAC L 


AATAAl 1 AGT 


CCAAACTTCA 


ATTTGTTGCG 


CCGTATTTCA 


TTCCAAGCTA 


420 


TV TV TV XPKTW^K 

AAAAJLAIviCA 


1 1 1\, l\, 1 vJCC 


TACTACTATG 


ATTTAAACAT 


TGTTTATATT 


CTTAGATGCA 


480 


CGTACGTCGT 


GTTGCGCTCT 


GTAATGTTAT 


ACATACACTT 


ATCCTTCATT 


ATACCCGANC 


540 


TTTTTATATT 


AAAACCAAAT 


TTATGGAAAA 


TGCAANNANT 


TGTCTATTAT 


TTTTGTGCGG 


600 


TACATTTAAA 


ATTAAGGATC 


AATTTAAAAA 


CGCCTACATA 


TACCTTTAAG 


TACATGAAGA 


660 


CGTCCAATTC 


ATATATTATT 


TAACTTCGCC 


TGTTTTAGGA 


TCGGATTGCT 


TAATAGCATT 


720 


TTTACGTAAT 


TTATCTTl'rG 


CTTTGTCACT 


TGCTTTATAG 


TTATTGTTGT 


AAATCGTAGC 


780 


TTCCCAACTA 


CCATACATTG 


^VlflHii « ^^^^^^ m 

GGTTAGGGAA 


AATGATATAT 


TTCTTACCGA 


AATCGTCTTT 


840 


ATGTTTTTCA 


ATTAATGCTT 


CACGAGATTC 


AGCTGTAGCT 


TCTTTTGGAT 


CTGTAAAGTC 


900 


TAATAAATTA 


TCTCCAAATA 


GCATGACAAG 


TTTATGATCC 


TTTTGAACCA 


TTTGTCTGCG 


960 


TGATTCTTTA 


CTCTTATCAT 


CTTTACCTTT 


TAGTAAAATA 


TGACTCTTCT 


TAGCTTGAGG 


1020 


GATACCTTGT 


TGTTTTAAGT 


TCTTTTGTGT 


TGCCTTTAAA 


TCTTTTTCTT 


TATCTCTATC 


1080 


AGAAATATAG 


TAGATATCGA 


CACCTTTTTT 


GTCAGCATAT 


TTCAA 




1125 



(2) INFORMATION FOR SEQ ID NO: 1061: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061: 



CGTCCTTCAG 


TCATTTTTAA 


NNATTTGCTT 


AATTTCATCC 


AATTTGTTAA 


ATTGTTCAAG 


60 


ANNAAAATAT 


TACCCCTCCG 


AAATACGGGA 


TTGTCATGTT 


CTTGTAACCC 


ATGCTTTCGC 


120 


TTTNCGCCAA 


GATGTNCAAC 


TTGNTGAAGT 


AAATAATTGT 


TACCCATACA 


TCTCACTCTC 


180 


CTATTTGAGT 


GAATAAAATT 


CATTGCTTAA 


AATTTAGTTA 


TAAGATCAAG 


ANAAAACATT 


240 


TTTTTCTAAA 


ATTCTTAATC 


GTTACTATTT 


ATTATAACTA 


TCTAACATTA 


AAATTAAATG 


300 


AGANAAACCT 


AATTTTTCAG 


ATAAGTTTCT 


ACACTTATAA 


AAAAGATTAT 


TAATCCTTTG 


360 


TTAGTAGTAA 


GTTATACGTA 


TATTCTAACA 


CATCTTACAT 


TTTTAAGAAA 


TACTGTTATA 


420 


ATGATAATTA 


TTAAAATATT 


ACTAAGAAAG 


TAGGCATTTA 


AATGGAGACA 


TTATTTTCAG 


480 


GCATCCAACC 


TAGTGGAATT 


CCTACTATTG 


GAAATTATAT 


TGGCGCACTA 


AAACAATTTG 


540 


TTGATGTGCA 


AAATGACTAT 


GATTGTTATT 


TCTGTATCGT 


AGATCAACAT 


GCAATTACAA 


600 


TGCCACAAGA 


TCGTTTAAAA 


TTACGTAAAC 


AGACCAGACA 


ATTAGCAGCG 


ATTTATTTAG 


660 


CTTCTGGTAT 


AGATCCAGAC 


AAAGCAACAT 


TGTTCATACA 


ATCTGAAGTC 


CCTGCACACG 


720 


TACAAGCAGG 


ATGGATGTTA 


ACTACGATTG 


CTTCTGTTGG 


AGAATTAGAG 


CGTATGACGC 


780 


AATACAAAGA 


TAAAGCTCAG 


AAAGCAGTTG 


AAGGTATACC 


AGCTGGTCTA 


TTAACATATC 


840 


CACCTTTAAT 


GGCAGCTGAT 


ATTGTTCTTT 


ACAATACTAA 


TATCGTTCCA 


GTTGGAGATG 


900 


ACCAAAAGCA 


GCATATCGAA 


NTGACTCCGT 


AACCCTTGTA 


GATAGATTTA 


ATAGTCGCAA 


960 


TAATGATGTG 


CTTGTGAAAC 


CCTGAAATTC 


GTATGCCTAA 


AAGTTGGTGG 


ACGTGTCATG 


1020 
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A\7 X X X #VL>A/U3 


appp a Af* A AG 


AAAAATGAGT 


AAGAGTGATG 


ATAATGCTAA AAACTTCATT 


1080 


TPATTATTGfi 


ACGAGCCGAA 


TGTTGCAGCT 


AAAAAAATTA 


AAAGCGCAGT 


AACTGATTCA 


1140 


RATGC3TATTA 


TTAAATTTGA 


TCGTGACAAC 


AAGCCAGGTA 


TAACAAATTT 


AATTTCAATA 


1200 




TAAPPHAPAT 


fiCCAATTAAA 


GATATTGAGG 


CAAAATATGA 


AGGGGAAGGT 


1260 


TATY^flTAAAT 


TT A A AfyVPfJA 

X 1AAAUU1UA 


CCTTGCTGAA 


AT AGTT AAAG 


CATTTTTAGT 


AGAATTCCAA 


1320 


nxm AAA ATAf* 


fJA A AfiTTTPT 


ATAACTCAGA 


TAAACTTGAT 


GATATTTTAG 


ATCAAGGTAG 


1380 


AHATAAACCA 


PAPAAAGTTT 


CATTTAAAAC 


TGTCAAAAAA 


ATGGAAAAAG 


CGATGGGATT 


1440 


AGGACGTAAG 


AGATAAAATA 


AAGAGGTACT 


AGAATTAACT 


AGTGCCTCTT 


TTATTATTTA 


1500 


ACGTCTACCT 


CTAGGATTAA 


AAGCATCTTT 




CCTACAAAGT 


TTATACTTAA 


1 CCA 


TATAGTTAAT 


GTAATCATTA 


TAGCAGGTGG 


CATCCATATC 


CACGGTTTAC 


CACTTATAAC 


1620 


ATCCCCTTCT 


TGAGCATCAC 


TCAACATATT 


ACCCCAAGAT 


GGTATTGCTT 


TTACTAACTC 


1680 


CGAATCCTAA 


AAAGCTCAAT 


CCTGATTCCA 


CTACAATCAT 


ACCGGGAAAT 


TACAATGTTG 


1740 


GGTTGG 












1746 



(2) INFORMATION FOR SEQ ID NO: 1062: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 : 



ACTGGGTCCT 


ATTTATTGTC 


AGGTAAAGAT 


GTCTCAATCA 


ACTATGGAAT 


ATATGTATGG 


60 


GAGATATTGT 


AAACTTTTCC 


AAGAGGTTCC 


ACGGGTTAAT 


ACTAATATCT 


TTATCATAAT 


120 


CGATCAAGAC 


ATAAAGATCC 


AGAGGCGGAC 


TTTAATGCAA 


CATATGTTTC 


TTTTGAAACG 


180 


TTGTTAGCAG 


AAAGTGATTT 


TATCATCTGT 


GCAGCGCCAC 


TTACACAAGA 


AACACATCAT 


240 


AAATTTAATG 


CTGAAGCATT 


TGAGCACATG 


TGAAATGATG 


CAATTTTTAT 


TAATATCGGT 


300 


AGAGGGCACA 


TTGTAGATGA 


AACAGCATTA 


ATCGATGCAC 


TAGACAATAA 


AGAAATTTTA 


360 


GCATGTGGTT 


TAGATGTATT 


AGCAAATGAG 


CCGATTGATC 


ATACACATCC 


ATTAATGGGA 


420 


CGTGATAATG 


TTCTGATTAC 


ACCACACATT 


GGTAGCGCAT 


CAGTAACAAC 


ACGGGACAAT 


480 


ATGATTCAAT 


TATGTATTAA 


TAATATAGAA 


GCGGTTATGA 


CAAATCAGGT 


ACCACATACC 


540 


CCAGTAAATT 


GAAAATAAGT 


ATGGTTTTTA 


ATCATTGGTA 


AAACAAGCAA 


AGCAATTGTT 


600 


AAATATGTTA 


TGCTTGACTT 


GCGAATGTAT 


ATATTTGCGC 


TGATGATAAT 


TCAATAACAT 


660 


TATAAACTCG 


TCATATTAAT 


GTGATGAGTT 


TATTTGATTT 


TGACGTATAT 


AACAAAATTG 


720 


TTATATAGAC 


TGTAATTGTT 


TATTGTTTAA 


ATAACCTTTA 


AAATGAATGA 


TGTTCTCAAC 


780 


AAAGTGTATC 


ATTTACAATA 


ATTGAGTGAT 


GACAACCAAT 


TACTATGTTG 


AATCTATAAA 


840 


ATTAAAATTG 


AATTTCATTA 


TGAATATCGA 


TATAATGAAT 


TGTAAAAAAC 


AAATTTTAAA 


900 


ACGTATTTTA 


TATTGAACGG 


TCTTTATAAA 


GGCGTTTTTA 


CTAAGAACTT 


TAAGAGGTGC 


960 


AATATGAAAT 


CTAAAAGTAA 


ACAGCCACCT 


AATAAATATG 


TTGAAGCATT 


CAAACCATAT 


1020 
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1 1AJ. I AAJUAL, 


TATTGTATTT 


GGCAATAlIl 


ATTACTTTAT 


ATTTAATTTA 


TGGCAGTGGC_ 


1080 


UAvJAL~AL»Av~A 


ATA ArTWIV A f» 


mm* m« x mp x /->» 

TTATAATGAG 


TTCTAATGAG 


GGAGACTTAA 


TATGACAGAT 


1140 


ATTSTTi Af^A 
A± 1A1 1 AALA 


Auk. ITji-AAljV. 


GTTTGCGGAT 


viUAAATCCAC 


AAAGCATTGC 


TGTTAGACAC 


1200 


A r* A A A TV"" 


AAl iAALi 1 A 


I\, AAuALj 11 A 


ATGGATGAGT 


CTAGTAAATT 


AGCACATCGA 


1260 


nvp a r* A A rV2T A 
i 1 ALAAuu 1 A 


f2*PA AO a mnr 
ulAAbAAALv. 


GATGATTTTA 


TTCGGTCACA 


TGTCACCATA 


TATGATTGTT 


1320 




vjlvHJvJAl 1 AA 


AGCAGGATGT 


GGATATGTAC 


CTGTAGACAC 


CTCCAATTCC 


1380 


IoliAAUAL.v,lj 


IATTAAAATG 


x nvnxnwnx x/*x 

ATTATTAACA 


AGGGTCCAAC 


CNAGAGTTTG 


TATTTAATAC 


1440 


UAL A lvj r TVjxA 


TCATTTGAAA 


GTTAAAGAGG 


CCGAAGTAAT 


TACCAATTTG 


AAGATATTAA 


1500 


aacatctcna 


AGACCCAGTA 


ATTTTTTGAT 


AGTCAAGATT 


AAAGATACCG 


ACACAGAATA 


1560 


CACAATCTTT 


XOX HV VTyWfin^ 

ACATCTGGTT 


CTACTGGGGA 


ACCTAAAGGT 


GTTCAAATTG 


AATATGCAAG 


1620 


TTTAGTTCAA 


TTTACTGAGT 


GGATGTTAGA 


ACTTAATAAA 


TCAGGAAATA 


AACAACAATG 


1680 


GCTTAACCAA 


GCGCCATTTT 


CATTTGATTT 


ATCTGTAATG 


GCTATTTATC 


CATGTTTAGC 


1740 


ATCAGGCGGT 


ACATTAAATC 


TTGTAGATAA 


AAACATGATT 


AATAAACCTA 


AATTATTAAA 


1800 


TGAAATGCTA 


ACAGCAACAC 


CGATTAACAT 


TTGGGTATCA 


ACACCATCAT 


TTATGGAAAT 


1860 


GTGTTTATTA 


TTACCAACGC 


TTAATGAAGA 


ACAATATGGT 


AGTCTTAAC 




1909 



(2) INFORMATION FOR SEQ ID NO: 1063: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1063: 



CCATCGACTA 


AGCAATGGTA 


TTTGTTCATT 


AAATCCTCAT 


GTTGATCGTT 


TAACGTTAAG 


60 


CTGTCGCATG 


GAAATCGATG 


CTAGTGGTCG 


CGTTGTTAAA 


CATGAAATTT 


NTGATAGTGT 


120 


TATACATTCT 


GATTATCGAA 


TGACGTATGA 


TGCGGTAAAT 


CAGATTATTA 


CTGAAAAGGA 


180 


TCCTAACATT 


CGCGAACAAT 


ATAAAGAAAT 


TACGCCTATG 


TTAGATTTAG 


CACAAGATTT 


240 


ATCTAATCGN 


TTGATTCAAA 


TGAGAAAACG 


ACGTGGGGTG 


AAATCGATTT 


TGGATATTAG 


300 


NGAAGCAAAA 


GTATTAGTTA 


ACGAAGTCGG 


GTATACCAAC 


AGATGTTCAA 


TTAAGACAAC 


360 


GTGGCGAGGG 


TGAACGTCTA 


ATTGAATCAT 


TTATGTTAAT 


TGCAAATGAA 


ACAGTTGCTG 


420 


AACATTTTAG 


TAAGTTAAAT 


GTACCTTTTA 


TTTACCGAGT 


GCATGAGCAA 


CCTAAATCAG 


480 


ATCGCTTAAG 


ACAATTCTTT 


GATTTTATTA 


CAAACTTTGG 


CATCATGATT 


AAGGGCACTG 


540 


GCGAAGATAT 


TCATCCAACA 


ACACTTCAAA 


AGGTTCAAGA 


AGAAGTAGAA 


GGTCGACCTG 


600 


AACAAATGGT 


CATTTCAACA 


ATGATGTTAC 


GTTCAATGCA 


ACAAGCGCAT 


TATGATGATG 


660 


TGAACTTGGG 


ACATTGTGGC 


TTATCAGCTG 


AATATTATAC 


GCATTTNACA 


TCACCAATTA 


720 


GACGTTATCC 


TGATTTAACA 


GNTCATCGTT 


TAATCCGTAA 


GTATTTAATT 


GAGAAATCAA 


780 


TGGATAACAA 


AGAAGTGAAG 


CGTTGGGAAG 


ACAAATTGCC 


TGAGTTAGCT 


GAACATACTT 


840 


CTAAACGTGA 


ACGTCGTGCT 


ATTGAGGCAG 


AACGTGATAC 


TGATGAATTG 


AAAAAAGCAG 


900 
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AATATATGAT 


TCAACATATT 


GGTGATGAAT 


TTGAAGGTAT 


TGTCAGCTCA 


GTAGCTAACT 


960 


TCGGTATNTT 


CATTGAATTG 


NCAAATACGA 


TAGAAGGTAT 


GGNTCATATT 


GCGAATATGA 


1020 


CTGATGATTA 


TTACCGCGTT 


GAAGAGCGTC 


AAATGGCATT 


AATTGGGTGA 


GCGTCAAGCT 


1080 


X K A / Til Jl f^^l^l A 

AAA\j LAL I 1 A 


uAAl luu luA 




fiTTAAAHTRA 


W x u x x u/\ 


TfiTAfiATGAA 


1140 


CGATTAATTG 


ATTTTCAAAT 


TGTTGGAATG 


CCTTTACCTA 


AAAATGACCG 


CTCACAGCGA 


1200 


CCAGCAAGAG 


GTAAAACGAT 


TCAAGCTAAA 


ACGCGTGGCA 


AATCTTTAGA 


TAAATCGAAA 


1260 


TCTGATGATA 


AGGGTCGGTA 


AGAAAAAAGG 


TAAGCAACGT 


AAGGTAAAAA 


CCAACGTAAT 


1320 


AATGATAATC 


AGGTAATAGT 


AAGCATAG 








1348 



(2) INFORMATION FOR SEQ ID NO; 1064: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064: 



CCGTCATTAA 


TGATATGATG 


TGTATCTATA 


AACAGGTATG 


CATGTAAGGG 


ACTTCTAATG 


60 


TATCTCACTC 


TAATTTGACT 


TGGCTTTTCC 


AAATTAAAAG 


GTGCCACAAA 


TTGGCGCATA 


120 


ATTTCTTGTT 


CATCCGTAAA 


ATGCGTGTTA 


ACTTCTTCAA 


AGTCAGCTAC 


AACATCTGCC 


180 


ACAATACGTT 


GTCGAACCTC 


ATCATCTACA 


ACAATATATT 


GTGTTCGTAG 


AATCTCATGT 


240 


CGCGCTATCA 


AATGCTGAAC 


TGCTTGTTGC 


AATTGAGCTA 


CATTAAGTCC 


TGATGATAAC 


300 


CGCCATAAAA 


AAGGTACGGT 


TATACAACCG 


TATCTTTATG 


GGTNGGGATT 


TCCATAATAA 


360 


ATACAATACG 


CTTTTGGGGC 


AGGAGCTCCA 


AGCAACATAA 


TCAACTTTNN 


ACTTATAGNT 


420 


TCTTGGAATC 


ACTTCATAGT 


TTTGTTCTTG 


AACCTTAGCA 


ATCGCTCGTG 


CTAGTTCAAA 


480 


TACAGTTGGC 


TTTNGTAATA 


AATCACCAAT 


TTGTAATCGT 


TTCCCAGTAG 


ATGCCTCTAT 


540 


CCGATTCACC 


ACTAACGTTG 


CTTTTAATGA 


ATGGCCACCT 


AATTCAAAGA 


AATTATCATG 


600 


AATACCTACT 


TGATTCACAT 


GTAAAATATC 


TGCACAAATT 


TGGCATAGCA 


AGTGTTCGGT 


660 


ATCTGTACTC 


GGTGCTACAT 


AGGCATCCGT 


ATCGACATAG 


TCCATGATAG 


GCAATGCCTT 


720 


CTTATCTAAT 


TTCCCATTAA 


TAGTAATAGG 


AATTTGCTCA 


ATATGCATGA 


AATTAACTGG 


780 


TATCATGTAC 


TCCGGTAAGG 


TCATACGTAA 


TTGTGATTTA 


ATCTTATTAT 


GTGATAATGT 


840 


ATGCATCGCT 


TCATAATAAG 


CAACGATATA 


CTGATCTTGA 


TCATGATTTT 


GAACAAGAAC 


900 


AACTGCTTTA 


TGAATACCTT 


GTATACGCTC 


GAGCGCATGC 


TCAACCTCTG 


ACAACTCAAT 


960 


CCTAAACCCT 


CGAATCTTAA 


CTTGTTTGTC 


CTTTCGATAT 


AAATAATCTA 


TGTTGCCATC 


1020 


GGGTAACAAA 


CGAACGATAT 


CACCACTTCT 


ATACATCAGN 


TGATTTATAT 


TTGAATCTTT 


1080 


GATAAATTTA 


TCTGCTGTCA 


ATTCTGGNTG 


ATTTAAATAA 


CCTGCAGGTA 


ACCCAAAGGC 


1140 


ACTTGTACAT 


AATTCTCCAG 


GAATACCAAC 


GGCACACGAN 


GGTCGGCTNG 


ATGATATAAA 


1200 


CATGAGTACC 


CAGAATCGGG 


NTTACCAATA 


GGAATACGAT 


TGGACTTGGG 


TAGGTATATT 


1260 


ATCGCGTGGG 


AATGTGNTTT 


CAA 








1283 
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(2) INFORMATION FOR SEQ ID NO: 1065: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065: 



CTGAAAAACT 


TAAACTCATT 


TAACTATGTA 


CGTAAAGGTT 


TAGAATATGA 


AGAAAAACGC 


60 


CAAGAAGAAG 


AATTGTTAAA 


TGGTGGAGAA 


ATCGGACAAG 


AAACACGTCG 


ATTTGATGAA 


120 


TCTACAGGTA 


AAACAATTTT 


AATGCGTGTT 


AAAGAAGGTT 


CTGATGATTA 


CCGNTACTTC 


180 


CCAGAGCCTG 


ACATTGTACC 


TTTATATATT 


GATGATGCTT 


GGAAAGAGCG 


TGTTCGTCAG 


240 


ACAATTCCTG 


AATTACCAGA 


TGAGCGTAAG 


GCTAAGTATG 


TAAATGAATT 


AGGTTTACTG 


300 


CATACGATGC 


NCNCGTATTA 


ACNTTGACTA 


AAGAAATGTC 


AGATTTCTTT 


GAATCAACAA 


360 


TTGGAACNAG 


GGTGCAGATT 


GTTAAATTTA 


ACATCTAACT 


GGGTTAATGG 


GTGGCGTNAA 


420 


CGAATATTTA 


AATAAAAATC 


AANTAGAATT 


ATTAGATACT 


AAATTAACAC 


CAGAAAATTT 


480 


AGCAGGTATG 


ATTAAACTTA 


TCGAAGACGG 


AACAATGAGC 


AGTAAAATTG 


CGAAGAAAGT 


540 


CTTCCCAGAG 


TTAGGCAGCT 


AAAGGGTGGG 


TAATGCTAAA 


CAGATTATGG 


GANGATAATG 


600 


GCTNAGTTCA 


AATTTCTTGA 


TGGAAGCAAC 


AATCTTCTAA 


AATTGGGTAN 


ATGGAAGCAT 


660 


TAGACAAATA 


ACGAACAATC 


NGTGGAAGAT 


TACAAAAATG 


GTAAAGGCAA 


AGCTATGGGG 


720 


CTTCTTAGTT 


GGTCAAATTA 


TGAAAGCGTC 


TAAAGGTCAA 


GCTAATCCAC 


AATTAGTAAA 


780 


TCAACTATTA 


AAACAAGAAT 


TAGATAAAAG 


ATAATTTANA 


TCATCAAACT 


ATGAAGATTT 


840 


AAAAAATAAA 


CCCTTGATTG 


CTGACTTAGA 


TGCAATCGAG 


GGTTTATTTA 


TATCTATAGA 


900 


AGTCATATTA 


CTTTTAACTT 


TATTCATTGN 


ACATGTTAAT 


GGTAAAAATA 


TTAATTTTAT 


960 


TAATGCGTTA 


GCTTTAATTA 


TATTAAGGCA 


AACTGTATAA 


TAAAAAGGTA 


TAAAACATTT 


1020 


GTGTATAAAG 


ACAACATTAT 


ATTTACAACA 


TCATTTTAAA 


GGTAAAATAG 


CATAACTGAC 


1080 


GAAGTCTATA 


TAATGAAGAA 


CGGCAAAAAA 


TGCTGAATAA 


ATAACAAGCT 


TTGTACATAT 


1140 


TGAGATAGTA 


TTTGTTTAAG 


ATACAAGTTG 


GTCTTTAACG 


ATATTAAGAA 


TGATGAAATA 


1200 


AGACTGAGCC 


TGGGTCATAA 


ATTCAATGTC 


CTAGGCACTA 


CAATGTTAAT 


ATTGGCAGTA 


1260 


GTTGACTGAA 


AGAAAATACG 


CTTGTAACAA 


GCTTNNNTCA 


ATTCTAGTGG 


GGCCCCAACA 


1320 


TAGAAGCTGA CTTTCTGTCA .GCTTACAATA ATGTGCAAGT 


NGGGGTGGGG 


CCCCAAACAA 


1380 


AGAGAATTTC 


GAAAGGAAAT 


TCTACAGACA 


ATGCAAGTTG 


GGGTAGAACG 


AAATAAATTT 


1440 


TGTTAAATAT 


TATTTCTGTC 


CCACTCCCTA 


TTAGACGAAA 


CAAAGATGAA 


GTCAAAATAT 


1500 


ATGAATTTTA 


AGTAGAAGGA 


TAAGATATGA 


ACAAACGTGC 


TAGAATCATT 


TATAACCCGA 


1560 


CATCAGGTAA 


AGAGCTAATT 


TAAAAGAGAA 


TTACCTGATG 


CCTTAATAAA 


ATTAGAAAAA 


1620 


GCGGGATATG 


AAACGAGTGC 


ATATGCAACC 


GAGAAAATAG 


GTGATGCCAC 


ACTTGAAGCA 


1680 


GAAAGAGCTA 


TGCATGAAAA 


TTATGATGTA 


TTAATCGCTG 


CAGGTGGTGA 


TGGAACATTA 


1740 


AATGAAGTAG 


TTAATGGTAT 


CGCAGAAAAG 


CCTAATCGTC 


CTAAGCTAGG 


TGTCATTCCT 


1800 



902 



WO 97/30070 



PCT/US97/02318 



ATGGGTACTG TTAATGACTT TGGACGTGCA TTGCATATAC CTAATGACAT CATGGGGGCA. 1860 

CTTGATGTCA TCATTGAAGG TCATTCTACT AAAGTAGATA TTGGTAAAAT GAATAATCGA 1920 

TACTTTATTA ATTTAGCTGC AGGCGGACAA TTGACGCAAG TCTCTTATGA AACACCGAGT 1980 

AAATTGAAAT CTATTGTTGG TCCATTTGCT TATTACATCA AAGGTTTCGA AATGTTACCT 2040 

CAAATGAAAG CTGTAGATTT AAGAATTGAA TATGATGGTA ATGTTTNCCA AGGAGAAGCA 2100 

TTATTATTCT TTNTAGGCTT AACAAATCCA ATGGCAGGAT TCGAAAAATT AGTGCCCGGA 2160 

CGCTAAGTTA GATGACGGCT ATTTCTACGT TNAATNTATA G 2201 

(2) INFORMATION FOR SEQ ID NO: 1066: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE0NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066: 



GGCACGAGCT ATCATAAGAA AAAAGAATGT GATTGGAAAT TACTCAATCA 


ATTAACTAAA 


60 


CGTGCATATA 


CGAATCGTGT 


GTTTATCACT 


CATTTATCAA 


ATATCATTTA 


CTTGCCTAAA 


120 


AATACTCGTG 


TTAACTTTTC 


AGGTGACGAA 


AAAATGGCAT 


TGCTAAAAAT 


TAGNAGNAGC 


180 


ATCAAAGATA 


TATTTTATGG 


CACGAGTACT 


TTTAAGAGAG 


AAGATGATTC 


TGTAGAAACT 


240 


TTACGTTCGA 


CTATAAAAGC 


ACTTGAAATT 


AGTGGTGAAA 


ATCAAATTAA 


AAGTCATATA 


300 


CTATACGAAG 


TTCTAATGNT 


TTATCGACTG 


CTAGACAGCA 


GATATGCTTA 


ATTTAAGTAC 


360 


CTTATGCTAA 


TTGGCACCAT 


GGGAGTGGGA 


CAGAAATGAT 


ATTTTCGTAA 


AATTTATTTC 


420 


GACGCCCCAC 


CCCAACTCGC 


ATTGCCTGTA 


GAATTTCTTT 


TCGAAATTCT 


CTGTGTTGGG 


480 


ACCCACACCC 


CAACTTGCAT 


TGTCTGTAGA 


AATTGGGAAT 


CCAATTTCTC 


TTTGTTGGGG 


540 


CCCCTGACTA 


GAATTGAAAA 


AAGCNTNGTT 


ACAAGCGCAT 


TTTCGCTCAG 


TCAACTACTG 


600 


NCCAATATAA 


CTTCGGNGGA 


GCATAGAACA 


TTGATTTATG 


TCCCCGCCTC 


TNNTTATCGG 


660 


ATATAGCATG 


TCCCTTTTTN 


AAATTACAAG 


CATAAATATT 


CAGACTCNAA 


TATAGCCACA 


720 


CGCCTAAACT 


NCGGAGGGAT 


GTGGCTGTCT 


TTTNTGATTG 


TAATTATGTA 


TCAGATTTAA 


780 


ATCATTAATA 


TAAAGAGGCT 


AATTCATTAA 


TAAATTTAGT 


AAAATCTGAT 


TTAGCAATAT 


840 


TTATCCTTAA 


TGAAATACCA 


GATTCTGTTG 


CCATGTTTGA ATGCCTTAAA 


CCAGAATCAA 


900 


AATCAATATA 


AATACAAAGC 


AGTTCTTGAT 


TAAAATCAAT 


AATTGTAAAA 


TGTAAATCAG 


960 


GTTCNATAAA 


ATATATTNCA 


GNAACAGGAT 


TGCCGNAATC 


CATATTGTTT 


CAAATTGGCC 


1020 


ATGTCACTAT 


CAAAAAGTGC 


AAANTCAAAT 


TTAACTTCTT 


GATCTTCGAA 


AGTTGCACTA 


1080 


GTGTCGAAAT 


AATAAATTTT 


ATTATCTTCT 


TCGATAACTA 


CATCTTTTAA 


AATAAATGTT 


1140 


ATCGATNCTT 


TGCCATGTAA 


ATAAGATGCA 


TCTCTTANAA 


TCACTTTAGT 


GGNCTTCATC 


1200 


GTCAAATTCC 


TTTCATCATT 


CATTTTTTCA 


TGTTGCATAA 


GCACCATTAC 


TAG 


1253 



(2) INFORMATION FOR SEQ ID NO: 1067: 

903 



WO 97/30070 



PCT/US97/02318 



(i) SEQUQJCE CHARACTERISTICS: 

(A) LENGTH: 908 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 : 



CATATTTGAT 


AATAAAGCCT 


CGCTCTAATA 


TCCCAGGGAT 


TTTTNAGAGG 


CGGAAGACCC 


60 


TAAAATAANN 


TGGATGTACC 


AATTTTTTTG 


ACATTAGTCC 


ATCATTTGCA 


TATTTTTTTG 


120 


CAGTTGATGG 


ATTATTTTAT 


TTTTGCCAAA 


AACAATTTTT 


TTAACTTAAN 


NACCCAACTA 


160 


TACTTAGTTT 


TTTNACCAAG 


AACATTTGAG 


CCCGAGACAT 


AAATCAATGT 


CATAGGCTCC 


240 


ACAATGTTAT 


ATTAGCTGTA 


GTTAACCGAA 


CGAAATGCGC 


TTATAACTAG 


TTTTTTCAAT 


300 


TCTGGTCGGG 


GGTCCCAACA 


TAGAAGCTTA 


TGGAAATTCA 


GCTTACAATA 


ATGAGCAGGT 


360 


TGGGGTGGGC 


TCCAACACAG 


AGAATTTCGA 


TAAGAAATTC 


CACAAATAAT 


ACAAGTTGGC 


420 


CCACTCCCAT 


TCTGTAATAA 


ATATAGGAGG 


TCATTGTTAT 


GCAAATAGAA 


CTTACTGATG 


480 


CAGCAGTAAC 


TTGGTTTAAA 


AATGAACTTG 


AGTTGCCTGA 


AAATAATAAA 


GTGCTCGTGT 


540 


TTTTTGTAAG 


ATATGGTGGC 


GAATTCCAAC 


TCAAGCAAGG 


ATTTAGTCCT 


GCTTTTACAG 


600 


TTGAACCAAA 


GGAAGATGTT 


GATATTGGCT 


ATGAACAACA 


ATATGACGAT 


TTAAATGTTG 


660 


TCGTAGCGGA 


AAAAGATTTG 


TGGTACTTTG 


AAGATGACCA 


CATTATTGTA 


AATGTAGNTG 


720 


ATCACGAAGA 


TGAAATTTCT 


TATTCCACAA 


AATAACTAAA 


ATAAAACGAA 


CTTGTTTACA 


780 


CATTCTGATG 


TGTCCTGAAC 


CTCGGAAGTT 


GTACTCTGCC 


TTAGCACATC 


TGATGTTGTN 


840 


TGAACAANTT 


CGNNTGNGGG 


TATATATTAG 


CTTATCTTCA 


CGCTAATTAC 


ATAATGTTAA 


900 


CGTATGGG 












908 



(2) INFORMATION FOR SEQ ID NO: 1068: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068: 

GGCACTAACT TCATTTGAGC ATCCTCCTAT CAATTGCTAT ATAAATTAGT ACCCTTTTGC 60 
CACTTAATTA TAACAAATTC TCAAATTTTA AAAATTGAAA ATCTAGTTAA TGTATTAGCT 120 
CGATTTTGAA ATCTAATAAT AATTGGCATA AAATGGAAGT AATATTATGT GAGGAGTGTT 180 
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ATGAAATGAC 


AAAAATATCA 


AAAATAATAG ACGAACTGAA 


CAATCAACAA 


GCTGATGCAG_ 


240 


CATQvaAl 1AL 


AA*_~fV_V,\»X 1u» 


AATGTATATT ATTTTACTGG ATACCGTAGC GAACCCATGA 


300 


ft ft ^* ft l"!* ft WTW 


U4JAX XA1 lOA 


TTAAGAAAGA TGGTAAACAA GGTACTATTT TGTCCAAANA 


360 




*-»ft ft/^a»ftffy™«ft 


AAGCATCACC 


CTTCACAGGT 


TGAAATCGTT 


GGATATTTAA 


420 


GACACTTGAA 


ATCCTTTTTC 


ACTTTATCCA 


CAACNCAATC 


AATAAATTAC 


CTAATTGTAA 


480 


GCGGAGCACC 


TTAACAAGTA 


GCACCGCCAC 


AAACAATTAA 


TCTCTGTTTC 


NATGTCAATT 


540 


CATTCGGAGA 


TGTTGAGTTA 


ACAATCACAC 


AATTAAGAGA 


TATTAAATCC 


GAAGATGAAA 


600 


TTAGCACAAT 


ACGTAAAGCT 


GCTGAGTTAG 


CAGATAAGTG 


TATCGAAATA 




660 


ATTTAAAAGA AGGTGTGACT 


GNACGTGAAG 


TAGTCAACCA 


TATTGAGCAA 


ACTATCANAC 


720 


ft ft mft nwziT'n* 
AA1A IvHjV-Aj I 


Uutlunnn iwr 


AGTTTTGATA 


CGATGGTTTT 


ATTTGGAGAT 


CATNCCGCAT 


780 


C ACCTu A Ivnj 


/** ft ft r^c ft r^o ft 
LALALUiuuA 


GATCGCAGAT 


TAAAAAGCAA 


TGAATATGTA 


CTATTTGATT 


840 


Tift/"WIV" , 'Pft ft*P 

1 ACjVj Iaj I AA1 


■ mm ft mr* ft f%r+ ft m 


TATTGTAGCG 


ATATGACACG 


TACCATTAAA 


TTTTGGTGAA 


900 


CCTAGCAAAG 


AAGCACNAGA 


AATTTATAAT 


ATTGTATTAG 


AAGCAGAAAC 


ATCTGCAATC 




CAAGCAATTA 


AACCTGGAAT 


ACCTTTAAAA 


GATATCGATC 


ATATCGCTAG 


AAATATTATT 




TCAGANNAAG 


GTTATGGTGA 


ATATTTCCCT 


CATCGCTTAG 


GTCATGGTTT 


AGGATTACAA 




GAACATGAAT 


ATCAAGATGT 


TTCAAGTACT 


AATTCTAATT 


TGTTAGAAGC 


TGGCATGGTT 




ATTACAATCG 


AACCAGGTAT 


TTATGTACCA 


GGTGTTGCAG 


GTGTAAGAAT 


TGAAGATGAC 




ATACTTGTCA 


CTAATGAAGG 


ATATGAAGTA 


TTAACACATT 


ACGAAAAATA 


AGGAGTGGGA 




TAAAAATGAA 


AAGCTTGTTA 


CAAGCACATT CTCATTCAGT 




LAAJ A X AALA 


1 190 
X J ^ u 


TTGTAGCGCC 


TAAGACATAA 


ATTTTTATCC 


AAGTCTAAAT 


GCAATATGTA 


ACAAACAAGC 


1 IRPi 
IjOU 


TAGAAACACA 


TATGCAGGTA 


TGTTCATCAG 


TAACATGTAA 


TGAATCAAAT 


CAATATCATT 


1 dan 


CATGTTCGAT 


GATTTCTTCG 


CATTGTTTCT 


AGCTTTAATT 


TATCATTATT 


TAATTTTAAT 


1500 


AACCAAGGAG 


ATGATAACGT 


CAGTCTTTAG 


TACGCTGTAA 


TCCATTCCCT 


TTTCATCAAA 


1560 


TTCAAATTAT 


AATTGTAATG 


CTTCTTCTAC 


AGATTTATAT 


TCCATTTCAA 


ATGCCTCTGC 


1620 


AACGCCTTTA 


TTGGTTACGT 


GACCTTTGTA 


AGTATTTAAA 


CCTAATGATA 


ATGGTTGATT 


1680 


TGATTTAAAT 


GCTTCTCTAT 


ACCCTTTAG 








1709 



(2) INFORMATION FOR SEQ ID NO: 1069: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069: 

AAAGAGGAAG ATGTGAACCA CCACTAGTAT TGAATCTAAA GATTGAGGAC GTTCCTTTAT 60 

TAATTGGCTA NNNAAGNNAN ATCNNNAAGA TAGCCAATCC AAAGCCAAAA AGTCAGCATC 120 

AAGAAATACT TCTANNAAGG TAGCAGCTNN NNAGAAGNGN NAGAAATCTA AGANAAATNN 180 

NAAATAATTT GTTTCTTTGC TAAATAGAGG AGCACCGATT GACATCACAT CAGTCGGTGC 240 
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TCCTTTTATT 


TAiivi'rrrr 


x x nvn x x/ihiwiix 

AATTAATTTA 


TACAAANCCN 


ATTCCCTGTT GAGCGTGTTG 


300 


AAlTJoiA* 1 1 v~ 


CNTTGTTTTG 


I it-TCNCGGT 


x xwv»/"* x mnvn x 

ANCCCATTTA 


ACCAAATNAT AAACCNANAT 


360 


CTTTNTCCAA 


ATATTTCTAT 


TTGATCAAAA 


TAAGGTTTGA 


AATTTGCGTT TTTCACATAA 


420 




(-CAATGCTAT 


^oitw* X X MUM X 

CGTGCAATTA 


GC I i ltiAGTC 


TGTATATAAT AGTGCGTTTT 


480 


uAAUii 1 Inn 


1 IVAtAjTwiL.A 


M\j I"l VTAla Ivj 


/~« x mx x X rr»x o x 

LATAAATACA 


TGCAGCCCAT TCTGCAGTGT 


540 




i"TvA»CCTAAC 


TCATGTGTAT 


x rrv"*fT> X nrvT x mo 

ATGTATNATG 


CTGCTTATCT TCTTTGATTA 


600 




NGTACTTATG 


CXTTGGATTTC 


CTNTTCGTCG 


CAGCATCAAA ATTTATGTGC 


660 




ACCTACTTTC 


TATTCAATAC 


TTAGTTAAAG 


TTACTATTAC TGTAATACAA 


720 


AAIAIXJTTIjpG 


GTAATCCATT 


AAAAAACACG 


CATCACTTAA 


ATAAGTAAAC ACGTTGTTAA 


780 


AAATACTTCG 


fulfill 'i m pftfti/i m « 

CTTGATTCAA 


AAGATGAtTT 


TCTAAATACG 


TAGTNCTTGT AAAATACTTC 


840 


TV X X XI XXX m 

CTAAANAAAT 


CATCTTCAGG 


CTGGGGACAT 


AAATCAATGT 


TCTATGCTCC TNCCGAAGTT 


900 


x rp x rpnyipri m /-» 


TAGTTGACTG 


x xv^o x x x x nv"* 

ANCGAAAATG 


CGCTTGTAAC 


AAGei-l^ra^* CAATTCTAGT 


960 


CAGGGGCCCC 


x x x /Nik x x yi 

AACACAGAAG 


CTGNCGAAAA 


GTCAGCTGAC 


AATAATGTGC AAGTTGGGGA 


1020 


TGGACCCCAN 


CAAAGAGAAA 


TTGTATTCCC 


AAATTCTACA 


GACNATNCAA GTTGGGGTGG 


1080 


GNCGACGAAA 


TAAATTTTGC 


GAAAATATCA 


TTTCTGTCCC 


ACTCCCTTAA AACTTATTCT 


1140 


TTTGTGTAGT 


AAGTGCGTTA 


ATAGCCTTGA 


TCTAACTTAT 


CAATCTTACC TTTACGATAA 


1200 


AATGATTTAG 


CAATATATCC 


ANNTGGTACA 


TTGAAAACTG 


TTGAAGCTAA TTTTAATACG 


1260 


TAAGTTGTAA 


TAAATATTTC 


NAATNCAACT 


GTACCAGGTA 


AACTTCCGAT AAAAGCGATA 


1320 


GCTACAAATA 


AAGCTGTATC 


CAATTATTGA 


GCTTG 




1355 



(2) INFORMATION FOR SEQ ID NO:1070: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070: 



CCTTTCTGGN 


CCANACATTA 


TTTCTCGGGG 


NATTTGAAAA 


ATGAGGGATC 


AGGTCAATTA 


60 


ATTATTGATG 


CACCCACGCA 


CAATCCNAAT 


TGATGTTATT 


AGTAAGTTAC 


ATCAAAATAA 


120 


AGATATTCAA 


TGGCATCAGA 


TTAAATCTTC 


TATCATTGAA 


ACATTACAAC 


CTTATTTATT 


180 


TGAAAAAACA 


GCTAGAAAAC 


CAATGATTTT 


ACCAGTCATT 


ATGAAGGTAA 


ACGAACAAAA 


240 


AGAATCAAAC 


AATAAATAAT 


CAAAAAGCTA 


CTAACTTTGA 


AGTGAAGTTT 


TAATTAAACT 


300 


CACCACCCAT 


TGTTAGTAGC 


TTTTTCTTTA 


TATATGATGA 


GCTTGAGACA 


TAAATCAATG 


360 


TTCAATGCTC 


TACAAAGTTA 


TATTGGCAGT 


AGTTGACTGA 


ACGAAAATGC 


ACTTGTAACA 


420 


AGTTTTTTCA 


ATTCTAGTCA 


GGGGCCCCAA 


CATAGTAGCT 


GGCGGAAAGT 


CAGCTTACAA 


480 


TAATGTGCAA 


GTTGGAGTGG 


GACGACGAAA 


TAAATTTTGG 


GAAAATATCA 


TTTCTGTCCC 


540 


ACTCCCATAT 


TCCCGATAGA 


AAAACAGGAC 


TTGAACTTGC 


AGAAAAATAT 


GGCATTGATG 


600 


GCGTCATGAT 


TGGTAGCGGC 


ATTTTCCATA 


ATCCATTCGC 


TTTTGAAANA GAACCACGCG 


660 
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AACACACAAG CAAGGAACTA TTAGATCT _ 688 

(2) INFORMATION FOR SEQ ID NO: 1071: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071: 



GGCACGAGCG 


GCACGAGCGT 


AAATAGCTGG 


AAAGATGTAG 


AACAGTATTT 


TTTAGATAAT 


60 


ATTGAGAAAT 


AAAATATATC 


ACTTGAAAAA 


TTTCATGTAG 


AAAAGATGAT 


GGATAGGCTA 


120 


TAAAGTAATT 


GTGACTGAGA 


TGAACTTTTA 


TGTCTTAGAC 


ACTACAACAC 


TATATTGGCA 


180 


GTAGTTGACT 


GCCGGGCCCC 


AACATAGAGA 


AATTGGATTC 


CCAATTTCTA 


CAGACAATGC 


240 


AAGTTGGCGG 


GGCCCCAACA 


TAGAGAAATT 


GGAACGCCAA 


TTTCTACAGA 


CAATGCAAGT 


300 


TGGGGTGGGC 


CCCAACATAA 


AGAAATACTT 


TTTCTTTAGA 


AATTAGTATT 


TCTTATGGGA 


360 


GTGGGACATA 


AATCAATGTT 


CTATGCTCTA 


CGAAGTTATA 


TTGGCAGTAG 


TTGACTGAAC 


420 


GAAAATGCGC 


TTGTAACAAG 


LTrnTiu\A 


TTCTAGTCAA 


CCTTGCCGGJC 


GGGGCCCCAA 


480 


CACAGAGAAA 


TTGGATTCCC 


AATTTCTACA 


GACAATGCAA 


GTTGGGGTGG 


GACGACGAAA 


540 


TAAATTTNGN 


GAAAATATCA 


TTTCTGTCCC 


ACTCCCTCCT 


ATTTTTAAGT 


ACACATTAGC 


600 


TTTGACTAAT 


GATAAAGAAT 


CGCTACATAA 


TCAATCATTA 


GTCGTTCTTT 


ATCATTTCCG 


660 


TCCCGCTCTC 


AATAAATGTT 


AGTCTATCTC 


ATTATTATAA 


ATCGGATGAA 


TGTGTTAATC 


720 


TATNGCAGAT 


TACACGTCAA 


CCCGATTTTT 


TATAGAATTT 


GGGAAAGACG 


GATAAACCAC 


780 


TATGATTTNN 


NNTTCAACAT 


CAATCANTTT 


AGGGGGNATG 


CGGCAAAATT 


ATATGTCTNT 


840 


TTTTGGGACA 


GGGGANTAGG 


TTAAAGCTTA 


TAAAAACGAT 


ATAAGGTGCG 


TCGAATCTTA 


900 


TGATTACAAT 


CCAAACCCAA 


TATAATATCG 


GGTCAAGATC 


AANCCGGATG 


GNTTACAAAT 


960 


CANTGACAGG 


AAGTAACTTG 


AATAGCAATN 


TGGGATAACC 


TCAATATCAA 


TAGGTGTTTC 


1020 


TAATGAAATT 


TCGCCATCAA 


TATCAACTNT 


CATTGCTGGG 


ACCTGTTGTA 


AGTGAAATCT 


1080 


TTTTNCCAGG 


AATATGCTCC 


AATACCTTGA 


GTAATTTCAT 


CCAATTCATG 


CTATCACGCT 


1140 


TTTAAAAATA 


TCATTTAAAA 


TACTGAAACT 


TTGTCCATTA 


AAAATGAAAG 


TGTTCAGTTC 


1200 


ACCCATCTTG 


AGGAGACAAA 


TCAGTCAATG 


GTANACGACT 


ACCACCAATG 


AATGGACCAT 


1260 


TTGCTGTTAG 


TATCATGGTC 


GTTTCGCCAG 


AATATGTCTT 


ATCATCTATT 


GATAATTGAT 


1320 


AATTAAATTG 


TGTTGGATTT 


AGCAGTGTTT 


TGACAGTTGA 


TCCAATATAA 


CTCAATTTAC 


1380 


CAAATATATC 


TTTTGANCCA 


TCTTGTACGT 


TTTCAGCGTT 


TTGAACAATG 


AGACCTAAGC 


1440 


CAACAAAGTT 


GAGTGCATAT 


TGATTATTTA 


TTTTAATTAC 


ATCGTATGTA 


CCAACTTGTG 


1500 


CAGAAATCAT 


TTGTTCACTA 


GCTTGTTTAT 


GATTAGGTGC 


TATATTTAGC 


GTNTGTGTAA 


1560 


AATCATTAAA 


AGTACCGCCT 


GGTAAAATGC 


CAATAGGGAG 


TTGAAGGTCA 


TGTGACATAA 


1620 


CACCATNTAT 


AGTTCGTAAC 


CGTGCCATCA 


ACGGCAAGGA 


ATAAATAATA 


TNTCTACATC 


1680 


TTGNGGATAG 


TTTTAGTGTG 


GTTTCTCGGC 


AATATTTAAN 


AC 




1722 
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(2) INFORMATION FOR SEQ ID NO: 1072: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072: 



GTGTTACGGT 


AGGTTGTTGA 


TCCTAATTTT 


CCCTGNCAGT 


TGGTCCCAGT 


ACCCACCCAT 


60 


TTCCCAACTT 


ACTAAAAAAG 


AAATCCAATT 


ANNCCGAAAG 


AAAATATTGA 


AGTCATTGAA 


120 


ATTAACCGAA 


GCGTTCCAGT 


GCACAGGTAG 


TTGCCTCCCC 


AACCAAGCTT 


TAAATATTTC 


180 


CAAATACGCA 


ATTAAATATA 


TGGGGTGGTG 


CATTAGCATC 


AGGTCATCCA 


TACGGTGCAA 


240 


GCGGTGCCCA 


ATTAGTGACT 


CGATTATTTT 


ATATGTTTGA 


CAAAGAGACT 


ATGATTGCAT 


300 


CTATGGGGAT 


AGGGGGAGGT 


CTAGGAAATG 


CAGCATTATT 


TACTCGATTC 


TAACCAGCGA 


360 


TTAAATGTGT 


CATTTTCTAA 


GGATAGTGTG 


GCTGCATATT 


ATCAGTGTTT 


TANCCAACCT 


420 


TATAGAAAAG 


AAGTCTCGTG 


CCCATTAATG 


TGTNCGTCAT 


TATGGCCACA ATTTGATTTA 


480 


TTTAAAAAAA 


TATCCAAATA 


GCGAGCTGAT 


TTTACCACAA 


ATCAGCAATT 


AATCAAACTC 


540 


NCAAGATAGA 


AGTAGAC ACA 


ATATATGTAG 


GGCATTTAGA 


AGATATTGAA 


TGCCGACAGA 


600 


CTCGCAATAT 


CACACGTTAT 


ACAATGGCTT 


TAACATTAAC 


TAAAAATGAT 


CAACATGTCA 


660 


TANCGGTTAC 


ACAAACTTTT 


ATTAAGGCGA 


TGAAGTAGAG 


ATGAAGTTTA 


ATGAGATATG 


720 


GATAAATGAA 


TATTTGGCGC 


TCGTAAATGA 


TGATAATCCA 


ATACATAATG 


AGATTGTGCC 


780 


AGGACAATTA 


GTGAGTCAAA 


TGATGCTGAT 


GGCTATGTCA 


TTAGAGACAA ACCAGTGTCA 


840 


AATTAACTAC 


GTTAAACCTA 


TTTTAATAAA 


TGAAAATATC 


GAATTCATTG 


AACAACACGA 


900 


ACACGAAATT 


ATAGCAATTA 


ATGACGATGG 


AGAGATTAAA 


ATAAAAATTT 


CTTTGAGCAC 


960 


AAAAAAATAA 


CCGATATTAG 


CTGCATGAAC 


GCATATTAAT 


TAGGAGATGA 


AAGGACAGCT 


1020 


AATATCAGTT 


ATGTATTGTT 


ATTATTATTG 


GGAACAGAGA 


TGAATATAGG 


TTACGTTTCT 


1080 


TTCTTTGCAC 


GGGGATGCAT 


TAATCTAAAA 


TAATAATAAC 


AACTATATCA 


ATGTTTAATA 


1140 


AATTCTGGAT 


TATTGGAACG 


ATTAGTCAAT 


TTAACTAACT 


TNCATATGAT 


CTATATCGTC 


1200 


TTGTNATAAA 


GAGAGCAATT 


TGAATATTTC 


AGTATCACTA 


AATGAATCGN 


CACATTTAAT 


1260 


TGAAACATGC 


TGAAACGTTT 


GGGTTATAAT 


TTCATAAACT 


GGTGCGCCTT 


CATGGTGATA 


1320 


CTGTCGAATA 


AATAATCATA 


ACCATATTTA 


CCTCCTTNGG 


CTACTCTATG 


GGTATATTAT 


1380 


AAATAACATT 


TTTATGTGTG 


ACATCAACCT 


TAAGTATCAA 


CTTTTTATCA 


GACATAGAAC 


1440 


GTANGATTTA 


CTAAGACTAT 


TTATGTATAA 


AAGTTCTAAA 


TAAATATATA 


TTTATAGAGT 


1500 


CGCCTGGCAG 


NCATTTGGGA 


AATATAACAT 


ATATGATTAG 


AGAGGCATCT 


ATCGCAAAAG 


1560 


AATGATAATG 


ATAGAGGTAT 


TGAGCATATA 


GATGAGTTTA 


AGTTCATCTT GAAAATAAAG 


1620 


GGTTATTTAG 


TCATAGATGT 


AGATGTATAG 


GAAATATTTG 


TATGTATTGN 


TCGATATGTA 


1680 


TGAAATTTTC 


AATAAAAGCT 


AATAACGCTT 


ATATGTAACT 


TTCAAATTTA 


AATTATATAC 


1740 


AGAGCATGAT 


GATTATAAAA 


AAATANCCAC 


ATCACATAAA 


TTGAGTTCAT 


ACCCAATTTA 


1800 
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AGTGGTGTGG 


C T AAT AATGT 


TC3ATTTATAG 


ATGAACCGCC 


TAATCGTTAA 


AC CTCTGTT A 


1860 


PTTT" AAP ATT" 


RATATf5TTCA 


£\ l/UvW X X w X 


ATGTACCGTG 


ATCCACAGGA 


CCAACAAAAT 


1920 


r* & 'Pfv^ at'P'PT 

V— t\ X X\»f\X A * X 


LUflnLUU * X X 


TTAATAGCAG 


AAGCGACGAA 


AGCTTTCGCT 

XaxJX^ X A X V^VJX,^ X 


CGTGCTAATC 

w a V3v» x nn x ^ 


1980 




i\>uuiun\.l x 


niiUu x x /iwv- x 


AAATATGCAG 

IW*4 **X Wwxiw 


GTGTTGCCGC 


AGCAAATGTA 


2040 


C AAC CAGC AC 


PATGGTTATA 


ACTTTGTTGG 


AACATGTCTG 


TTGTTAGTTG 


ATAAAATGTG 


2100 


TGACCATCAT 


AGTATAAGTC 


ATACGATTTA 


TCTTGATCTA 


AAGCTNTGNC 


ACCTTTAATG 


2160 


» m/-i » « » fTV*V*IH 

ATGACATGCT 


GTGCGCCTTT 


ATCAAAGA 1 A 


A 1 1\»N X\J*JAlj 


CLI 1 xTJACAT 


ATCTTCAATT 


2220 


GAATTTAATT 


TACCTAATCC 


TGATAATTGA 


CCCGCTTCAA 


ATAAGTNTGG 


TGTCACTACC 


2280 


GGTGGTTTAG 


GTAGTAAATA 


TTTAATCATC 


GCCTCAGTAT 


TTCCAGGATT 


AAGCACTTCA 


2340 


TCTTCGGCCT 


TTACAANCCA 


TGACAGGATC 


TTACTACAAA 


ATATTGTGCA 


ATTAGATGCT 


2400 


CATATACTTC 


TCCAGNACGG 


GTGGATATCT 


CCTCAAG 






2437 



(2) INFORMATION FOR SEQ ID NO: 1073: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1073: 



GGCCCCTCCT 


GAAGGTTGTN 


CCNCCCCCAG 


TATGGTGCAT 


GGACGAAATG 


CCTCAATTCG 


60 


CTTGTTGCAG 


ATTATCTCCT 


CCACCACCAC 


AAGTTTCGTC 


TTTATCGTCC 


TCCNATAGAA 


120 


GTAGGGTTTC 


CTACTACTTG 


NCCATTCATA 


ATCTGTTGfCA 


TTTCATGTTT 


GATNTGGTGA 


180 


AGATTTAGGA 


GCCACAAATT 


GCTTGCATAT 


TATTGCCGAC 


TTCTAATACA 


CCAGATGCGC 


240 


CTAAATCTTT 


CACACCAGGA 


ACATCAACTT 


TAGATTTGTC 


GTTAACTTCA 


ACACGTAGAC 


300 


GTGTGATACA 


AGCGTCTAAA 


TGTTTAATGT 


TTGCTTTGCC 


ACCCATAGCT 


TCTAATACTG 


360 


CATATGGTAA 


TTCAGTTGCT 


GAAGCAGTAA 


CAGCTTGTGA 


TTGCTTATCT 


TCACGACCTG 


420 


GTGTTTTGTA 


TTTTAATTTT 


ACAATTAAGA 


ATCGGAATAC 


GAAGTAGTAA 


ATAACTGCGT 


480 


ATACAAGACC 


TACAGGAATG 


ACTAACCACC 


ATTGTGTCTT 


ATTAGGTAGT 


ACGCCAAGTA 


540 


AAACGTAGTC 


GATGAAACCA 


CCTGAGAAAG 


TATAACCTAA 


GTGAACATCC 


AATAAGTACA 


600 


ATGTTAAGAA 


TGATAAACCA 


TCAAGTACTG 


CGTGAATAAA 


GAATAATAAT 


GGTGCTACGA 


660 


ATAAGAATGA 


GAATTCTAAA 


GGTTCTGTAA 


TACCAGNTAA 


GANTGATGTT 


AAAGCAGCAG 


720 


AACCCATTAA 


ACCTGCTACT 


ACTTTCTTAT 


TTTCAGGGTT 


AGCTGTGTGA 


TAAATTGCTA 


780 


AAGCTGCTCC 


AGGTAAACCG 


AACATCATAA 


CAGGGAATTC 


ACCTTGCATG 


AATTTACCAG 


840 


CTGTCAAATG 


TGCGCCTTCA 


CGAATTTGTT 


CGATAAAGAT 


ACGTTGGTGC 


ACCGTGAATA 


900 


ATTTCACCAG 


CTCGTGCAAA 


CTGTGCATTG 


CGTTATTTAA 


TTCTGTTGCT 


TTTGNAGNTT 


960 


CTTGATTTAC 


ACCAGCTACT 


GTAGTTGCAC 


CATCAATGGC 


ACGAGCGTTA 


ACGCATCTTT 


1020 


TTGTGCATTG 


TTAATTGATG 


TTAAGTTATT 


CAAGTTGTTT 


TTCGCTGTTG 


TCTTAGCTTG 


1080 


AGCTAAGTTT 


TGCGCACCAT 


TTAATGCATG 


TTCTTTAGTT 


GTCACTTGTG 


ATGTTGCTTG 


1140 
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CGTAATAGTA 


TTTGGTTCCA 


TCGTTGGGTT 


CGATGTTTGA 


TTAATGATTG 


CTTTTGCTGC^ 


1200 


AGTAACTGCG 


CTGTCGTACT 


CGTTACGATT 


ATTTGGACTT 


GCGTCAGTGT 


AGTTTTGACC 


1260 


TGCTTTAATC 


GTTGCTTCAT 


TCGCAATACT 


ATCTCTTAAG 


CCTTTCATCG 


CTGTATTCAA 


1320 


CGTTTGAGAA 


CTATTTTTAA 


CAGTTTGAAC 


ACCATCTAAT 


GTAGTTGCAC 


TTTGAACTTG 


1380 


TTCTTTTAAT 


TTTGCTTTTT 


GTACATTATT 


CAATTGTGCT 


AAGCCGTCAA 


TTGTATTGTT 


1440 


AGCATGCTCT 


TTCGCTACAC 


GTAAGTTGTT 


ATCACCATTT 


AATGCTGTTT 


TAGCCGCATT 


1500 


GACTTGTGAT 


GCCGCACTAT 


TTACGTCATT 


GACTGTTAAT 


GTCGCATTTG 


GTGTACCATT 


1560 


AATGATATTT 


TCAGCGTTAG 


TCACTTTTGA 


ATCATACGCT 


TGACGTTTTT 


CTTGATCAGC 


1620 


ATTGACATAA 


TTACCGCTTG 


CTTTTGTTCT 


TTCTTTATCC 


GCAATAGCTG 


CNTTAAGTGC 


1680 


TGTCATCGCA 


CCGTTCAATT 


CAGTAGCTGT 


ATGTTGAACG 


TTATTTGNAT 


CAGATACACG 


1740 


TTGTGCAGTG 


GGAATTCCG 










1759 



(2) INFORMATION FOR SEQ ID NO: 1074: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074: 



TGAANTATTT 


TNCATAAGCT 


TATTCCTCCT 


TATATTNTCT 


CCACTTCATT 


GTTTNANGTG 


60 


NCTCCTGCAN 


CATAGCATTA 


GAATCAAAAG 


GAACCATTAG 


NCTGCACAAC 


AAGNTAATTT 


120 


CGATACGTCT 


TNATCAAACG 


TCAAGGGTGC 


AAGCTTTAAT 


CCTTCAGCCA 


TCGTNAAATA 


180 


AGGNGCAAAG 


CTATCCGTAA 


GGTCTCCAAT 


GGTCANACCC 


AAATTTGAAC 


CGCTAACCTT 


240 


GNCGCATAAA 


ATAACATCTC 


CAGCATTTTC 


ACTCACAATG 


TGCGCTCCGA 


TCAAATTTCT 


300 


GGGTTTGGGC 


GTTGNCTACA 


AGTTAATAGA 


CCCCTGTTGT 


TTCGTGATTG 


ACTAAGGCCC 


360 


GCGGTACAGC 


GTCCAACGGA 


AGGACCGATG 


TTTTGACATC 


GTAACCTTTT 


TCTTTTGCCT 


420 


GTTGTTCAGT 


CAAGCCGACT 


GTGGCGATCG 


ATGGATTGGT 


GAAGGTTACG 


CCGGGAACAA 


480 


AGCGAAGATC 


GATTTTGCGT 


TTCGCTAGAC 


CCAACGCATT 


ATTTGCCACA 


ATCCCGCCTT 


540 


CATAAGCTGC 


AACATAAACG 


AATTGCGGAC 


CGAGGGTCAC 


ATCGCCCGCG 


GCATATATTC 


600 


GGTTATTCGA 


CGTTTGCAAA 


TATTCATTGG 


TCAGCACTTC 


GCCTTTTTTC 


CCTGTTTTCA 


660 


CACCTGCTGA 


TTCAAGGTTT 


AAAGTCTCTG 


TGTTCGGCTT 


TCTTCCTGTT 


GCCACGAGGA 


720. 


CTTGATCGGC 


TTCGATGACT 


TGTTCTTGAC 


CGTTCACTTC 


AATATAAATG 


CTTGTCGACT 


780 


TACCGTTTTG 


CTCAACCTTT 


TGATAAGTGA 


CCCCAGTGAT 


CAGGTTAAGT 


CCTTGCTCAG 


840 


TTAAGGATTC 


ATCGATGGCT 


TCGGAAATTT 


CAGGATCGTA 


GGTTTTAAAC 


AGACGCTCGC 


900 


TTCTTTGCAT 


GAGAGTCACT 


TCTGTTCCGA 


GGTTGTGAAA 


CATTTGACCT 


AATTCCGCTG 


960 


CGATATAGCC 


AGAACCGATC 


ACTGCCAATC 


GTTGTGGAAC 


CTCTTTTAAT 


TCGAGTGCGG 


1020 


ATGTACTTGT 


TAAATAATCA 


ACCTCATTCA 


TTCCCGGGAT 


TTCCGGAACA 


GCCGGAGAAG 


1080 


CCCCCGTTGC 


GATTAAAAAG 


CTTTTAGACG 


TGATGTTGTG 


TCCATTCACT 


TGTATCGTCT 


1140 
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TATCGTCGAT 


AAACGAGGCC 


TCGCCACGAA TGAGATCAAA 


TCCATATTCT 


TCGATCAAGT 


1200 


CTATATATTT 


TTCTTGACGC 


ATTTGACTGA 


CTAATCCATC 


TTTTTGTTCG 


GTCAATTGGG 


1260 


CAAGGTCCGC 


AGCACCGGTA 


CTCGTTTGAA 


GTCCGGTAAA 


CGGATTGTTT 


TGGGCGAGAC 


1320 


CGTTTATTTC 


ACCGGCACGA 


AGCATGGTTT 


TTGACGGTAC 


ACAACCGATG 


TTAACGCAGG 


1380 


TCCCCCCGAC 


GGTTCCCCGT 


TCTACCATGG 


ACACTTTCGC 


CCCGTTTTCA 


TTGGCCTTGA 


1440 


TAGCTGCAGA 


AAACGCCGCA 


CCGCCGGAAC 


CAATAATCAG 


AAGATCGTAA 


TCGCCATCCC 


1500 


GATTGAAATC 


TACACTGTTT 


TCAGAGGGCT 


GGCTTTCTTC 


CTCTCCGGGT 


TGATAGCCGG 


1560 


CTGCCGAAAT 


ATTTTGCTTA 


GCCTTTTCGA 


TCTGATCATC 


GNTGAGTTCA 


AAAATGGCCG 


1620 


CACCAGCGTC 


GGAAATCGGC 


CGAAACATCT 


TTAGCTCCGG 


CTTGTTCCAA 


TGCTTCGGTT 


1680 


A\^A X VJ X X V* X 1 


CACAGCCFGT 

v>n\«nvjv\> x * x 


GCATGTCATG 


CCTTGAATGG 


GTATTTTATA 


TGAATTTTGA 


1740 


fTTT* A r T w PTY^A A 

\J X V>A1 X X 


X ^^K^W X W XXX 


TATTTAAAGT 


TTGATAGACA 


GTTCTGTACA 


TTTATCTCAT 


1800 


ATTTATCTTG 


TCTCATGTTT 


CCGATCCACG 


CTCTCTGGAG 


GCGTACAGCA 


ATCCGTTTTA 


I860 


TTTTTrGTrPT 

X A X X A X %^ X A X 


TTCCATTGCC 


TTCAGGAGAA 


TTGTATTTGT 


TCTTCAACAA 


CTTGCTGAGG 


1920 


ATCAGAGCAA 


TCATAACAAT 


GGCCAATGCT 


GCAAAAATGC 


CGGTCAACCA 


CCAATTTCCT 


1980 


GTCGCACCAG 


CAAAGAGGGC 


ACCAATACCT 


GTGCTCCCTA 


ATGCAACAAG 


GAGAATCGGT 


2040 




AAAGTAAGGA 


ACCAATAAAG 


GAACCAATAA 


TATCCCGAAA 


CAGGCCCAAA 


2100 


AACACCAGCG 


ftRTACTTTTP 

x nwv< xxx v» 


GAGCGATTCT 


CATCCATCAT 


TCTCACCACT 


CTTTCTAAAA 


2160 


AAAAATTTTA 


TTCTTTGGGC 


ACGAGGTACA 


TACATGNCCT 


ATAGCCTAAC 


TTGAAATTTG 


2220 


TTCGTTTAGC 


TTTTTTACAT 


CTTTCATCTA 


TTTTTTGTTA 


CACCTGATCG 


GTTGGATTTT 


2280 


AAAATGTACA 


AATGCCACCA 


GAACCGAAAT 


ACCAGTAATA 


AGCCATGTAC 


AAAACGGAGA 


2340 


GGAAAATCAC 


AACAGCTGCT 


ATTTTTTGGA 


TAGAACCCAT 


ATAGTTGTGC 


AGAAACTTCT 


2400 


GTACCAAGTG 


GTCGTGAAAT 


CAANGAGACC 


ATCGTGATCG 


CTGNCACCAC 


AACTCCCATA 


2460 


CCAAGGGAGT 


AGATGATGAA 

nun x wv« x 


CTTGATGATC 


ACGGCCGTTA 


CGCTATTGTC 


ATTCAGCGAT 


2520 


GCAGAGACCA 


PCAAPATAAN 


GGCTGGCAAG 


GTACAACCAA 


GTGATGTCAC 


GGCATAGGCT 


2580 


ATTCCGTAAA 


AATAGATAGA 


CCATTTCCCT 


GGCTTGACTT 


GAAAAGATCC 


TATTTTANTC 


2640 


GGCAAATGCT 


TCCCGAATAG 


CATGCCCAAC 


CCCAATAAAG 


CAATGAGTAT 


ACCCATAACC 


2700 


AATGAAAGAA 


TCGGAAAAAT 


CCCTGTTAGT 


GCGCTTCCCA 


ATCCTCCTAT 


CAATAAACCA 


2760 


GCCAATACAA 


AAATCGTTAT 


TTAACCCCGT 


GGTCATCGCT 


CCACCAAGCC 


CTAATCCTTT 


2820 


AAAAATTGNA 


TAGNGGAACG 


AATGATCCTT 


TGTTTCGCCT 


CCAATTAAAT 


AGGAGATATA 


2880 


AGAAGGAAGC 


AAGGCGATAC 


CACAAGGATT 


AAAGGCCGCA 


ACCATTCCAG 


CGGTCAGAAT 


2940 


GAAGAGAAAC 


GAAAAACTCA 


TAATTCCACT 


CCTATTTGAG 


TCAACGCTTC 


TTTTAAATCA 


3000 


TCAAAGGACG 


GTCGAACAGA 


ACGATAGAAT 


ACCTCATTTT 


CAGAATTGAC 


CAGTACGATC 


3060 


TCTTCCAATT 


GTTTGACGCC 


ATAGGTATCC 


GCAATTTCTT 


TACCATTCTT 


TAAAACATGG 


3120 


GGCCAATCCC 


CGCCATAATC 


CTGTTTAAAT 


TTCGCAAGGC 




ATCTGTGTTA 


3180 


GGGTCTAAAC 


TAACTGTGAT 


CAATTGAACA 


TCGTTCGGGT 


TTAGTTGATG 


CATTTCCTTA 


3240 


AAGATTTCCT 


CATTGTATAT 


ACAAGATGGA 


CACCAGGTTG 


CCATAAAATA 


AATGAGTGTG 


3300 


GGWTCTCAT 


TAGGTAGGCT 


GACTGGATTT 


CCTTGAATGT 


CTTGAACCTT 


TGATTCATTC 


3360 


GCTCGTGCCA 


TCAGATTCTG 


CTCCACAAGC 


TGTTAAACCG 


ATTAACAACA 


CCGTCATTAT 


3420 


AGCGGTGAAG 


GAAATGCGTT 


TTTTCATTCA 


AATCTTCCCC 


CATTTATTTA 






CCATTAACGT 


TTCAATAATA 


GGACAGGTAT 


ACATCGCCTT 


TTCATCTGGA 


CACTTTTCTT 


3540 


TTAATTCCTC 


TAATAACCGT 


TGGATTCGTA 


ACAAACCCTG 


CACTTTCCGC 


TCGATTTCTT 


3600 


ItSGmTlTllS 


GAACGGTAAA GGCGTACATA 


TCTTTACATC 


TCTCCCCATC 


TTGATCAACC 


3660 


ACACCAAACA ACAGGTCGAT 


TTCCTTTAGC 


GAGAAATCCA ATTCCTTCAA 


TCGGTTAATA 


3720 
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AACCGTACCG ATCTGCTGTT TCCTCTGAAT ATATTCGATA CCCTGATTCG TATTCTGGGA 3780 
GGCCCGNCTA TCAATCCTTT TCCGCTCGGT AATACGAACG GG 3822 

(2) INFORMATION FOR SEQ ID NO: 1075: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075: 



GGCACGAGAN 


TTTTANGCCG 


AGAAAACTTG 


TTGGAGAGAT 


TAATTTTTTT 


ATTTTTTTAT 


60 


TTTTTACTTT 


TATTTTCTAC 


AAATATTGTG 


GNAAATCTCT 


TTATGAGTAA 


ATAAAAGATA 


120 


GAAGCATAAA 


GAATAAATGA 


AAATGTGAAA 


ATATAAGTTT 


ATATGAATGT 


TGTAGAAAAT 


180 


CTTCAAAATT 


ATTAAGTTTA 


ATACGATCAT 


ATTAGCAGTT 


ATAAAGAATA 


TGAATAAGAT 


240 


AATATTATTA 


AATTTCTTTT 


TGTATNTAAT 


ATGNAAAAAT 


ATTANTGGGA 


AAAAGTTCAA 


300 


GAAAAATAAA 


TATTAGATAA 


GTTACTATAN 


AGAATAATGC 


AATATAGCCA 


ATGTTATAAA 


360 


ACATATTATT 


TGAATACCAA 


TTATTGTTAT 


TAAAGAACTT 


ATATTAAATT 


ATCTGCTAAT 


420 


GATTGGAAAC 


CTATTATGCT 


AAAAAATGGT 


ATTGAGAATA 


CTAATACGGA 


TATAGGTATG 


480 


ATTATTATTC 


CGAGTAAGAA 


CGTTACAATT 


ATAAAGTTCA 


ATAAGTACTC 


TTTGTTTACT 


540 


TTTTTCATAA 


TAATTTAATT 


CCTTTTATGT 


ATATAATATT 


TAATAATAAA 


AAATTTTAAC 


600 


ATTTATAATT 


GGAGGATAAA 


AGTCATATGA 


AGAATTTAAA 


AAGTTTTTTT 


ATTAAAAATA 


660 


CTTTGGTACT 


AACATCTACT 


GCATTATTAT 


TTTCTAGTTT 


TGAACCTGTA 


GTTCATGCTG 


720 


CAGAAAATAA 


GGAAGTTGTT 


AAAACAGTAG 


AAAATGATAA 


TGTAGAATTC 


AATGAATTGT 


780 


TAGAAGGACA 


AGAATATATT 


TGGGAAGTTT 


TATCTAGAGA 


TGATGAAGGA 


TTTAAAATGT 


840 


TTCTAGATGA 


GCAAAGAGCT 


TTTAATCCTA 


ATTTTGATAA 


TGATGTTTTA 


CTAGCAAAAC 


900 


AAGAAGGTGC 


TTTAAATTCT 


TCAGTATTTG 


CTAATAATTC 


TATGACTTTA 


AACTCAAATA 


960 


AGCCTACTCC 


TAGAGGTCCT 


ATCGGTGCAA 


CTTTAAAAGC 


TATTAAGGCT 


CTATCACCTT 


1020 


CTTTGAGACA 


TGGTGGGAAC 


TGCTATATCT 


TGGATCATTA 


AACCATTAAG 


TAGAAAACAT 


1080 


GCTGTTCTTA 


GTAAAAAAAT 


ATTCAAGGAA 


AGATGTCTTA 


ATGCAAATTG 


ATAGGTTAAG 


1140 


ATAAAGTCTT 


ACGGAAATCA 


GCTGTTGAAA 


AAGCTTTGGG 


TAAAGCAGGT 


ATACCAAGAN 


1200 


GGTGGATGCG 


AAAACTTTAA 


CTTATATTGG 


ATTTTTAGGT 


TATTTAGACA 


CTGAGCTGAC 


1260 


TTGCCCTAAG 


ATTTTATAAA ATCTTAGGGC 


AAGTTTTTTT 


CTTTTTTCTT 


ATCTNGATAA 


1320 


AGTAGTATTT 


TCAATTTGTG 


ATTGGGGTGA 


TGATTTTCTA 


GATACAGTTA 


AAATTAATAA 


1380 


TCATAGATAA 


TGAGAGAACC 


TACAATTATC 


ATAAGTTTTC 


ACCTAAAAAG 


GATATATTAA 


1440 


AGTTTTACAA 


ATCTTTAATG 


CACCTTTTAG 


AGTGTTTTTA 


AATCTATTCT 


TTAATAGACT 


1500 


ATAATATTCA 


CTATTTATTA 


CACCGTTTTT 


TCTTTAGATT 


AATGAATTTA 


GGTGATGATT 


1560 


TTGAACTGCG 




TGTCTTGATA 


CTATAANAAN 


CGAAGTTATA 


TAACATAAGA 


1620 


CGACTTATCG 


GAAGTAGATA 


TGCTTGTATA 


TTCACAGATG 


GTATAATAAT 


AGAGTCGCCT 


1680 
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AIv* 1 V- IT— AoO 


LulvAAAl 1 1 


AuAVJvi\~AuAU 


AGGAGGTGTT ATTCATGCTA GAAATTCTTG 


1 74fl 


I 1TJAV-. A 1 \_~A\_ 


VjA^UAl^AV? IV 


AlVAvluul 1 


GTATTGTTGC 


GTTATTTACG 


CATTGGCTAC 


i pfin 


AA 1\AjvJAA 


l-lsA 1 AAA 1 Ala 


rw* a r^fP & f a 
liCuAl. IAIAu 


TCACACCTAA 


AAAATCCCCT 


CACTACGGCA 


looU 


AlAUT(jA\iWj 


^* JA rpnV^^TV^T A 

VxAl 1 1 o 1 A 


1 1 V-A\_vj»V_ lAb 


AATTCTTGTT 


AATCATATTA 


TAACACTGGT 




T* A P«P A ^* A a. 
lAuiiVbAoun 


aiwa & & a Af* 


A T A A CI A A A CYZ 


TAAGTGTAAC 


TCTATAATTA 


GAATTATTAA 




(pfiwpnv'nMnv* a 
in i \- in i vzA 


rniv/-»m> rpiv AAA. 
1 xW 1 A 1 AAAn 


UaI L \_ 1 Vj X i. o 


CAAAGTTGAA 


TTTATAC3TAT 


AATTTTAAC A 




AAAAGGAGTC 


TTCTGTATGA 


ACTATTTCAG 


ATATAAACAA 


TTTAACAAGG 


ATGTTATCAC 


2100 


TGTAGCCGGT 


GGCTACTATC 


TAAGATATGC 


ACTTNCCGCT 


CGATTTCTTT 


GGTTTNGTGA 


2160 


ACGGNAAAGG 


CGTACATATC 


TTTACATCTC 


TCCCCATCTT 


GATCAACCAC 


ACCAAACAAC 


2220 


AGGTGGNTTT 


CCTTTAGCGA 


GAAATCCAAT 


TCCTCATTCC 


GNTTAATAAA 


CGNACCGATC 


2280 


TGTGTTCTCT 


GAATATATCC 


GATNCCCNGA 


CTCG 






2314 



(2) INFORMATION FOR SEQ ID NO: 1076: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1076: 



CCCCAAGTCT 


CATATGAAGT 


AGTTGGATTA 


GTTCCGAAGT 


ATGAGGATGG 


GAATTTCTTG 


60 


ACATAGTAAA 


GTGCCGGATG 


ATAGTGCAAG 


AGGTATCGTT 


GCACAAAATA 


TGACAACTGC 


120 


TGAGATTGAA 


ACATTTGGTT 


CAGATCCAGT 


TATTATGCCA 


ACGGGGTGCC 


CCTGGTATTA 


180 


TTTTCGGTAA 


AACAACAAAC 


TCAATGATTA 


ATACAGGATC 


AGCGGCTTCC 


ATTGTTTACC 


240 


AACAAGGCGC 


TATTTATGCT 


AATGCTGAGT 


TCATTCAAAT 


TCATCCTACT 


GCAATCCCTG 


300 


GTGATGATAA 


ACTCCGACTA 


ATGAGTGATC 


AGCACCNTTG 


TTGAAGGTGG 


ACGAATTTGG 


360 


ACATATAAAG 


ATGGTAAGCC 


TTGGTACTTC 


TNAAGAAGAG 


AAATATCCTG 


ATTATGGTAA 


420 


CTTAGTACCT 


CGTGATATCG 


CAACCGCGTG 


ANATTTTCGA 


TGTATGTATT 


AACCAACAAA 


480 


TTAGGNATAA 


ATGNCGAAAA 


CATGGNATAT 


CTTGATTTGT 


CACATAAAGA 


TCCAACATGA 


540 


GTTAGATGTA 


AAACTAGGTG 


GTATCATTGA 


AATTTATGAN 


AAATTCACTG 


GTGATGACCC 


600 


ACGCAAAGTA 


CCAATGAAGA 


TTTTCCCAGC 


TGTTCACTAT 


TCAATGGGTG 


GTCTATATGT 


660 


AGATTATGAT 


CAAATGACAA 


ATATTAAAGG 


GTTATTTGCA 


GCTGGAGAAT 


GTGACTTCTC 


720 


TCAACATGGT 


GGTAACCGCT 


TAGGTGCCAA 


TTCATTGTTA 


TCAGCGATTT 


ATGGTGGTAC 


780 


AGTAGCAGGT 


CCAAACGCGA 


TTGATTAATA 


TTTCAAATAT 


TGATCCGATC 


CATATACTGA 


840 


TATGGACNGA 


AGTATTTTTG 


AAAAACGTAA 


AGCTGAAGAG 


CAAGGAACGT 


TTTGATAAAT 


900 


TATTAGCTAT 


GCGCGGTACA 


AGAAAATGCC 


TATAAATTAC 


ACCGTGAACT 


TGGTGAAATT 


960 


ATGACACCAA 


ATGTAACTGT 


TGTTCGTGAA 


AATGAAAAAC 


TGTTAGAAAC 


AGATNAAAAG 


1020 


ATTGTTGAAT 


TGATGAAACG 


TTATGAAGAT 


ATTGATATGG 


AAGATACTCA 


AACTTGGAGT 


1080 


AACCAAGCGG 


TATTCTTTAC 


CCGTCAACTA 


TGGAACATGT 


TAGTACTTGC 


ACGTGTTATT 


1140 
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ACGATTGGTG 


CATATAACCG 


TAACGAATCA 


CGCGGTGCCC 


ATTATAAACC AGAATTCCTG 


1200 


ATAAGTATTT 


TCGTCAGAAG 


TACATAATAT 


TTAAATCATT 


TAATTTTGAG 


AAGCATGGAA 


1260 


AATTTTGGAA 


CAAATGGTTT 


TACGTAAGAA 


AATGGAAACA 


TAAGATTTTA GATGGTCATC 


1320 


AGCTTAATCA 


AAATATATAT 


GATCAGCGTC 


ATTTAATGAC 


AATCAATACT 


GATGAAATTG 


1380 


AAAAAATGAT 


TATAGAGACA 


AAGAGGGCAG 


AGTTGATTCA 


TTGGATATCG ATACTTCCAG 


1440 


TCATCATATT 


CAATAAAGGC 


TCTCGTTTAG 


TAAAGTATAT 


AAATATTTTC 


TATGCAATGA 


1500 


TAGCTAATGT 


TCCAATCATT 


ATTGTGCAAC 


GCTATAATCG 


ACGAGATTAA 


CGCAGGTACT 


1560 


ACGCATTTTA 


AAACGAAGAG 


GTGAACGTCA 


TGACTAAACA 


TATCATCCGN 


TATTGGGTGG 


1620 


GNGGCTTAGG 


TGGGATTTCN 


TGCAGCAATT 


CGAATGGACA 


AAAGTGGCTA 


TTCGGNCTCA 


1680 


TTATATGAAC 


AAAATACTCA 


TATAGGAGGC 


AAAGTGAATC 


GNCATGAATC 


AGATGGCTTT 


1740 


GGCTTTGATT 


TAGGTCCATC 


TATTTTAACG 


ATGCCTTATA 


TTTGTGAAAA 


ATTATTCGAA 


1800 


TATAGCAAGA 


AGCAAATGTC 


AGACTACGTT 


ACAATCAAGC 


GTTTGNCACA 


TCAATGGCGT 


1860 


AGCTTTTTTC 


CAGATGGCAC 


GACTATCGAT 


TTGTATGAAG 


GTATTAAAGA 


AACAGGTCAG 


1920 


CATAATGCGA 


TATTGTCGAA 


ACAGGATATA 


GAGGAACTGC 


AAAATTATTT 


GAATTATACA 


1980 


AGACGAATCG 


ATCGTATTAC 


TGAAAAAGGG 


TATTTTAACT 


ATGGTTTAGA 


TACACTATCT 


2040 


CAAATTATTA 


AATTTCATGG 


GCCATTAAAT 


GCTCTTATTA 


ATTATGATTA 


TGTACATACT 


2100 


ATGCAACAGG 


CCATAGACAA 


GCGTATCTCG 


AATCCATACT 


TGCGACAAAT 


GTTAGGCTAT 


2160 


TTTATCAAAT 


ATGTAGGTTC 


TTCATCATAC 


GATGCGNCAG 


CTGTATTATC 


TATGTTATTC 


2220 


CATATGCAAC 


AAGAGCAAGG 


CCNTTGNTAT 


GTAGAAGGTG 


GAATCCATCA 


TTTNGCCAAT 


2280 


GCCTTGGAAA 


AGCTAGCGCG 


TGAAGAAGGT 


GTCACAATTC 


ATACAGGTGC 


ACGTGTGGAC 


2340 


AATATTAAAA 


CATATCAAAG 


ACGTGTGACG 


GGTGTCAGAT 


TAGATACAGG 


TGAGTTTGTA 


2400 


AAGGCAGATT 


ATATTATTTC 


AAATATGGAA 


GTCATACCTA 


CTTATAAATA 


TTTAATTCAC 


2460 


CTTGGATACT 


CAACGATTAA 


ACAAATTAGA 


GAGGGAATTT 


GAGCCGGCAA 


GCTCAGGATA 


2520 


TGTGATGCAT 


TTAGGTGTTG 


CTTGCCAATA 


CCCGCAATTA 


GCACATCATA 


ATTTCTTTTT 


2580 


TACGGAAAAT 


GCTTATCTCA 


ATTATCAACA 


AGTTTTTCAT 


GAAAAGGTAT 


TGCCAGATGA 


2640 


TCCGACCATT 


TATCTAGTAA 


ATACGAATAA 


AACTGATCAC 


ACACAAGCGC 


CAGTAGGGTT 


2700 


ATGAAAATAT 


CAAAGTCTTA 


CCACATATTC 


CATATATTCA 


AGATCAGCCT 


TTTACCACTG 


2760 


AAGATTATGC 


GAAGTTTAGG 


GATAAAATTT 


TGGATAAATT 


AGAAAAAATG 


GGACTTACTG 


2820 


ATTTAAGAAA 


ACACATTATT 


TATGAAGATG 


TTTGGACACC 


GGAGGATATT 


GAAAAAAATT 


2880 


ATCGNTCTAA 


TCGTGGTGCA 


ATATATGGTG 


TTGTCGCAGA 


TAAAAAGAAA 


AACAAAGGAT • 


2940 


TTACCTTTCC 


TAAAGAAAGT 


CAGTATTTTG 


AAAACTTGTA 


CTTTGTAGGT 


GGATCAGNAA 


3000 


ATCCTGGTGG 


TGGCATGCCA 


ATGGTTACAT 


TAAGTGGGCA 


ACAANTCGCA 


GACAAANTNA 


3060 


ACGCGCGAAG 


AAGCGAANGA 


ATAGGANGGT 


GANATCTATT 


GAAATCGGTN 


NTCACGACTA 


3120 


TTTAANAGAC 


AATAGTGGNN 


ACCCATCNTC 


TTNTGGGGNC 


TTGGGAGCAT 


TTGGATNTTC 


3180 


NNCGGGGGNC 


A 










3191 



(2) INFORMATION FOR SEQ ID NO: 1077: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077: 



OOXyOX.rpx TV T» 


AAAGAGCG1.A 




innl X X Innu 


Tern & TY^rr* a 

XAj 1 vjA l\,l, 1^ A 


AAblnnl 111 


OH 


CGTGATTCAT 


TGAAAATTGT 


TT\r*r*/^ x. x x /"*ivT7A 
TCCGAAAGNA 


t-t-GM I\jTTtLA 


XX.CAAGAUAA 


TATTTACCTT 




AACAATATTG 


ATTTGGGGTT 


ACCTGGAACvi 


ATGTTATTAA 


GTTTGGTTAG 


CTGACAAAAA 


180 


TAGTCGTTAC 


CGCCATCGCA 


GGACCCAATG 


CCGATTGTAA 


TGGTTAGCAT 


mil hum n » it mn m 

TATTAAATCA 


240 


ATGATGCATC 


CAAAAAATGG 


fv ik mik mx mx /-» x 

AATATATAGA 


TGCAAGTCAT 


ATTGTGATTG 


CGTTGTTAAT 


300 


CATCCTTGTG 


ATAATCACAT 


TGTTTATCTT 


ATTATTCATT 


GAATTAGTAG 


AAGTGAGAAT 


360 


ACCATATATC 


GATTTAATGA 


ACGTTTCCGC 


JV X.y X X x m IV fTV» 

AACAAATATG 


iv iv iv fW'miiiiiv mm 

AAATCTTATT 


TATtTTTGGAA 


420 


AGTTAACCCT 


GCAGGCAGTA 


TTACTTTAAT 


GATGAGTATT 


TCAGCATTTG 


TTTTCTTGAA 


480 


AAGTGGCATT 


CATTTTATTT 


TATCTATGTT 


TAATAAAAGC 


ATATCAGATG 


ACATGCCAAT 


540 


GCTGACATTT 


GATAGTCCAG 


TAGGTATTTC 


AGTATATTTA 


GTGATTCAAA 


TGTTATTGGG 


600 


TTATTTTTTA 


TCGAGATTTT 


TAATCAATAC 


TAAACAAAAA 


TCCAAAGATT 


TCTTAAAGAG 


660 


TGGCAATTAT 


TTTTCAGGAG 


TTAAACCTGG 


TAAGGATACA 


GAACGTTATT 


TAAATTATCA 


720 


AGCAAGACGC 


GTATGTTGGT 


TTGGATCGGC 


ATTAGTTACA 


GTCATTATTG 


GTATACCGCT 


780 


TTATTTCACA 


•l~lTjl"l"ITi"rAC 


CGCAlI lATC 


m IV / ■HIS"* IV TV IV 1 1 M 1 1 

TACTGAAATT 


TATTTCTCAG 


miv yiv x /'vii/^ tv m 

TACAACTGAT 


840 


TGTATTAGTT 


TACATCAGTA 


TTAATATTGC 


AGAAACAATT 


CGTACATATT 


TATATTTTGA 


900 


TAAATATAAG 


CCATTTTTAA 


ACCAGTATTG 


GTAAGGAGGT 


AATTATGAAA 


TACTTTATTC 


960 


CAGCTTGGTA 


CGATGACCAA 


CGATGGTGGC 


AAGACACGAC 


TGTGCCGTAT 


TATCAACTAC. 


1020 


AAAATAAGAC 


GGAATTTGAC 


GATATGATTA 


GTTTAATGGG 


AATGCACCTT 


GAAAATGACT 


1080 


TAGATTATCA 


ACTGATTGTT 


CTCAATCATG 


CACCAAATTT 


AAGAACATTT 


TTACATCGAT 


1140 


ATGACTTATA 


TGAAACAAAG 


TATTCGTCTG 


TGTTTGATGA 


AATTCAAGGA 


TTCAGTCACC 


1200 


ATGCGCCACA 


AGCGATTAAT 


m m fT\y > fiM x ym 

TATCATCACT 


ffiiv iv iv iv fnr^oivTr^ 

TAAAATGGNC 


GGATGATGTT 


GGAGTNTGGG 


i t c n 


TACACGCCCG 


NATTTATTAA 


AAATGTGTGA 


r*f~* » tv iv /"* iv 

CGAGTGAACA 


GACCTATACA 


MKmH ItVTI/TT X fTVT> 

NATATTTATT 




TNAGNCAAGA 


AGGGTATTCA 


NTTGGGTTTG 


»•¥ n » X Tiny iv nvn 

NAGANTGATT 


\Tn iv iv iv ^> iv iv m 

NGAAAGAGAT 


CAAGTNACAA 


1380 


CVAAAii xATA 


TTTTTGATGA 


/"*x. yx. /"v 'Mwnx rn 


mm n nv* IV fT*f* TV 

x 1 A IvJiGCoA 


TACGTTATTT 


TTaATGATCAG 


144U 


GG AJjAGW- 1 1 


L/1 1 ACuAAlJA 


TV T»TV I IWI" !• TV JH-y» 

AlAxT. 1 AAUG 


GAI 1 AAIvtGA 


GA1 PGTGTAA 


/"*rp*TV"' TV mpn x x 

CTTCATGGAA 


1 jUU 


r» ATHYSPAlvIA & 




iuAL lu 1 A Xv_ 


KTTAlVTIV JV /^»/^» IV T> 7k 
NAIMAAUurAlA 


X 1\JAA<JA1\JA 


pm» TV^* x x y x x 

L1A lXJAAwvx 


IddU 


Af* lk.CZ A 

ALAunnlnln 


ap a at* ATYinr* 


*F*PA ITT 
iVAAL 1AA1 1 


UUlUiuAAAAA 


TTTPiai^raa 

1 1 lvJUttiLnA 


luAl IvsUALA 


1DZU 




f^Ti AflATnATT* 
unnVinluAli. 


nXVXUt/xX XOX 


UUV. X Lwlun X 




A TTY* A O A A A T 
A X\.k7AL~AAA 1 


IDDU 


AGCCAATCAT 


ATTCCAGCGA 


AATTGTTAAG 


TTATTCATTT 


TTTAAAAATA 


GAAATGAANC 


1740 


TGTGTCAGAT 


GAGGAATATC 


AATCTATCGT 


AAAGAATGCC 


CATTTAATTG 


TTGATAGTGT 


1800 


GCAACTAGAA 


CGTGATTTAA 


TTAGTCATCA 


AGAGAAGTAT 


CAGCGGGAGA 


ATACAATGAT 


1B60 


TCGAATCACA 


CCATTTGAAA 


CGAGACAATC 


ACCTAATATA 


AGTAGTCAAT 


TGATGGAAAC 


1920 


ATTTATAGGT 


GTATGGATAG 


ATGGTATGAG 


TGACGCTGAT 


TTGCAACAAA 


TGATGCAACG 


1980 


ACTTGTGGAT 


TATATCGCAC 


AGGAAGATTA 


TTACCGTTTA 


ATTTTATTAT 


CGCGCCATCA 


2040 


AAATGACATA 


CCGATGTGGC 


TTCGTGAATG 


TATTACGTCG 


GTAAATGAGG 


AATACCAAGC 


2100 


TAAACAGAAT 


GCGGATGTTA 


ATGTTTCAGC 


ATTAATGACA 


CCTGAAGATC 


AAGATGACAT 


2160 


CATTGCTGTT 


AAGACGATAC 


ATGCTGAACA 


TGATGTTGTA 


GAAGCATTGC. 


GGACGTTGCG 


2220 


ACTTGTGATA 


GATATGTCAA 


AAGAACCTGA 


TTTGTATTTA 


CAAATTAGTG 


CAATTAGCGC 


2280 
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TGGGATTCCA 


CAAATTAATG 


GTCAACAAAC 


AGATTACGTC 


TCTGATTATG 


ACAATGGCCG 


2340 


TATTATAAAT 


ACAGCTGGAT 


GAATTAGATG 


ATGCGTTAAA 


TTATTATTTA 


TTTTATTTGA 


2400 


AAAATTGGAA 


TTATGCGTAC 


GCCTATTCTT 


TAAAATTAAT 


AGATGCATAT 


GCTTCTAAGA 


2460 


ATATTATTAA 


TCAGCTCGAT 


GAGTTAATAG 


AAGGTGAAAA 


TGATGCCACG 


TAAATTTAGA 


2520 


GTTTTGCAAA 


TTGGAGGAGA 


CGATTTAGAA 


CCTATTTTTC 


AACACAAAAA 


AGGTGTGAGT 


2580 


TGGGAl 1 ACT 


TCGATATTGG 


ATTGTTTGAA 


TTTGATAGTG 


GTTATGTAGA 


GGCTATTGAA 


2640 


GCGATTGTTG 


AAGCAGAAGG 


GCGCTTTGAT 


TTTATCTATA 


TTCAAGCACC 


ATACTCGGAG 


2700 


ACATTAACGA 


ATTTATTACA 


AATGATAAGC 


GAACCATACA 


ATACGTATGT 


TGATGAATCA 


2760 


TTTTGGTCAG 


TTGAATATGA 


ACAAGACGAA 


AATTGTCCAA 


AAATACGTTG 


TTCAACCAAT 


2820 


TACATTACCG 


GAATATTGGA 


AGGAACGTAA 


TAATNAATTA 


GAGGCAGGNT 


AGCTTCTCAA 


2880 


GGACAAGGAT 


TGGAGATAAA 


GTTTCCCCTA 


AGTTAGCACT 


TGTGCATCCG 


AATTTTAAAG 


2940 


GAGATGTCGA 


ATACCCAAGG 


TAATTCAAGA 


GCTCACGTTG 


AGTGGAGAAT 


TTGGAAAAGA 


3000 


ATTTAAACCT 


ATCGCATCTT 


GNCAAAATAA 


TCTCGTTTAC 


GATAAAGATA 


AAGTCATTCA 


3060 


MATATGGCCA 


GAATTTGATA 


TTGATGGTGC 


GGTCGAGTTG 


CAATATACAT 


TTAGATTGAT 


3120 


TCAGACTGGC 


GCTGATGGTG 


CATTAATTGA 


ACAAATCATA 


TTAACTGATG 


ATATGTTAGA 


3180 


CAGTCCTTTA 


GAGATACCTG 


CGAAACCATT 


TGATGCTTAT 


ATAAGTGTAA CTGTTAAGGC 


3240 


GCGTGGGAAC 


GGGACGGTAC 


ATTTAGGACC 


TAAACACACA 


CGATGGTCCA 


GANTAGANAN 


3300 


GGNTCAATTT 


TTACNTGGTG 


GGAATCGTTT 


CGAAGATANN 


CNCCGNCNGG 


AATTTAATTA 


3360 


TTAATCCCNC 


CCTGGTGATA 


TGANACCCCC 


ACTAAACGTA 


AATTTTAGTG 


GTTATCGANC 


3420 


ACCGGAAGGT 


TTCGAAGGAT 


ATTATATGAT 


GANACGTATG 


AATGCNCCGT 


TTTTACTTAT 


3480 


CGCCGATCCT 


CGTGTTGANG 


GTGGTAGCTT 


TTATATCGGT 


TCATCTGAAT 


ATGAACAACG 


3540 


TGATTATCAA 


TGTTATTGAC 


GAGACAATAT 


TCATGCTGTA 


AATCCAACAT 


CAGGTGCAGC 


3600 


TGGTAAGTAT 


CAATTCTTAC 


AATCAACTTG 


GGATTCAGTA 


GCACCTGCTA 


AATATNAAGG 


3660 


TGTATCACCA 


GCAAATGCTC 


CTGGAAGTGT 


TCAAGATGCC 


GCAGCAGTAA 


AATTATATAA 


3720 


CACTGGTGGC 


GCTGGACATT 


GGGTTACTGC 


ATAAGCCATT 


TATGCATAGC 


TAATCAATAG 


3780 


TTATATAAGT 


AACTTTTAGA 


TCGGAATATA 


TCGGG 






3815 



(2) INFORMATION FOR SEQ ID NO: 1078: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.1078: 

GGCNCGAGAC GCTTAATTAT GGAACNCCAA GATGCNATTG GACNCGGTTA TAACATTGGN 60 

ACAGGCACTT TTACTAATTT ATTAGAGGTT TATCGTATTA TTGGTGAATT AAATATGGNA 120 

AATCCAATCG AGCNTTGAAT TTAAAGGAAG CACGAAAAGG AGATATTAAC CATTCTTATT 180 

GNAGATATCT CTAACTTAAA GGCATTAGGA TTTTGTGCTC CTAAAAAATA CAGTAGAAAC 240 
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AGGTTTAAAG GATTACTTTA ATTTTGAGGT AGATAATATT 


GAAGAAGTTA 


CAGCTAAAGA 


300 


AGTGGAAATG 


TCGTGAAAAT 


GACATTGAAG 


CTGTCCATAA 


TAATAAGGGT 


TATGCCTATC 


360 


AAAGAAAATT 


AGATGCACTC 


GAAGAAGTGA 


GAAAAGGCTA 


TTACCCAATT 


AAACGTGCGA 


420 


TTGACTTAGT 


ATTAAGTATC 


GTTTTATTAT 


TTTTAACATT 


TCCGATTATG 


TTCATATTCG 


480 


CCATTGCTAT 


CGTCATAGAT 


TCGCCAGGAA 


ACCCTATTTA 


TAGTCAGGTT 


AGAGTTGGGA 


540 


AGATGGGTAA ATTAATTAAA 


ATATACAAAT 


TACGTTCGAT 


GTGTAAAAAC 


GCAGAGAAAA 


600 


ATGGTGCGCA ATGGGCTGAT 


AAAGATGATG 


ATCGTATAAC 


AAATGTCGGG 


AAGTTTATTC 


660 


GTAAAACACG 


CATTGATGAA 


TTACCACACC 


TAATTAATGT 


TGTTAAAGGG 


GAAATGAGTT 


720 


TTATTGGACC 


ACGCCCGGAA 


CGTCCGGAAT 


TTGTAGAATT 


ATTTAGTTCA 


GAAGTGATAG 


780 


GTTTCGAGCA 


AAGATGTCTT 


GTTACACCAG 


GGTTAACAGG 


ACTTGCGCAA 


ATTCAAGGTG 


840 


GATATGACTT 


AACACCGCAA 


CACAAACTGA 


AATATGACAT 


GAAATATATA 


CATAAAGGTA 


900 


GTTTAATGAT 


GGAACTATAT 


ATATCAATTA 


GAACATTGAT 


GGTTGTTATT 


ACAGGGGAAG 


960 


GCTCAAGGTA 


GTCTTAATTT 


ACTTAATAAG 


TTCAAATAAA 


AGTTATATTT 


TAAAGATTGT 


1020 


GACCAATNGT 


TACAGTNTAA 


CGAGGANTCC 


CTTGNGACNG 


TATPAAATRG 


PATTAAARAA 


1080 


ATATGTNCCA 


TCNTTTGATT 


TGCNTGGCCA 


ATAAATACTA 


X XV. A J- V— 1 lun 


TRARATAARP 


1 1 Ad 


CATGTTAAGA 


AATTGAAAGT 


ATAGCATTAA 


NGGGGTTTGT 


A T^CIxCifVPCl A A 


nAX lnlnlnl 


luUV 


TGTATTACTA 


AAGCAGACAA 


TGGTGGTGCA 


CAAAACACAT 


PTPATTPAAP 


TPRPPAAPPA 


1260 


TTTTTGCGTA 


CACCATGATG 


TTTATGTCAT 


TGTAGGCAAT 


PATGGAPPAA 


TGATTRAAPA 


1320 


NCTAGATGCA 


AGAGTTAATG 


TAATTATTCT 


CGAACATTTA 


GTAGGTCCAA 


TTflArTTTAA 

X X UXuw XXX Jul 


1380 


ACAAGATATT 


TTAGCTGTCA AAGTGTTAGC 


ACAGTTATTC 


TPGAAAATTA 


AGCCTGATGT 


1440 


TATCCATTTA CATTCTTCCA AAGCTGGAAC GGTCGGACGA 


ATTGPGAART 


TPATTTPRAA 


IJUw 


ATCGAAAGAC 


ACACGTGTAG 


TTTTTACTGC 


GCATGGATGG 




ARRRTRTT A A 


1 RfiO 

X JDv 


ACCAGCTAAA 


AAATTTCTAT 


ATCTAGTTAT 


CGAAAAATTA 


ATRTPATOTA 

rVX O l\«Alu X n 


TTAPAGATAG 


1620 


CATTATTTGT 


GTTTCAGATT 


TCGATAAACA 


GTTAGCGTTA 


AAATATCGAT 


TTAATCGATT 

x x nn x w>wn x x 


1680 


GAAATTAACC 


ACAATACATA 


ATGGTATTGC 


AGATGTTCCC 


GCTGTTAANC 

ww x \j x x nmi\> 


AAACGCTAAA 


1740 


AAGCCAATCA 


CATAACAATA 


TTGGCGAAGT 


AGTTGGANTG 


TTGPP T AAT A 


AACAAGATTT 


1800 


ACAGATTAAT 


GCCCCGACAA 


AGCATCAATT 


TGTTATGATT 


GCAAGATTTG 


CTTATPCAAA 


1860 


ATTGCCNCAA AATCTAATCG 


CGGCAATAGA 


GATATTGAAA 


TTAPATAAPA 
X X A\~n, x nnUn 


fiT A A TP A1Y5P 




GCATTTTACA 


TTTATAGGCG 


ATGGACCTAC 


ATTAAATGAT 


x\j 1 VnVA, V-/\^ 


pa ARTTRTAr* 

Uinu X X u X AL. 




AAGCTGGGTT 


AAGAAAATGA 


TGTCACATTT 


TTGGGCAATG 


X V_,rV X lAnlvv 


R ARTP ATTT A 
X wi X X X n 




TTATCACAAT 


ACGATACGTT 


TATTTTAATA 


AGTAAGCATG 


AAGGTTTGPC 


AATTAGPATT 

nn x x ^nvjVvXi x x 


2100 


ATAGAAGCTA 


TGCCTACAGG 


TTTGCCTGTT 


ATAGCCAGTC 


ATGTTGGCGR 


TATTTPAGAA 


2160 


TTAGTAGCTG 


ATAATGGTAT 


ATGTATGATG 


AACACCCAAC 


CCGAAACyPAT 


TGCTAAAGTP 


2220 


CTGGAAAAAT 


ATTTAATAGA 


CAGTGATTAC 


ATCAAAATGA 


GTAATCAATP 


TAGAAAAPGT 


2280 


TATTTAGAAT 


GTTTTACTGA 


GGAGAAAATG 


ATTAAAGAAG 


TGGAAGACGT 


TTATAATGGA 


2340 


AAATCAACAC 


AATAGTAAAT 


TACTAACATT 


GTTACTTATC 


RGTTTARPRR 


X X X X x e\ X X V_- jt\ 


2400 


GCAATCTTCG 


GTTATTGCCG 


GTGTGAATGT 


TTCTATAGCT 


GACTTTATCA 


CATTAPTAAT 


2460 


ATTAGTTTAT 


TTACTGTTTT 


TCGCTAACCA 


TTTATTAAAG 


GCAAATCATT 

v\^mvi x w<i x x 


TTTTAPAGTT 

X X X X #*V*-*kVJ J, 1> 


2520 


TTTCATTATT 


TTGTATACAT 


ATCGTATGAT 


TATTACGCTT 


TGTTTGCTAT 


TTTTTGATGA 


2580 


TTTGATATTT 


ATTACGGTTA 


AGGAAGTTCT 


TGCATCTACA 


GNTAAATATG 


CATTTGTAGT 


2640 


CATTTATTTC 


TATTTAGGGA 


TGATCATCTT 


TAAGTTAGGT 


AATAGCAAAA 


AAGTGATCGT 


2700 


TACCTCTTAT 


ATTATAAGCA GTGTGACTAT 


AGGTCTATTT 


TGTATTATAG 


CTGGTTTGAA 


2760 


CAAGTCCCCT 


TTACTAATGA 


AATCGTTATA 


TTTTGATGAA 


ATACGTTCAA 


AAGGATTAAG 


2820 
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GAATGACCCT AACTATTTCG CTCGTGCC 2848 
(2) INFORMATION FOR SEQ ID NO: 1079: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079: 



GGCACGAGCG 


CACATCAACT 


GTTTAAAGAT 


AAATATACGA 


GAGAAATTAA 


TACTATGAAG 


60 


TATTACAATC 


TGTTGAAGTG 


AGGAGATAAT 


TATGAAGCGA 


TTATTCGATG 


TAGTGAGTTC 


120 


AATATATGGT 


TTAGTAGTTT 


TAAGTCCGAT 


TCTGTTAATT 


ACAGCATTAC 


TAATTAAAAT 


180 


GGAATCACCT 


GGACCAGCCA 


TTTTCAAACA 


AAAAAGACCG 


ACGATTAATA 


ATGAATTGTT 


240 


TAATATTTAT 


AAGTTTAGAT 


CAATGAAAAT 


AGACACACCT 


AATGTTGCAA 


CTGATTTAAT 


300 


GGATTCAACA 


TCGTATATAA 


CAAAGACAGG 


GAAGGTCATT 


CGTAAGACCT 


CTATTGATGA 


360 


ATTGCCACAA 


TTATTGAATG 


TTTTAAAAGG 


AGAAATGTCA ATTGTAGGTC 


CTAGACCAGC 


420 


GCTTTATAAT 


CAAGGATTAC 


GAATTAATCG 


AAAAACGTAC 


AAAAGCGAAC 


GTGCATACGA 


480 


ATAGACCAGG 


TGTGACAGGA 


CTAGCTCAGT 


GATG 






514 



(2) INFORMATION FOR SEQ ID NO: 1080: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080: 



CCCCAAAGAA 


ATACANTTGT 


TGGTAGAAGA 


ACCCCCAACC 


CACAAGNTGG 


GGACCCTTCA 


60 


TTTCCAACCN 


TATGGCGGAG 


GGTTAAAGTA 


AATTCTTGAA 


GCTTTGGTTG 


CCCNCAAATA 


120 


ATATGATATT 


TCCGAAATGG 


GCCGGAGCCG 


TATCCAACAA 


TTTAAAATCT 


GCCGGAGGTA 


180 


TTAAAGCATT 


ACTTAACCGC 


TCCGATTAAG 


ATGNNATTGA 


AGGAGAGATT 


AATATGGTAG 


240 


AATCAATGCT 


AACTTTTATG 


CTTGGGCCAT 


TAAGACAAAT 


CNCTGATTTT 


TATATGGAAC 


300 


ATTTACTCGT 


AAGTAATTCC 


ATCGTCATTG 


CAGGTTATTT 


TGCGACAGCT 


ATTTTTAAAA 


360 


AGAAAAAAGT 


TGTGAATTAA 


ATCANATTTG 


AGGTGATTTA 


CAAGTGAAAG 


CATTGAAATT 


420 
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ATATGGCGTA GAAGATTTAC 


f^?PATf2Afif2A 

VJVJ 1 A X\3AVJ\7A 


TAATRAAAAG 


CCAGTCATTG 


AAAGTGCGAA 


480 


TGACGTTATT 


GTTAAAGTAC 


PAf2PP.APTfiP. 


PATATP.TP.P.T 


TCAGACACGT 


CACGAAACAN 


540 


AAANATGGGG 


CCATACATTA 


AAGP/TATGPC 


ATTTGGTCAT 


GAATTTTCAG 


GTGTAGTAGA 


600 


TGCCATTGGA AGTGATGTTA 


CGCATGTTAA 


TGTGGGCGAC 


AAAGTGACAG 


GTTGCCCAGC 


660 


AATACCTTGT 


TATCAATGCG 


AGTAGTGTTT 


GAAAGGTGAA 


TATGCACGAT 


GTGAAAAAGT 


720 


TATTCGTCAT 


TTGNCTCAAT 


ATGAACCTTG 


GATCGTTCNG 


CGGAATATGT 


CAAATTCCCA 


780 


CCCCAAAATG 


TTTTNAAGGG 


TTCCAAGCCA 


ATGTTGATTA 


CCATTGAANC 


AACCAATGGT 


840 


TTGAGCCATC 


AGCCCGTTGT 


TP^GPATGGG 


TTTTATAAAT 

X X X X *» X X 


CCGAATANAC 


AACCTGGTAT 


900 


GACTGTTGCA GTAATGGGGT 


PTPP.P AC2TAT 


APjPjTTTPjTTA 


RPTATTPAAT 

X AX X WU\ X 


nnnr APfi AA T 




ATTTGGTGCT 


GCACATATCA 




TATAP-ATfiPfS 


PATAAAPTAfi 
V,* A x nnn\. x Av> 


ATATTGCAAC 
a x .rt. x x vJwvnL 


1020 


ATCATTGGGC 


GCACATCAAA 


PAATPAATTP 


AAAAGAAGAA 
aaaauaauaa 


AATCTTGAGA 

AA X \«- X X \3fl\Jfl 


AATTCATCGA 


1080 


AAATCATTAC 


GCCAATCAAA 


TPf2ATTTAfiP 


TATAGAATPA 




AAGTTACGAT 

/i/vj x x nvun x 


1140 


TGGTCAAATA 


TTGACGCTAC 


PTA AAAA Af2P. 




PJTATTAPTPP. 

\J X #V X X X \»u 


GAATACCATA 


1200 


TGATGATATT 


GAGATTGATC 


bUul xv-Ax l -I 


1Y2AA AAAATT 
lwuvinnAl 1 


v— luv>u 1/lnv.u 


AGTTGAPAGT 
nvj x x ununu x 


1260 


ATGTGGCTCT 


TGGAACTGTT 


Ivj IVLAb 1 AA 


111 l\-WOVjVJV_ 


A A APAPTPPA 
nniiunu luun 


PP^PPA APPTT 
LUUV.MAL.V 1 X 


xOaU 


ACATTATATG 


AAGACGAAAG 


AlAi. IAAIajx 


A A A PPPT A TT 


Al 1 IT* XV- AX 1 


111 1AIAA?1 1 




AAGAAAAAGG 


CCCGGAGACA 


111 lunlnAA 


Hwpii 1 ^Wl' A AT*A 
1 Inul lAAin 


AP.A AAP.APPP. 


A TTTP A T A A A 
AX X lunlAAn 




GTCATGTTTA 


CGATTTATTA 




TTGARP.APGA 


AAAPGPTGGT 


ATAGTTATAG 
Ainux xx^x no 


1500 


CTATGAAAGT 


GCGAATGCCG 


TPfY^TPTAP 
1\» 1 1A. 1 Av_ 


APJATAPTATP 
AVanlAl. X AX V- 


GAAATAATTC 


ATPTTPGAAT 

f\ X w X X \~*Jr\t\ X 


1560 


ATACGTTGAT 


AAATAGCCGG 


nwT**pRP'1 M l'f!i'l 
1 1 InL 1 lu lu 


Tf2 A A A T A TflP 


X X\3XUAAX\»U 


P.TTP.TTTTGP 

\3 X X \JJ X X X X \3\_> 


1620 


ATTTTGTATA 


CTTAAAATGA 


fiATPJVA ATA 


TTTPJ A 1* A A TT 
1 1 lUAlnnl 1 


TTTAAAP.TP.A 

XXX AAAV7 X V3A 


AAATPAAGTA 

Af%A X Vm^xVI&VJ X a\ 


1680 


CAGCCACTTA ATAAGATAAA 


TTTATTATAA 


T ATATPPTA A 
InlAiuulnA 


A ATP. ATPJfiP A 

AA x UA I A 


OTA ATA A TP- A 

UlAHlAAluA 


1 7AH 

J. / %LF 


ATTTGAAAAA 


GAGTAAACAT 


TAATACCTTT 


AALAAI 1 1AA 




r*TT A A TP. ATT 


JLOUU 


AACTGCATGG 


CAAAACAACT 


1 AuAA IvtLj 1\- 


AP.TT APWN A A 
/VV7 X 1 AV-IMIMAA 


AXAV»AX X X X X 


ATAAAAAATT 

AX AAAAAAX X 


1860 


ATCACACTAT 


TGTGACAACT 


ATCTTTGGAT 


TAATAAAAGA 


GGCAAGTGAG 


CAATAGGTTA 


1920 


GGCTTATGTG 


CGGGCATAGG 


TCAGTAATGT 


AGAAATGGAA 


ATGATGTAAT 


ACGTTAATTC 


1980 


GTTTGAATCT 


TTAAAAGTAT 


TGGATAAATC 


AGATAATCGC 


CTGTTATACG 


CATCAATCAA 


2040 


TGTTTTATAC 


ATAAATATGT 


CTGATATTTC 


TAAGTCGTTA 


TTTTTGAATG 


GAATAAATGG 


2100 


AATCTTACCC 


CACGACCCTG 


TACTAAAATG 


CGTTTTT 






2137 



(2) INFORMATION FOR SEQ ID NO: 1081: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081: 



TCATTTTTTT GATTGAGCCA AAAGGAGTCC CCCCTGTGAG CGGAATATTT AATTTATTGA 60 

919 



WO 97/30070 



PCT/US97/D2318 



GCTATTTAAT 


ATTAAACGTA 


CGCTTAACCC 


CCCTANAGTG 


ATATCtfrTTC 


TAGCGTTGTC 


120 


ATTATCATTA 


AGCGAAACAT 


TTTAAAGACA 


AATACACACT 


GTACGATCAC 


CAAACTGCAT 


180 


GTCGAACAAT 


GTAACATTTG 


GATTCGATAT 


TTAAAATTGC 


TTGTGATGAT 


AAACTTTCTC 


240 


ATTTAGAAAA 


CGCTTCCACG 


TACATTCAAA 


AAAATAACTT 


TGTTAACCAT 


ATTGTAACAT 


300 


TATTTCATAT 


ATTTTGGGGC 


ATGAGAATGA 


TTCTCACGCC 


CAGTAATTTA 


TTTATGCAAT 


360 


TGTTCATGTA 


GGTTCTTTGC 


GACGTTTTCA 


GGAATACCTA 


TATTTTTAAA 


ATCTTCAAGT 


420 


GTAGCTCGTG 


CCTTCATTTT 


CTTGATTGAA 


CCGAATGAAC 


GCAATAATAA 


TGTTTTAACG 


480 


TTTGTTACCG 


ATACCATCTA 


TATCATCAAG 


TATTGATTTC 


AAGCCTGTCT 


TTTGACGTGT 


540 


TTGt'C'rATGA 


AATGTGATTG 


CGAATCTGTG 


AACCTCATCT 


TGGATACGGA 


TGCAACAAAT 


600 


AAAATGCCTG 


CCTATTTTTC 


TTCAGTGGAC 


AATTTCCTGC 


ACTAGCGCCA 


TAATAATAAT 


660 


TCAGATGTTT 


GGTGTTTATC 


ATTTTTCTGC 


AAACCTGCAA 


CAGGGATATC 


AAGACCTAAT 


720 


TCGTTTTGTA 


GCACATCAAT 


AACCCCGTTC 


ATATGTCCTT 


TACCACCATC 


GACTATTATT 


780 


AAATCAGGTA 


ATGGTAATCC 


TTCGTTTAAA 


ACGCGAGAAT 


ATCGTCGTCT 


TACTACTTCT 


840 


CTCATTGATT 


TGTAATCATC 


TGGACCTTTA 


ACCGTTTTGA 


TTTTATACTT 


TCTATAATTT 


900 


TTCTTATCTG 


GTTTACCGTC 


GACAAATGTA 


ACCATTGCTG 


ACACTGGATC 


CACACCTTGA 


960 


ATATTAGAAT 


TATCGAATGC 


TTCAATTCTA 


ATTGGTGTTT 


GAATTCCCAT 


TTGTGTTCCA 


1020 


AGTTCTTCAA 


TAGCTTTAAT 


CGTTCTTGAC 


TCATCACGTG 


ATATTAATTC 


AAATTTATTA 


1080 


TTTAAGGATA 


CTTTAGCGTT 


ATGTGCAGCT 


AGGTCAACCA 


TATCTTTTTT 


GGGACCTCGC 


1140 


GCGGGTTGAA 


CGATTTTAGT 


GTCCACAACA 


GATTGAATCA 


TTTCTTTATC 


CAAATTACGT 


1200 


GGTACATGAA 


CTTCCTTAGG 


TAAAATATGT 


TGGTTTAAGC 


TATAAAATTG 


TCCAATAAAT 


1260 


GTATAAAATT 


CTTCTTCTTC 


TGTTTGGTGT 


AATGGAATCA 


TCGGNGTATC 


TCGCTTTATC 


1320 


ATATTACCTT 


GTCGGATTAA 


AGAAAACTTG 


GATACACATC 


CATCCTTTAT 


CAACACAATA 


1380 


ACCAAAGACA 


TCACGAATCG 


GNTTATCTGA 


TGACATAATT 


TNGGGTGGGT 


GGGCAAGATT 


1440 


TGGATATTGT 


TGGATTAAAT 


CTCTATATTC 


TTNAGCCGCN 


CAAAATCAAG 


TGATTCACTT 


1500 


GGAGGNCACA 


TNCGCTCTTC 


TAANCTTT 








1528 



(2) INFORMATION FOR SEQ ID NO: 1082: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082: 

AGGTTGATAC ATATGTCTTG TAAATACATG TGTCATTGCT TGACCAATTG CAGCAATCTG 60 

GACCCAGGAT NGGTGCCTAC CCAGTTATCT CCAAATTTCA CATTTCTGCG TAATCTGGTG 120 

CTACTTGAAA TGACTTTTGT ACCTTAATAG CGGACTTCTG TCATAAAATG TGCATCCGGA 180 

GTACGTGTTA AAGGTACATT AGAGCCCCAC ATAATAATGT ATGATGCGTT ATACCAGTCA 240 

CTTGATTCAG GCACATCTGT TTGCTCTCCC CAAATTTGTG GAGAGGCAGG TGGTAAATCT 300 
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GPATAPPAGT 


P ATA AAA APT 
UlliwlnnnL X 


a a fSP ATTTP A 

AAOVAX X IWt 


CPACPAAttPA 

V* V- AV» wUVJ\*/V 


AATTRATRAA 
AAX IVJAluAn 


TTT3 A nr* A PPT 
XVV3AOLALL 1 




GPTRCATAAC 


TAATPATTRA 


PATpRpTfifiA 

V»A X VVJV* X OUA 


ATARRTfyPAA 

AlAUVll VJXAA 


ATPPTfJPO A T 
A X L V- X VjV-VjA X 


TPP.ATPTRRA 
XVuAXV luun 


Aon 




TTATTY2TATA 
X XAX lUlAln 


PARTA ATTRT 


RPTPpPaTTA 
UV< luUun X X A 


TPTPTPTA AP 
IV XV IV* 1 AAL 


vj IV 1 1 1VLAA 


4QA 


1 I 1V\AAL\»LA 


Ub 1 vjLLV IVL 


LA 1 AlJL 1 VAaXj 


GCTTGCTTAT 


ATTGTTTGGC 


TTTGtCTTCA 


540 


nwrnv 1 ! a a a 
1 1 X IVAAV^An 


X ALALvjLV LA 




PPRI^PAPP AT 


ASj 111 IV 1 IV 


TAATGCTTCA 


600 


GTCPATAAAT 


PPP x n A fiTTT 


TPPAPT3AATA 


T ATPRAT ATT 
XnluuAlAX i 


TRATTPn A AP 
IvxA 1 IVAaAAb 


l\jv»AlTGTAT 


cert 


TPATAPPAAH 


Af3 A ATR APfSP 
iivjAfi x OLrvv-VjV.. 


nCi« x v,Vj xvion 


P ATPPTPTf^i 
LA Xv> X V.V7 


vj 1 1 LAI Al IV 


ALjvjVAI ATlL 


Tift 

720 


GGACCACAAC 




AP/PTTP.TTP. A 

AO XXX VJ X X Van 


TTTTPPP JiClfZ 
X X X X V.V»V«AV3V7 


TA ATP AP APP 
X AA X VAL ALL 


Al 111 1VALA 


TO A 

faU 


A ATAPTTTPP 
nn x al xxx v.. v. 


A ARA APATPA 


PPPTP.T A P Afi 
uLv X VJ X AL AV3 


TTA APAPPAT 
1 lAAUiLLAl 


Vj 1 Vj 1 i\j 1 IV 1 


1 AlTTlTTTA 


840 


TPPTPPpTpp 


AALvj 1 IV 1 V- X 


Vjl ALA1 1x11 


TPPP A wnpmp 
iVLLA 1 1\. IX. 


TACTTTTACT 


TTCTAGGATC 


900 




LAI 1AAA1 x X 


1\- 1\j 1 ITjvjL 1 


XaAAGaATTC 


A A rrw* A A siniri 

AATCCAAATT 


TTCCCATATT 


960 


t a t a tpptpp 


lALulAlAAA 


AAAlALuAlu 


luiAunluiv. 


GTGTTNTTAA 


ATACTTTAAA 


1020 


A TRT*PP A AR A 


f*TATTf2PTTT 
V.1A1 l\jV XXX 


A ATTAPHTTP 
AA1 XAviAl Iu 


1 ALA1 11111 


^*AOA A ATlAf^A 

lalaAATATA 


AAATATTAGG 


1080 


fSAATPAPPTA 
ViAA l\*nLV. X A 


ATf*Af**'¥*'l*A A r** 

AX lAAu 


bAAl 1 1 LLw 1 


A IV* A A*PA 7\ /~V~» 

A IvAATAAUj 


GGATTTCATT 


GAAATAATAC 


1140 


APA ATPATPT 
aUAA 1 VA iu X 




TTATTGCCAA 


Iv 1 AAATCGT 


TCaaATTTGl 


CACAACGACA 


1200 


A ATA afMPTT 
AAlAAvTVjLl 1 


LAALALviAAX 


Al A'l'l V1VT1"C 


GGTTGAAACC 


TTACTTATTC 


ATTTATTTTT 


1260 


1A1AAA1 1 Ao 


IvIAlATAAlA 


CTGTATTAGC 


ATCTGCACGA 


ill/ Vl/lffVfl^ m m *a 

TCajGTTGAAA 


TATATGTTAC 


1320 


ATTTTCTTGC 


TCXTTTAATAA 


ATGCATCATA 


GTAATCATAT 


TGCCGACCGG 


AATGATATGT 


1380 


tLtAl IVAjLAI 


GTATCATTTG 




XPKPPPAITA A 

auaGCCaTAa 


LL l ivGTCAT 


ATAAATGTTC 


1440 


AP AR ARP AT A 


aitptipptp at 

AvtvtvjLVj IV A 1 


PTPTAP A APP 


AL. 1 1 AL 1 ALA 


lAAAATTGCT 


vjaGTCGGATT 


1500 


An A A/2TTf2TT 
AVjAAVj X X vj X X 


PA/2APATTTT 
unuALAl 111 


PipjpiTj »pp 
LAulAXAALL 


PlPTHPTiTPP 

LAL 1 AL 1 xL I? 


LL T AT AAT AC 


ik mii mit n/-inwH 

ATATACCTGG 


1560 


TTTPPPPTPA 
X X XVuLL 




viv. IVAJAvj 111 


/™WA A/^A AH1A 

LlLAALAATA 


TTACTTAAAC 


GCCCCTTAAC 


1620 


A AO A A APTPP 
AALAAHIU lVvJ 


mrpm A Af ArT* 

1 1 AAAALALVj 


AlvivJ 1 lVxAAA 


^ A A A fry/Tin 


ATCGGATAAT 


CAATATCTGT 


1680 


ulALlul Ivj 1 


AlvlTiUviLviA 


TAATTTTCCC 


TAAACGU H A 


ACTCCCATAT 


ATATCGCCAA 


1740 


Lvs i ALLALLA 


TTTACTAAAG 




TACGTCATTT 


IU/ W | M | k> »l| kj-» m m 

TCTTCTGAAT 


CTNTNAAGTG 


1800 


appttstap a a 

AvV. X\j X AvxriA 


AA X v» 1V1MLAL 


1 1 IALjCAAv, 


a r^r* A 0^/^TkT»T»^* 

Aliv^L vjLN 1 vj 


UNTAATCCTG 


TTTGCATCGT 


1860 


ARPARPTRpa 

AUUlUw X V3VA 


f2P A PTPf2P A A 
VJVAV, XVVjLAA 


CI A TPTP A C & P 
VxA IVj 1VALAL 


LAvjvj 1 ALAA 1 


ppp A ATA /"•r^/"* A 

OxvaNAlGlA 


ATAGTAAAAG 


1920 


TTPTTTAf2TP 
X XV> X X lAulU 


Xv7 lVAAL 1 x\. 


TTCTTGAACT 


/^/^ATO^ A A A ft A 

LX^NllAAATA 


TTGCTGGGTC 


GCCACCTTTA 


1980 


APPPTPAPPA 
AV*V*V* IVnUv^n 


ALL x 1\J X 1 A 1 


A 1 vAjAvAj IvH- 


nvvwtfnppKPP 

1 vA71 IVLALAj 


ATACATTCAT 


TTAtTI'TTI'C 


2040 


TTRPTfZAATf; 
x x V7V» iunn X vj 


lul 1 1 iuLAl 


ALAjVjV. 1 IT. 1 1 


allaAlaTlG 


ik m ik ik iiMimi/i % ^ 

ATaATTTCAG 


TaGTCAAATT 


2100 


PfiPATATTPT 
UwWilAA X\J X 


AAA A1WPA A^V* 

AAAA1 IAAUj 


^ Kiwnp A pmiv a 

vxAl IvALTxAA 


TlvjATlATAT 


AGTATAACAT 


CCGCTTCACG 


2160 


T ATT A A APtf2P 
XAX XAAAVAjL 


l\^ttJV.L 1111 


1 Avj luAAAI A 


Al ltJGGATTA 


LvTGGTCCCG 


CACCTATCAA 


2220 


GTAAACfTTR 
v nfin^>\> x x vs 


PP ATATTTAT 
UUniAi X 1A1 


PTAPAflAPAT 


A 1 A 1 A 1 ALLj 1 




AAC 1 1CTACC 


2280 


TCATAAACAT 


P/TAPAPAAPP 

V» 1AUALAAUV 


TTP ATP iP/W 
X X Ull V*A\JV? X 


TPTTRTAPA A 
XV 1 1 va 1 ALAA 


TPPT'TiP'nK'Ti'P 

IVLLALj 1 Al 1 


ha a Anrv«A a rrts~> 

TAAATCAATC 


2340 


TTTTGATTAT 

AAA A A A 


GAARrnflAPA 


RAATAPATAT 
UnAlALAlAi 


TP A PP A<"*TP & 
1LALLAL ILA 


L 1 v»l IV LL lv 


TGACAATGGT 


2400 


CCTTGrTTAT 


GPRH AP A A AT 


ATTf2Tf2^A AT 

AX XV3XV7V3AAX 


pppp»npa AiTT 

LVjLvj 1 VxAN 1 1 


GTACCACTTT 


ik ik it it n % 

lAGATAAAAA 


2460 


CCAATCCTAC 


PTPTTTPPPT 

V. XV. X X X 


TTCMPAATAA 
X lunLAniAA 


PPT'P'P'P'P'TVP 
LL 11111 1L.L 


A STTfTiPPPPfn 

AA1 1 AviGGGT 


GTTAATTCAT 


2520 


CTATAGTTP/P 
ini/w x x vj x 


PAPTTTA ATT 
LAL X X lAAl 1 


111 IV 111 1 O 


TT7lL*CATGTA 


TTAC AC CTTC 


TCCACTTCAA 


2580 


a a A 'P'p'Tfi a^^^** 
AnAl 1 1 lALb 


TUCTTGCGCA 


TTGCTAGNTA 


TTGCTTCCCA 


AGGTTCAGCT 


TCGACTGCTT 


2640 


TTTTAGCATC 


CATAATGCGT 


TCAaATAGTT 


CATTTTGTCT 


TTCTGGGTCA 


AGTAAGACTT 


2700 


CTTTTACATT 


TTCAAATCCA 


AGTCTTCTTA 


ACCATGGCGC 


TGTTCTTTCA 


GCATATATAC 


2760 


CTGTTTCGCG 


ATATAGTGCA 


tcaaagcacc 


ACATAATGTG 


NTTACTTCAT 


CTTCTGTTTC 


2820 


TACAGTTGTT 


AAAAATTCAG 


CTTTTTCGAC 


TTCTGTACCA 


CCATTACCAC 


CGATATAGAT 


2880 



921 



WO 97/30070 



PCT/US97/02318 



TTGGAATCCA TTTTCAACTG AGATAATACC AAAATCTTTA ACACCTGATT CAG _ 2933 

(2) INFORMATION FOR SEQ ID NO: 1083: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1412 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083: 



GGCACGAGCG 


TCATACCGTT 


ATGTGCNCGA 


ATGACAAACN 


TCTTATTGGC 


CCTGTTGATT 


60 


AAATCACATA 


GTGAACGATA 


AAAATTACCC 


CCAAGATNNN 


GAAATTGAAA 


CAAATGTTAA 


120 


TNANTTGANA 


CTTATCATAG 


GNATTAATGT 


TTTGACTTAT 


TGCCANANTT 


GATNACCAAN 


180 


TAAAATTAAG 


TTAAAGGAGG 


GACCGTTATG 


TTACATTATG 


TGATTCCAGT 


ATTAAGTCCT 


240 


ATTACTCGTA TTCTTAATTG 


GTTGGATTTT AATAAAGTTA 


ATTATCCGCT 


AATAAATTTT 


300 


TTATATAATA AAAGCTATAG 


TGTTTGTAAT 


ATTCGGATGC 


GGTTAAGCAG 


CCCCTGATAT 


360 


TAAACACTAT 


AGCTTTTTTG 


TTTTTAAATT 


TGTCCTTGTT 


TACGTAATAT 


ATCCTTTACA 


420 


TGTTTAAATA 


AATCATCATT 


ATGATCTATA 


GTTGTTTTAT 


TTTTTATATG 


TTTTTGCTTC 


480 


GTTGAACTTT 


CGTCGACATC 


GTGACTACTA GATAGATTGG 


AATCATTTGA 


ATGATTGATA 


540 


AATTCGTTAC 


AATGTTTTAT 


CTTCCCATAT 


AAATCATTAG 


ATGTTTTGTC 


ATTCATTTGT 


600 


TGAATTCCTC 


CTTCTAGTGT 


CTATATTATG 


CTAAGTTGTC 


TCTCCATTCA 


ACTAATGTTT 


660 


GGATATCTTC 


ACTAGAAATT 


TTACCTTCAT 


TTTCAGCTAC 


TTCAATTAAT 


TCATTGTAAT 


720 


CACTTAAAGT 


GTAAAAAGGT 


AGTTGAATAT 


TGCTAAATGT 


ATCATCTGCT 


TTTTTCAAAC 


780 


CGTAAGTAAA 


GATAGCTACA 


ACACCTAATA 


CTTCTGCACC 


TGCTTGTTTT 


AAGGCTTCAA 


840 


CTGCTGTGAC 


TGAAGATCCC 


CCTGTCGAAA 


TTAAATCTTC 


TATCACAACT 


ACTTTTTTAC 


900 


CTTCACTTTT 


AGCACCTTCG 


ATTTGATTTT 


GGTTACCATG 


ACTCTTATTT 


GATGAACGAA 


960 


CATAATTCAT 


TGGTAATTTT 


AATTTTTCAG 


CAATAAAAGC 


TGCATGTGGA 


ATACCAGCTG 


1020 


TTGCAGTACC 


AGAAATAACT 


TCTACTTCAG 


GAAAGTGTTC 


TTTAATTAAG 


TTAATTAAAC 


1080 


CATCGCGGAT 


TGCGCCTCGA 


ACTAAAGGAT 


AACCTAACGT 


AACACGGGTA 


TCACAGTAAA 


1140 


TCGGTGATTT 


AATACCTGAA 


CTCCATGTAT 


ATAAATCATT 


TGGTGATAAT 


GTTACAGCTT 


1200 


CAATATCTAA 


TAATGATTTG 


GCAAATNCNT 


TTAGCCAATA 


TNCTAACNAA 


CTTTCTTTAA 


1260 


TTTTATGATA 


ACTTTCGACT 


GGATTGTCAC 


TTGGTGTAAT 


CGGTCTACCG 


ACTACAATAT 


1320 


GCGTCGGAAC 


CAAGCTGTCT 


TGGTTCTCCG 


GTGTCGTAAT 


ACGGGGTGGC 


CAACTTGGAG 


1380 


ATGCACCTTT 


AGGTCTAATA 


CCCGGTGGTT 


AC 






1412 



(2) INFORMATION FOR SEQ ID NO: 1084: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1651 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084: 



AAGATGATTT 


AAANATCGGA 


CNGTCAATTA ATCGCTTGNA 


GACCCCAACC 


TCTTGATAGT 


60 


CTGAAGCCAT 


TACAAGAAAC 


ATATTIATTPP 


ATTGCATGCG 


TAAGAACAAC 


CATTCCTGTA 


120 


CCTGCTGTAA 


CGCTNTTTGG 


un^n^ x i-a x \— 


ACPAAATTGA 


GGTCAATAAT 


TCCAACGTCA 


180 


GTTGTTAAAC 


CAAAATCAGC 




TTTAPATTTYJ 

XXX «»hi/1 XXX VJ 


TTTCACTATC 


TGTGATACCT 


240 


GCAAATGGTG 


TTACTTCTGA 


a pprrypAPPT 1 


un lux v-r^J x 


GGATACAAAT 


GAACGTCGCA 


300 


TTTTCAGGCA 


TGCCTATTTT 


ATA Af3T aPRT 
nl/Utul ALU 1 


X XAi.v.ur\xox 


CTAGGAACTT 


TTGTTTAGCA 


360 


CCGAAGAATG 


ATGTCTCAGG 


ryTYJTTT' A A Af2 


AAPATPPAPA 


TTGCTTTTGC 


AGCATCCATC 


420 


GCTGAACCAC 


CACCAAGTGC 


A ATY2 A TTVyr A 


TP'TYSf^T'IWSA 

X \_ X Uw X X 


AATCAACCAT 


CATTTCCAGA 


480 


CCTTTATATA 


CTGTATTAGT 


iunluuul X\» 


lavX X v-Vj/V— X X 


CGCTAAATAT 


TTTAATTTGA 


540 


GGCTGTTCCG 


TTCTTTGTCT 


TA ATAP ATTP 


TP A IPTY^TTT 

X LuTUlXa X ^7 XXX 


TTGTATAACC 


GAATTCTACC 


600 


ATACCAGGGT 


CACAGACAAT 


IWIL 1 X X 1 


X X V* X lul 


CCATTGTTGT 


TAGACTCATG 


660 


ATTGCATTTT 


CTTCAAAATA 


A A TTTY5 APT* A 


nnc a ppttyi a 


AAATTTGAGT 


ATTATTACGT 


720 


CGTTTAGCAA 


TCGTTTTAAT 


ul 1 1AA1AAA 


1 V_ X\j IVViWAL 


TAACATTATG 


TGAAATTGAG 


780 


TTTCTACCGT 


ACGAACCACA 


A PPT A ATY2TT 


AAAf^APfSfSAA 


TCAATTCGTT 


ATACATATCA 


840 


CCAATACCTC 


CAACCGCTGA 


TGGTGTATTT 


ACAAGTACAC 


GACAAGCTTT 


CATTCTTAGT 


900 


CCAAAATCTT 


TTTGTAATGT 


TTCATCTTCT 


GTATGGATAA 


CGGCTCGTGT 


GTCCTAATTC 


960 


CACAGCGAAC 


ATAATTGAGA 


ATAGCAACTA 


TAACGCATTT 


ATAAGCTTCA 


GAGAATTTGT 


1020 


ACCATTCCTG 


AATAATTTTT 


ATTTTAAGTA 


AATTACACAT 


TTTTTAACAC 


ATATAACCAG 


1080 


TAACTTGGTC 


ATTTTTTTAT 


TCACAAAAAA 


ACTCTCTTCA 


TCATGTCAAC 


AAAAACTACA 


1140 


ACATGATTAT 


GAAGAGAGCT 


TTAATTTATA 


TCATCTGTAT 


AAGTACTTTA 


TACAATTTCA 


1200 


ATTATACCAG 


TGATTTTACA 


ACTTGTTGGC 


ATCGTGGACA 


TAGATGCGTC 


AATTCATCAA 


1260 


CAGCACCAAG 


ATCCTCTGAA 


TAGTTCCAAC 


ATCTTTCACA 


TTTTTCACCA 


TCTGCATGTT 


1320 


CGATGACAAT 


ATCACCATGT 


TCATAAGCTG 


TTGCCTGATC 


ATCTAACTTA 


TCTACAACTT 


1380 


TAACTTGTGA 


CACGATAAAT 


AATTGATGTA 


ATGCATCAAA 


TGAAGTTAAG 


AATTCAGATG 


1440 


CATTAAATTT 


ATCGTTACTA 


GCAATCGNAA 


CTTTAGCTTC 


TAATGATTTA 


CCAATAACTT 


1500 


TNNCATTACG 


AGNAGTTTCT 


TAATGCACGG 


NTCACATCAT 


CACGGTNAAT 


TCAATAAATT 


1560 


GTNCGCCATT 


TATCCAATAA 


AGCTTGATCT 


ACTTCCTACA 


ACTTNAGGTA 


TGTTCTGGCT 


1620 


AAGTGAACAC 


TCNTCCTTTC 


TCTCAACTAT 


G 






1651 



(2) INFORMATION FOR SEQ ID NO:1085: 

(i) SEQUENCE CmRACTERISTICS : 

(A) LENGTH: 3386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085: 



GCATCGGTTA 


AATGGCGTCA 


TTCCACTTAC 


GGGCATCTTN 


AGGCATTTTG 


CTTTTCAAGT 


60 


GGCTAATTTA 


GCTTCACCGT 


TTAAGGCTAT 


TTTTAGCATC 


AGTCACTTGA 


GTTAATGCAC 


120 


GGAGTAATGT 


CATCTGGGTT 


AAGCGTTGGA 


TTTGTCGTTT 


GATTGATAAT 


ACCTTCCGCT 


180 


TGAGACACTG 


CATTTGTATA 


TGCTGTTTGC 


TTATCGACAT 


CAGCATCGTG 


GTAGTTCTCA 


240 


CTTGCTTTCA 


CAGTATCTTT 


ATTTGCAATA 


CCTTGTTTCA 


AGTTACCCAT 


TGCTGTATTC 


300 


ACATTTTGTG 


CATTTTTGTT 


TAGTTTGTCC 


AACCTGTAGC 


TAAAGCTTGT 


GCTTGATTGA 


360 


NTTGGTTTCG 


TAATGCATCA 


CGTTGTGGTG 


ATTTAAGTCA 


CGTNACGTAT 


CAAGATTTGC 


420 


TAAAGCATCT 


TGTTTCGCTT 


GCGCTAATTT 


TTCATCACCG 


TTTAATGCAT 


CTTTCGCTTG 


480 


ATTCATAGTT 


GTAACTGCTT 


TTGTTACTGT 


ATCAGGCGCC 


AATACAGGTG 


TTGGTGTGCC 


540 


ATTTGCGATT 


TGTTGCGCTT 


GGTTAGCTGC 


ATTGTTATAA 


GCTTGTTGTT 


TGTCTTGATC 


600 


CGCTTGTGTA 


AAGTCAACTG 


ATTGTGGTAC 


TTGACTATTC 


GCTTGAATTT 


GTTGTGTCAA 


660 


CGTACCCATT 


GCATTATTTA 


ACGCATCAGC 


ATTTTGCTTA 


ATAGCATTAA 


CACCTGTAAC 


720 


AAGTTCTGCA 


TGCGACACTT 


GGGTCTTTTA 


ATGCTGTTTT 


TTGTGGTTGA 


TTTAAGTTTG 


780 


GTAACTGATC 


AATGGCTGTA 


TTTGNATTGT 


CTTTAGCAAC 


TTGTAAGTTA 


TGGTTACCGT 


840 


TTAAATCACC 


CTTAGCTCGA 


GCCACTTGTT 


GTAACGCTTG 


AGCCACTTGT 


TGTGGACCCA 


900 


CGCTTGCATT 


TGGGTGNACC 


ACTAATGATT 


TGGTCCAGCA 


tgcgctaccg 


CATTTGAAAA 


960 


ATGCATTTTT 


GCTTATCAGT 


ATCAGCATTT 


GCCAAAGTTA 


CCATTTGCTT 


NAGGTGGTCT 


1020 


TAATCTGCAA 


TACCTTGTNT 


CAATTGTGTC 


ATTGCTTTGA 


NNTAAATCTT 


TGTGCTGTTT 


1080 


GCTTAATCGT 


GTTNACATTT 


GCAACAGTTT 


GCGCAGCATC 


GACTTGTTGT 


TTAAAGGCAT 


1140 


CTTTGTGCGC 


TGGATTTAAA 


TCATTTGCAT 


TTGTAATAAC 


TTGTTTCGCT 


GCATCTTTTG 


1200 


CACGTTGTAA CATTGTCATT ACCATTCAAT 


GCTTGTTTCG 


GCTTCGTTCA 


CACGTTGCAT 


1260 


CGCTTGTTTC 


CAACTTCAGT 


TTGAACTTGC 


ATTGCTGCCA 


TTTGCTTTAG 


ATAAAATACC 


1320 


TTCTGCATGT 


GAAACCGCAT 


TAGTATAATC 


ATCTTTCTTA 


GCTNGATCTG 


CATCAGTAAA 


1380 


GTTAATGCTA 


TCTAATGTCT 


CAGTTTTATC 


TTGTAATGCA 


TTGTTTAAGT 


TAGTCATAGC 


1440 


ATTATTCAGA 


TTTTGAGCAT 


TTTGCTTAAC 


ATTATTAACT 


GTTGCAATAT 


CTGGTGCTTG 


1500 


TTCAACTTGA 


GTTGTTAGCG 


CTTGTTTTTG 


AGCATCATTT 


AAATGATCTA 


GTGTACCAAG 


1560 


TGTTGTCTTA 


GCATCTTGTT 


TGGCATCAGC 


TAATTTGTGA 


TTACCGTTAA 


GTGCTTGATT 


1620 


TGTGTCATTC 


ACTTTAGTTA 


ATGCTTGTTG 


AATTGTATTT 


GGATCCATTG 


TTGGATTTGT 


1680 


TGTTTGTTTT AACAATTCTT 


CTGCTTGNGT 


TACTGCATTA 


TCATATGCTG 


TTTGCTTGTC 


1740 


AGTATCAGCA TCGTGATAGN 


TCTCGCTACC 


TTTAATTTGT 


GCTTTATTCG 


CAATACCTTG 


1800 


TTTCAATTGT 


GTCATCGCGT 


CATTAAGCGT 


TGTTGCCGCT 


TGTTGAATAG 


CATTGACGTT 


1860 


TGGTACAAGT 


GTTGCTTGCG 


TGATTTGTTT 


GCTGTATTCA 


TCACGTTGCG 


CTTGGTTTAA 


1920 


GTTTGGCAAT 


TGATCAATAG 


CATGTTGAAC 


ATTTTGTTTC 


GCCGTTGCTA 


AGTTTGTATT 


1980 


ACCATTTAAA 


TCATTTTTAG 


CTTGCGTAAC 


TNNATCTAAC 


GCTGCAGGAA 


TTTCGCTAGG 


2040 


TGTAACGACA 


ACATCAGGCG 


TACCACTAAT 


TAATGCTTCA 


GCTTTCGCTA 


CTGCTTGATT 


2100 


ATATGCATTT 


TTCTTGTCAG 


AATCTGCATT 


GACAAAGTTA 


CCATCAGCTT 


TTGTTTGTTC 


2160 


TTTATCTGCA 


ATGCCTTGTT 


TTAATTGTGT 


CATCGCATTG 


TNTAACTCTT 


GCGCCGTTTG 


2220 



924 
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TTTAACATTG 


TTTACACCAG 


CTACAGTAGx 


IbCAl 1 1 


AL x Iaji 1 vj» 1 I 


1 lAAi\jLAl\. 


ooon 
xZoU 


TTTCTGTGCT 


IwilAAuA I 


\JA1 lAUAui 1 


TV 'l"P A AT*I' A IT 


OC X w X X X X 


v_vj xx. x x n x 




•X rrv"*lTVTV* A »/V 


I xvaUCAl 1 Al_ 


UA1 1 iAAlvvi. 


X lU X X IN X VJV* X 


X XxU_ X X 


0 X i\Ain X 1VA. 


9 Ann 


TTLpNNLAAv, 1 


i\-.Ali"l"I"l\ji\j 


1 luLnl 1ALL 


2k fP A TT A RTT 
ALLnl 1AUV. x 


TTITf^fSAAATfJ 
J. Vj luunAn X \j 


fcrp Afpmm»Tf*A^2 
nini x xim^.avj 


74 fin 


CATTTGTAAC 


TGCAGTGTTA 


XAlvA.1 lul 1 


Vj X i.\Jl>lv>v_ X X\J 


a x x nnn x v_~/\ 


PHfia AfiTT AP 
unkanAu X X 




CTGNTGCTNG 


CGTAWJATL. r 




X\~ol« X iu 1X.A 


P A A(TPT apnr 

V_/\rlVJ 1 1ALLV, 


aTT/^P A flT AT 
A l\.UCAu 1 A 1 


neon 


TTAAGTTTTG 


CGCATTTTGC 


TTAATTTGAT 


x lot x rLAJX. 


AU wUj i, A lvjC 


tit-L»CUA I TAA 


zb4U 


TTTGATTAGT 


AACAGCTTGT 


TTTTGCGCAT 


TATTTAAGNT 


Vj IV- 1 AAlviAA 


fT^TA A AO A tp/P 

L.L. 1 AAAGATT 


2700 


GCGTTGCTAG 


TTGCTTCGCT 


*W"¥V»PA A/T 

IT. i AnSJ 1 


1 1 m i Htir* * a. #n^n A * 
1 I XAj/V.1 1ALL 


A rnmm 7. a » f*r* A 
AX 1 InAAuUi 


1111 lAuALb 


z /ou 


TGCTTACTTG 


TCCAGCAGCl 


xviAl xvxAlAA 


P A /■"TVV?!!"' & TV* 
LiAV? 1\^^jvjA 1\- 


1 An XvxAAvtvtvt 


lllul Alj 111 




GATCAATAAT 


ACCTTCTTGCA 


fVTVfy^Hy A A O 

GTTv?I*jAUAO 


CATTATTGTA 


CGCATCCTTT 


11 A 1 IvAjCjAL 


Z00U 


TTGCATCAGT 


ATAATTTTGG 




TTGTCGCATT 


ATCTGCAATA 


CTTTGACGTA 




ATTTGTCCAT 


TGCTGCATCA 


ACA'x"x\i"l"x"x\j 


C1WTTGTTC 


ATTACCTTGT 


GCTTCTGCAA 


3000 


CAGTAGTCGA 


ITuTTGTACC 


AATTGTTTTA 


ATGCCTCTTT 


TTGTGCATTT 


GTTAAATGGC 


3060 


TTAAACCGTC 


AATTGCTGTA 


TTTGCGTGTT 


GTTTCGCTTT 


TTCAAGGTTT 


TGAGTACCAT 


3120 


TAAGCGCTGC 


TTTTGTAGTA 


TTCACTTGAT 


TCATAGCTGC TTCAACTTGA TCTTTAGGCA 


3180 


CGTTCGTACC 


TGTAGATTTA 


TTTAAAATAT 


TTTCAGCATT 


ACGAACCGCT 


TCATTGTATG 


3240 


CATTTTNCTT 


CTCTGGATCT 


TGCATCTTGC 


AAAGTNNNGG 


ACTGGCAAGT 


GTAGTGTCTT 


3300 


TATCATTCAA 


GCTATTTTTC 


AAGTTANCCA 


TCGCTGTTTC 


AAAATCCTGA 


GNGGGNTTGA 


3360 


TTGGTTGGNG 


TTNACTTCAN 


CTACAG 








3386 



(2) INFORMATION FOR SEQ ID NO: 1086: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2334 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:1086: 



CCACCCANCT 


GATTATAATG 


TTTTAGCANG 


AGCTAGACTT 


GGTTGGTTAC 


CATCATATCC 


60 


ACAATTTAAT 


AAAAATAGTT 


TGTTGTTTGC 


AGAAGAAGCT 


AAAGATGAAG 


GCATTGAGTC 


120 


GAATGAGGCA 


ATTTTAAAAC 


GAGCGATAAA 


TGGAAGTTAA 


GTCAAAACAA 


ACGCAATTTG 


180 


CGATAGAAGA 


TCCGGATTTG 


AAAAAGAATC 


ATCCGGAAAT 


CACTGTTTAT 


ATGGCGCTCA 


240 


AATCTAATCT 


CAAGTTCTGC 


AAAAGGTCAA 


GAATACTTTA 


TGAAGCATTT 


ACTTGGCACA 


300 


AAATCAGGGT 


TATTAGCTAC 


ACCAAATGAA 


GATGAAAAGC 


CAGAAGAAAT 


TACGTGGCGT 


360 


GAGGAAACAA 


CAGGGAAATT 


AGATTTAGTC 


GTTTCTTTAG 


ATTTCAGAAT 


GACAGCAACA 


420 


CCTTTATATT 


CTGACATTGT 


TTTGCCAGCA 


GCGACTTGGT 


ATGAGAAGCA 


TGATTTGTCA 


480 


TCT ACAGAT A 


TGCATCCAT A 


TGTACATCCT 


TTTAATCCAG 


CTATTGATCC 


ATTATGGGAA 


540 


TCGCGTTCAG 


ACTGGGATAT 


TTATAAAACG 


TTGGCAAAAG 


CATTTTCAGA 


AATGGCAAAA 


600 



925 
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unL ini 1 InL 


\~ IvMjA/VLaj i r 


TAAAGATol 1 


pinp A PA A pipn 


CACTTAGTCA 


tgatacaaag 


660 


r* & Af? a a aTTT 

VwAAuAAAX 1 1 




\_GGC-G X AG fb 


AAAVjA 1 l\Ai 1 


pp TV llppWTIP T* 

CGAAGGGTGA 


AATTGAAGCG 


720 


1 AV.V. iuunL 


V3 1 ALAA X\rL-l- 


XAAL, n i\atA 


AX IVjIAuAAL 


GCGACTACAC 


TAAAATTTAC 


780 


papa & ATATT2 




TCCTGTACTT 


p A A JA A A/W^A 

oAAAAAIJLAjA 


A A PIUIIPP A PP 

AAGTTGGAGC 


ACATGGTGTA 


840 




X V~ALs 1 uAAL A 


A r PA r PP A AP A 74 

AX AXviAAuAA 


X XAAAAALaXA 


TGTTAGGTAC 


GTGGAGTGAT 


900 


Zi C A A A TT! A Tfl 


A T*IV M TY2 r IY2 A P 
AX X V- X\» 1 \jnu 


a nr*f* a a TT*r*T 

AvjV_ lj AA 1 \- o X 




jk rpjv p tv p»p tv pp 

A X At-AGC AUG 


TAATGTAGCA 


960 


f! ATrsr 1 A A T A C 


ipTi A A TT*i!V* 
XAAvvXAX X 1\. 


A XV. l\jrt_ X ACVj 


A TATVVPTi A AT* 
AAluu X AAA1 


mAmp a P A AAA 

1 A XX. ACAAAA 


ATCATATGAA 


1020 


flATYT'TY^AAf^ 


A ar A A A fTY"2r* 


A AfWWTTA 
AAlurLLu X X A 


A A A f"* Arp AII1IIV1I 

AAAljAl Al X 1 


PVTITV PPP A A PP 

CTALtCGAACG 


TGCTGCTGAG 


1080 


AAA ATTTYITT 


X X XAAAXAXA 


AC X XaJAC AAL 


p A P*0 APR APm 

uAUliAuAAGT 


A TV m Tv PS"V«1S ^*Tk 

AATACCGACA 


GCAGTATTCC 


1140 


LAuul XLAAA 


rpTv » A P A iv pprn 


pp APP ATI A mm 

CGAv-GA I A l l 


p A A fTrfTWI* A P 

CACCA l X X AC 


AACGAATATA 


GAACGTCTAG 


1200 


1 ALA* 111 XAL? 


A. AP AmrpA A P A 

AA\JA1 lAALn 


pp a /VfHP AAA 

GGACu lv_AAA 


GTTATTATGT 


Pl^t TV TV r^f^ m Tk 

GGATCACGAA 


GTTTTCCAAC 


1260 




uALit* X 


VjXAX axaaac 


pp A P A n*PRPP 


PPP A A rnppfP A 

GCCAATGCJrA 


tttgggaata 


1320 


f2 Aft AT A Afl A A 


A ATT A H'Mr^T 
AAX lAAlMvxvrX 


ub X AC ALiA X \ j 


LI 1 X\A» X AC 1 


GCGTTATTTA 


ACXXTCTCATG 


1380 


pa at a jvnvy j\ a 
GANAAlxiGAA 


TAI AUALTIvA 


a HWMHAfPP A A P* 

ATGTATCAAG 


a on A A m A A PP A 

ATAATAAGCA 


TATGTTGACA 


ctatttagag 


1440 


GTGTCCACCG 


GTTTGGATAT 


-» »■ » tjs is t\ *■» & 

CANATGAAGA 


ITV1J M|>< <%TTV tk -K m 

TGCTGNAaAA 


CACGATATCC 


aagataatga 


1500 


mrrrpppm; apa % 

TTGGCTAGAA 


GTGTATANCC 


GTAATGGTGT 


TGTAACGGCA 


AGAGCAGTTA 


tttcgcatcg 


1560 


TATGCCTAAA 


GGTACAATGT 


TTATGTATCA 


TGCACAAGAT 


AAACATATTC 


aaacgcctgg 


1620 


GTCAGAAATT 


ACAGATACAC 


GTGGTGGTTC 


ACACAACGCG 


CCGACTAGAA 


tccatttgaa 


1680 


T\ pp > A P A P A A 

ACGAACACAA 


CTAGTCGGAG 


TS m A t~*f~*/^ A ^ A 

GATACGCACA 


AATTAGTTAT 


CACTTTAATT 


attatggacc 


1740 


AA1 X\jv?GAAL. 


/"""AAA P>PP is mm 

CAAAGGGATT 


TATATGTAGC 


A P*TWI*TAP A A AP 

AGTTAGAAAG 


TV TV IV TV 

ATGAAGGAGG 


ttaattggct 


1800 


nyi tv t» /-» tv nvn> is 

TGAAGAX XAA 


AGCGCAAGTT 


GCGATGGTAT 


mil TV TV I till Ml ITS /*^TV 

TAAATTTAGA 


TAAATGCATA 


ggatgccata 


1860 


CGTGTAGTGT 


GACATGTAAA 


AACACTTGGA 


CaAATCXjTCC 


AGGTGCTGAG 


taacatgtgg 


1920 


mmn« A m* ■%. pp 

1TTCAATAACG 


mis ^^t* tv tv is 

TAGAAACGAA 


GCCAGGTGTA 


GGGTATCCGA 


aACGTTGGGA 


agaccaagaa 


1980 


/I TV > MflTX TV TV » m 

CACTACaAAG 


blWi I\A*»T 


ACTAAANTCG 


TAAAGGGAAA 


CTTGAATTAA 


aatctggaag 


2040 


TAGAATTTCA 


CAAATTGCTT 


TAGGTAAAAT 


TTTTTATAAC 


CCAGATATNC 


CATTAATAAA 


2100 


AGATTATTAT 


GANCCATGGA 


NCTATAATTA 


TGAACATTTA 


acaactgcga 


AATCAGGGAA 


2160 


GCATTCGCCA 


GTTGCTAGAG 


CCTATTCAGA 


AATTACAGGG 


gataacattg 


aaattgaatg 


2220 


GGGACCTAAC 


TGGGAAGATG 


ACTTAGCAGG 


TGGTCATGTT 


acaggcccaa 


aagatcctaa 


2280 


CATACACAAA 


aTAGAAGaAG 


AGATTAAATT 


CCAATTTGAC 


gaaactttta 


tgag 


2334 



(2) INFORMATION FOR SEQ ID NO: 1087: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1087: 



GGCACGAGAC TAATACCTGA AATTAATCCA CCACCACCAA TAGCGGCAAA TAAATAATCA 

926 
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A AGTT AAPGG 
xvxvw x x nnuvjvj 


AATTGTf ACT 
nnl lui wAw x 


PTTTGATTGT 

\* 1 A 1 \J«A. A A \j A 


TCTAGCATTT 


CTTTAGCAAG 


CGTACPTTGT 


120 




TATGAAPATT 

X XV X utnnvxi X. X 


ATTGAATGGA 


TCTATAAAGT 


TCATTTGATG 


TTCACTTGTA 


180 


TAAGTTAAAG 


CTTCAGCTAA 


ACAGNGATCA 


AATGTATCAC 


CAGTGAGTAC 


AACTTCAACG 


240 


TTACTATTTC 


CAAAGAACTT 


TACTTGATTT 


ACCTTTTGTN 


AAGGTGTAGT 


GACTGGCATA 


300 


AAGATAACAG 


CGTTTANATT 


AaGx^ITTTTA 


GCTGTATAGG 


CAACACCTTG 


AGCATGATTA 


360 


CCTGCACTCG 


CACATGTAAT 


ACCTTTACTT 


TTAGCTTCAT 


CTGATAAAAC 


AGAAATAGCG 


420 


TTGTAAGCAC 


CTCTTAATTT 


AAAAGAACGT 


ACCCATTGTA 


AATCTTCTCG 


TTTTAAATAA 

XXX xxvxvxvxxvxv 


tIOV 


APTTTAPAAT 

XVw X X XXVwXVXV A 


CATACTTTTG 

w>ini« xxx x vj 


AGATAAGTAA 

XVWXV X XVXVW X XVXV 


TGGTCTAATT 


GTAAAGGTGT 

V* X XVXVXVV7VX X W X 


TTpTTTGAPA 

X X w X X X U*XVV_A 




ATATPTTTAA 

XV 1A X w X X X **** 


GTPTTAAAAA 


TGCTTCATCG 


ATATCTTNCG 


TAGAAACTGN 


TGTTTTGAPT 

X w X X X X \3XVw X 


fifin 


GGCATAATAT 


TCAACTCCCT 


TAAAGTGATT 

X «VVxW X \JI» X X 


TNNCATATTN 


NTCTATTAAT 

11 X X «t X A «u\ X 


GATTPATATT 

wWX X W XV X XV X X 


ODU 


GTAGGGTGAT 


TGCAATTGTC 


ATCTAAGCCA 


TTTACCAAGT 


TTATTTTTCC 


AAAGTTTPAT 

*W*V X X XwXVX 


720 


PAATTTCAAA 


ATGGAAAPGP 


TTGTPTGGTG 

X X V7 X ^ X VJ^J X VJ 


ATGACACAGT 


TTGATTTGfST 

X X \J«1 XXX \>7YJ X 


AAATPTATTT 
xvxvn x w x xv x x x 


/ ou 


PAATTTPPTP 

wXVXV X X X X 


ATATTGTGPA 

XV X XV X X \J X Vwt 


AGATGTTPAP 

AwtA X w X X wXV w 


G AGP APTTTT 

UAUUIV X X X X 


TTPTAAANPG 

X X w X XVXVXVX1 W V7 


ATAfifJPAAPA 

AX AwwwAAwA 




'Pf'V'Sf* ATTTTT 
X Lwvn X l x x x 


AfTPfiPAATTP 


ATATAAAATA 

AX AX AAAAX A 


TYTIVAPTGAA 

X V7 X wlLi X UAn 


APTTPPTGP A 

Aw X X WW X V7w A 


ATA ATA AT AT 
AX AAX AAX Ax 




/twa a app at a 

wXYAAAw wA In 


fypppTTAAfiA 

w 1 V* w X XAAwA 


RPPP A Af3PAf5 

wwwwAAwwAw 


PATPTTPAPH 


APTTOA APP A 
Aw X lunJVLLA 


PATT*PA A A AT 
wA ILLAAM X 


OCA 

you 


TATPTPP ACT 
1a1\. x w wAw 1 


AATTAAAATA 
AAX XAAAAX A 


(I A AC2PPPPTT 

UAAULVfUL X X 


T AT A TTYyTYST; 
XAXAX X w*V7 


TTTRTTAfV2A 
X X X Vj X lAuun 


TTO A A A TT* A r* 
1 XwAAAl wAw 


1U/U 


fSATTATPTGA 
wVi xaxwXUA 


APPATCAGGT 

AULA X wXVVSVJ X 


AAGTAPPGPP 

XIXVVJ X XVw w www 


ATTPATPAAA 

A X X w- A X w Ann 


AGPAAATGCA 

nOwAAA 1U\JA 


PPAAAf2PPAf^ 

w LAaAUL wAlW 


1 OftO 
1UOU 


TTTTTGAAAT 

X X X X .J* \Ji*rUx ^ 


ACGCTTTAAG 


TGTACCTTAG 


GAATGATTTG 


GTC TGTATCG 


ATATTfiTfViT 

XV X XV X X X w V3 X 


1140 


TGAAGAGAGG 


GAPTATTTTA 

^Jl X XV X X X X XV 


CCTTTATATG 

\^ XXX X *»■ X \J 


TTGTAATAGG 


TTTY5ATTGOT 1 

XXX wfi X X VJXw- x 


GCPATTTAAA 

\7v*V*n XXX XVXVX\ 


1200 


PAAPPAPPTT 


TpTTAPGTPP 

X w X X XVw\J X w-w 


APAAATTTAP 

XVwXVXVXVA X X XVw 


PATGAATAGP 

w>n X V3XVXV X Aw\> 


TGCTGPTGPT 

X\7w X VXW X V7w X 


GPP AT AfJP Af3 

w*w wA X Auv»<Aw 


1 9 fin 

X>^ D U 


GGGATAPTAA 

wTV^wLTV X Aw X XVXV 


ATGTGTTPTT 

A1U1U1 X w X X 


GPAPPTTTGP 

UWnL>V>l X X V7w 


PTTY2T l PfTT 1 PP 

w X XUXwwXww 


TTPA AAfSTTT 

X X wXVXVXVV7 XXX 


pgattapttt: 

wwAX X Aw X X w 


1 

IjZ KJ 


TAC2ATY2PAPA 


a top ArYiPrv* 


X Vnbu x Aw X X 


w»vj I Lnuun x x 


OATV^TTTA A A 
wAX VJww X AAA 


r* IP ATTV2 A AO 


UoU 


n 1 W V- X WW X X W 


APfiPPATTPA 

A^UUUA X X wA 


AATPPTflPAT 

AAX ww IwWii 


TTTT A A AfZ AT 
X X X X AAAOA X 


Af2T A TPT Afi A 
no InlLl aua 


ww X AAX X X X i 


1 AACt 


X V7w> X X w A X X 


TTTTAPTfyFA 

XXX lAUlUln 


PGAGAAPCTG 

W VJXVVXXVXV w W X W 


GTAPGAPAAT 

VJ X AWUAvAA X 


AGCTGTAATA 

XVVJJW X V3 XXVXV XXV 


TTTGflATGAA 
XXX uun IUAa 


1 son 


PTTTATTTPp 

W X X A XV A X X V*» 


TTTAAPAATA 

XXX XVXV WXVXV X XV 


TGAPTAGPTT 

X UAw X XVww X X 


CAATPAAATC 

WXVXft X WXVXVXVX W 


TGATAGTPTA 

X V7XV X XVw X W X XV 


P ATTTn T AP 

ww A X X X w X Aw 




ATGAAPPGAG 


AAAAAPATAP 


PPT A AfiTPfi A 
ww XxvAVj X wvSA 


TGTPTTPAGP 

X\J lv X X wXMJw 


TTTTTf2 A PPT 
X X X X X wAw w X 


f5T2TTT , T A A Pr* 

ww X X w X AAw w 


lOZU 


PPATATAATP 

w w-xv x «v x nnivi 


ATAPGPAPGT 

XV X fUvWVpXl\»\J X 


TGATCGTTGA 

XUtAXwVJX 1UA 


TATPATTGAT 

X XV X wXV X XVJtnX 


TTPARGGAAT 


f^TTPAPTTlA 
ww X iVnL X OA 


IDOU 


AATTAAPAPP 
AAX innCnUL 


PAfTTPAGGA 


X IxuJl X w ww w 


ATnTPAPTTY2 

A X V3 X l_Aw X X O 


TTJftTTPTAAT 
X ww X X w 1 AA X 


HTTP &&KPBT 
w* X X wAAxvwAX 


in An 
± 1 *± u 


r* A AfTTP A A T 


X AVJU,un lvn 


A aTRTTYVPT 
AA 1 AluuLb X 


r*c*T*T2r*c* ATT* A 

wvr luLUi X \~A 


wAAX AX Aww X 


LALULLAL i I • 




ATCAAPTdAT 


w Af2TY2 A AOT 
X X AwwwAAw X 




X vjut XT- 1 www X 


1 X AAw AX A X X 


wftAAIu X, 1 AX 


*1 O C f\ 


ATCATCAGGT 

•»* WXVA WVWV7 X 


TGGATTATGP 


PttfPATTTf^^P 

w\J X XV XXX \3U\> 


APPAPPTTPG 

XVw wXvw W X X WW 


ATAfiPPATfiT 


TAP A AATAI*2T 
X Aw AAA Inu X 




CATTCGACCG 


TCCATTGAAA 

x ww xv x x unnn 


GGTTTTTAAT 

MVJl XXX xxvxvx 


TfiTTTPfSPPA 

X w* XXX WWW W A 


GTAAATTPPA 

wXxvnAX X w wXV 


AAfiPATArifY* 

AAwwAX AwV>w 


1 onn 

17oU 


TGTACCAAAA 


TCAACACCAT 

X wi*riv >«^*A A 


APGTTTTAA'P 

XVX*\J X X X X XVXV X 


TAAATGCAGA 


ATAATRTPPT 

XV X XVXV X w X ww X 


TAfSPATAfSAP 

X AwwA X Aw A w 


9r*xin 


GCCTGTTGGT 


AAGGTACCAT 


TAATATCGAT 


TTTTAAGTTT 


TTGGGTTTTG 

X X X X X A VJ 


TTTGPPATAf^ 

X X X WW wXV X XVw 


9T no 


CGTTTGAGTT 


GCGAAAACAT 


GTTCAACTTC 


APTTGTTPPA 


ATTCPAAATG 


PAATAGPAPP 

wXVXV X AVAfACV. 


9i fin 


AAATGCTCCA 


TGTGTTGCTG 


TGTGAGAGTC 


ACCACAAACG 


ATTGTTTTGP 


CGGGPTGTGT 

w>\X\JV7w X w* X VJ X 


9990 


AAGTCCTGTC 


TCAGGTCCTA 


CCATGTGAAC 


AATACCTTGT 


TCATCAGAGC 


CCATATCAAA 


2280 


AATATGCACC 


CCAAAATCTA 


TGGCGTTTTT 


TTGTAATGTT 


GTGATTTGTT 


TfTTTTGPAAT 

X W X X X VTwXVXV X 




TTP ATTTTTA 
X X Unl v. X X XA 


ATATTY2AAAA 

AX AX X wAAAA 


A A iWui X nu 1 


Af2f2A APATfYI 
AuunnLni X w* 


lUA X wwAO 1 w 


x xvj LAAATCT 


2400 


TAAATCTGGG 


CGTCTTAATT 


TTCTGTTTTG 


AAGCCTAAGT 


CCTTCAAATG 


CTTGAGGAGA 


2460 


AGTAACTTCA 


TGTATAAGGT 


GTAAATCAAT 


GTATAATAGT 


TGOTGGTCGC 


CCNATTTCCC 


2520 


GNATAACACA 


TGTCTGTTCC 


ACACX!TTGTC 


AAATAATGTT 


TGACCCAGTT 


ACATCTCCCC 


2580 


CTNAGTGATT 


CAATTTTTGA 


GATAGCATTT 


CGAAAATATC 


AGTAGTATTC 


AATTTGCCGC 


2640 
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CTAAATCTGC TGTCGTTTGC CCATGTTCAA TCATGTTATA AATATGTTGT TCTAATTC _ 2698 
(2) INFORMATION FOR SEQ ID NO: 1088: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088: 



GAATTCTNCG 


CTACAATGTG 


GCATCGGCAG 


GCATTATCGT 


GTTTACTAGT 


TATCTTAATC 


60 


TTAATGAGTG 


GCGTTGCGAT 


TATTTTACGT 


GAACAAATNG 


NAGTAAAANA 


TTCTAATTTA 


120 


AACAATCAAT 


CTCATTTATC 


TATTANAAAG 


GGAGTTTTAN 


ATATGGCGCA 


AACACTTGCA 


180 


CAANCTAAAC 


ANATATCTCA 


AAGTCATACG 


TNTGATGTCT 


CACAAAGTCA 


TCATAAAACA 


240 


CCAGCTGATA 


CANAACTCAC 


AGTCCTGATA 


TGCCCTCATA 


AATATTAANA 


TTTTANTAGT 


300 


CAATAACAAG 


TTGAATATTA 


AAGTTAAACG 


CCCGTTAAAT 


AGCGTTAAAA 


AATTGAAAAT 


360 


GACAGGTATT 


TGGCAAAAAA 


TAAGAATTAA 


TTATTTATAT 






400 



(2) INFORMATION FOR SEQ ID NO: 1089: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089: 



ATAATTTCAA 


TGTCCCCGTA 


TGTGTAGTGC 


GCTGGTTGCT 


TAACATTGTC 


TTGCGCTTCG 


60 


TTCAATATCT 


ACTTTTCTGT 


TACTGATTAC 


GCTCATTATG 


CTTCACTCCA 


TTTCTTGNAC 


120 


ATTTGGNTAT 


AAGTGACATC 


GANCCAGTAC 


GGATCACGTG 


NATGTTTTTG 


TGGCGTTCCC 


180 


AATCCATAAA 


AGCCATGGTC 


TTTAATCTTC 


TCTTTCTTTC 


CTGTTCATAT 


TCCGCTCTCA 


240 


CAATTTCGTT 


GGTATCGGGT 


CCACAATCGC 


TTTTTTTCTG 


AGTTTTTCTC 


TCCCTTTTTC 


300 


TTCATCTTTT 


ATTTGACTCT 


TCATATATTC 


CAACTTCTTC 


TTTAGATTTT 


GAGTCCTTTC 


360 


TTCCACACAA 


TAATTCATCG 


CCGCGCATTT 


TATGTTTGTA 


TCTATATCTA 


AGAAGTTCTG 


420 


GAGATATATG 


ATATTTTTCT 


GAAACTTCTC 


TCAATGTCAT 


TAGTTTTCCT 


TTAATACGCA 


480 


CTCTTATAAC 


TTTTCTTCTA GCCATCATTC 


CACCTCTAAA 


TCTAAAACCT 


TGATATTTAT 


540 
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AACGTTATAT TTTAATAGTT CACCTGGATT ATTAAATAAA TAGTCCGCCA AATTTTCTTT 600 
TTCTTTATCA ATCTGATTGT AATTAACACT TTCGACTTCT GTAG 644 

(2) INFORMATION FOR SEQ ID NO: 1090: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1090: 



GGCCGATAAA 


TTACATGTTT 


GCCAAGTAGA 


TCTCGGAGAG 


AACGAACCTG 


TACAAATCGT 


60 


ATGTGGTGCA 


CCCGCATACG 


TGTATGCGTG 


ACAATATGTC 


ATTGTTGCTA 


AAGTAGGTGG 


120 


CAGATTGCCT 


GGTGGTATCA 


AAATTACGCG 


TGCCAANTCA 


CGCGGCGAAC 


GTTCAGAAGG 


180 


TATGATTTGT 


NCGTTACAAG 


AAATTGGTAT 


TTCAAGTAAC 


TATATNCCGA 


CAGGTTGCGN 


240 


NTCAGGCATT 


TATGTCTTTA 


GTGAATCCCA 


NGTNCCAGGA 


ACAGATGCCT 


TACAAGCTTN 


300 


ATATTTAGAT 


GATCANGTAA 


TGGAATGTGA 


TTNAACGCCG 


ACTCGCGCAG 


ATGCTTTACG 


360 


TATGATAGGN 


ACTGCTTATG 


AAGTCGCAGC 


ATTATATAAT 


ACAANATTGA 


CTACGCCAGA 


420 


GACAACATCA 


AATGAGCTTG 


ANTTATCTGC 


AAATGATGAN 


TTGACTGTGA 


CAANTGANAA 


480 


TGAAGATACC 


AGTACCATAT 


TATAGTGCAC 


GTGCTTGCTC 


ACAACGTGAA 


CATTTGNACC 


540 


TTCGCCACTG 


TGGATGCACG 


CACGCTTTAC 


TACACGCGGN 


TATACGTCCT 


ANTTAATAAT 


600 


GTCGTCGACA 


TCTCAAATTN 


NTGTGTTTAT 


CAGACTACGG 


CCAACCCATT 




650 



(2) INFORMATION FOR SEQ ID NO: 1091: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091: 

TTTATTCAGC ATTCGACCTA GTTGAACAAC GCAGTGGTCG TGATGCATTA GAAGTATTCG 60 

AAGAAGCAAT CAACAACATT ATGCCAGTAT TAGAAGTTAA AGCTCGTCGT GTAGGTGGTT 120 

CTAACTATCA AGTACCAGTA GAAGTTCGNC CAGAGCGTCG CACTACTTTA GGTTTACGTT 180 

GGTTAGTTAA CTATGCACGT CTTCGTGGTG AAAAAACGAT GGAAGATCGT TTAGCTAACG 240 
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AAATTTTAGA 


TGCAGCAAAT 


AATACAGGTG 


GTGCCGTTAA 


GAAAC GTGAG 


GACACTCACA 


300 


AAATGGCTGA 


AGCAAACAAA 


GCATTTGCTC 


ACTACCGTTG 


GTAAGATAAA 


AGCTTTTACC 


360 


CTGAGTGTGT 


TCTATATTAA 


TGAATTTTCA 


TTAAGCGTTC 


ATGCTTAGGG 


CATCGCCATA 


420 


TCTATCGTAT 


TTATTCAGTA 


ATATAAACTG 


GAAGGAGAAA 


AAATACATGG 


CTAGAGAATT 


480 


TTCATTAGAA 


AAAACTCCGT 


AATATCGGTA 


TCATGGGCTC 


ACAATTGATG 


CTGGTAAAAC 


540 


GGACTACGAC 


TGAACGTATT 


CTTTATAACA 


CTGGCCGTAT 


CCACAAAATT 


GGTGAAACAA 


600 


CACGAAGGTG 


CTCACAAATG 


GACTGGATGG 


AGCAAGAACN 


AAGACCGTGG 


TATTACTATC 


660 


ACAATCTGCT 


GCAACAACAA 


GCAGCTTGGU 


AAGGTCACCG 


TGTAAACATT 


ATCGATACAC 


720 


CTGGACACGT 


AGACTTCACT 


GTAGAAGTTG 


AACGTTCATT 


ACGTGTACTT 


GACGGAGCAG 


780 


TTACAGTACT 


TGATGCACAA 


TCAGGTGTTG 


AACCACAAAC 


TGAAACAGNT 


TGGCGTCAGG 


840 


CTACAACTTA 


TGGTGTTCCA 


CGTATCGTAT 


TTGTAAACAA 


AATGGACAAA 


TTAGGTGCTA 


900 


ACTTCGAATA 


CTCTGTAAGT 


ACATTACATG 


ATCGTTTACA 


AGCTAACGCT 


GCTCCAATCC 


960 


AATTACCAAT 


TGGTGCGGAA 


GACGAATTCG 


AAGCAATCAT 


TGACTTAGTT 


GAAATGAAAT 


1020 


GTTTCAAATA 


TACAAATGAT 


TTAGGTACTG 


AAATTGAAGA 


AATTGAAATT 


CCTGAAGACC 


1080 


ACTTAGACAC 


GAGCTGAAGA 


AGCTCGTGCT 


AGCTTAATCG 


AAGCAGTTGC 


AGAAACTAGC 


1140 


GACGAATTAA 


TGGAAAAATA 


TCTTGGTGAC 


GAAGAAATTT 


CAGTTTCTGA 


ATTAAAAGAA 


1200 


GCTATCCGCC 


AAGCTACTAC 


TAACGTAGAA 


TTCTACCCAG 


TACTTTGTGG 


TACAGCTTTC 


1260 


AAAAACAAAG 


GTGTTCAATT 


AATGCTTGAC 


GCTGTAATTG 


ATTACTTACC 


TTCACCACTA 


1320 


GACGTTAAAC 


CAATTATTGG 


TCACCGTGCT 


AGCAACCCTG 


AAGAAGAAGT AATCGCGAAA 


1380 


GCAGACGATT 


CAGCTGAATT 


CGCTGCATTA 


GCGTTCAAAG 


TTATGACTGA 


CCCTTATGTT 


1440 


GGTAAATTAA 


CATTCTTCCG 


TGTGTATTCA 


GGTACAATGA 


CATCTGGTTC 


ATACGTTAAG 


1500 


AACTCTACTA 


AAGGTAAACG 


TGAACGTGTA 


GTCGTTTATT 


AACAATGCAC 


GCTAACTCAC 


1560 


GT 












1562 



(2) INFORMATION FOR SEQ ID NO: 1092: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092: 

TTGTTGACTT AGGTTGAACA TTGGCCTGCC GATTTTAACC ACTTGAAGAT GAAAATGCAA 60 

GCATCTCTTC CCACGTCATA TTTGGACCCA TCGAAAAATT CGGGTTCTCC GAATCATCGG 120 

AACATAACCA AAAGAAGCTA AGCAACATGT AGNCCGTTGT CACTGAACNT CTTGTTTTTC 180 

CGATGACAGC TTCTATTTAG AGAATGTCAT GATTATTTTA TATTCACTTC NATGTTATCA 240 

ATATTAGTGC CATCTATGAC GTCTGCCATG CGATTTTCTT GTAATTTTTT GTGTAANTCA 300 

AACGTGTACT TTCCACCGTT TTTCATGTTA ATAACAATTT TACCTGAACC AACTGTTACC 360 

GTAAAGATTA TGATTNTCAA TAAGTTGTTT TCTCAATTTA AAATCAAGCT CTNTCAAGGA 420 
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VTTA£» I AA 1 V 


& TVT A TTPTP. 


A & A PR 

AAAV*vlA X V,V3 V» 


fyPf3AAATCAT 


AfTTTGATTA 


480 


IV 1 lxli 1 AV 


1AAWVV1 1AA 




X Van X v^Wv x x 


TTTTACTATT 

A X X X rn> J- ** Ji 


AtyrCTTCGTG 


540 




c* a & 2k TA T"PT 
UUiAAXAi 1 1 


111 lAnl XUA 


T ATTT ATTf2T 


CTTCTAAANC 


GATAAATACA 


600 


xvAjAIAx IA1 


L. V 1 AAUb IV C 


A IV 1 1 1A1A1 


XXIX XV, XV.AX 


V- X X X XV*V* X An 


rTAAAfiTTAT 
v» x x n x 


660 


TTTATAGATG 


AACCTATTTG 




r* A T A A A r*P T A 
LA1AAALU1A 


AV-V-U X V_VJ l\.k. 


Aluui 1 luftu 


79fl 


CaAAAaTCCA 


AACTGCTTTT 


CAAATNLAAA 


AL. 1 VW 1 1 IV 


Vj XV 1 AA 1 ALA? 


v. 1 V. 1 1 AAA IV 


"7Pfi 


TTCATATTTA 


GGAGTCATAT 


V IV 1 1 1x3 1 VL. 


1 1 Vj 1 1 1 IAIVj 


PTTCm Af2 ATT 
LjX luu/ttiAl X 


O 1 VJVJ lu X VJ iV 


QAPi 
OttLI 


TGATTTAGTA 


LiAl luuAl IV 


V 1 1 V 1 VjVjV^O 1 




VAjnvaui vi XV7 


l\^\V XXX nu X 




TTTCGGCGTT 


uluuAl Ivjvivj 


TTfiPrYSTTTTS 
1 iutuil X X\3 


TY2 ATfY^WPPT 


XV? X X XnV3V7V-V? 


f^T^ , 2^v l f^lTTfif ,, 




TCjATAT. Ai 1 A 


/UiUiA 1 1 IV 1 


LaV IV 1 IV 1 1 vjr 


TTT A f^fWTHW 
xxx nuu X X \ j X 


f2 A T A TTTTPT 
Wixni X X X V- X 


r^ATTTTTCriA 
VylAi X X X \Ajn 




AUL. luALiVl 1 


111 l\.UKV-iil 


T*2lf" , *T* A T*T U I Y72/"5 


nv^/^f ' 'TTPTPrc 
l\JV-k. 1 XV. X k^vj 


nU X X lAOOVkJ 


1UV.U 1 1L1 X\3 


1 OR Pi 


TCTTGTGTTA 


CX-TGC 1 


I IV IVbC loA 


All I VVALV 1 


uU IV 1 1 A1V 1 


T"PATV*Ik'|UIW 
1 1 A1VA1 1W 




TAATCGCTCT 


GCTTTAAGCG 


TTGGTACTTT 


GTCAACTTTA 


GTTGATTGTA 


TTTTTTCTGC 




TTTGACCGAT 


TGCGTCvjI x A 


CTGTAATTGC 


GCOTGTTGTT 


AAA ifiTPPTS 

AAAAuLLtlA 


V lbL 1 AAAL 1 


1 OCA 


GGTTTTAGCA 




TA1 1VA1AA1 


IV 1 Al At, IV v. 


A ATPTATATT 
AA1V1A1A1 1 


A1A1 lVuAI 1 


1 TOft 


v»xVTTT"x , TAC 


GTAATTTGAA 


Ik T> A f lk Ik O Ik 

1 VA 1 ALAAVJA 


TT* A TT AT A r* A 
1VA1 1 AlALiA 


lVAitb 1 1 L. 1 A 


APATA ATT" IP 
AVA 1 AA 1 V AL, 


UoU 


ATTAAACCCT 


TTTTAACAAT 


TATTGAAGTA 


TTATTAAGTA 


XTTTA A/VA A 
Al 1 1AAULAA 


AAA AT* A AHW^lk 

AAAAlAAlvjA 


i iiitn 


GTGATTATGA 


/^/tfTVfl m fit Jk nt Jk 

GGTTAATATA 


GCGTCTCTAT 


GCTGCCTNTG 


AAATGATTTT 


TAAvCATTAA 


lbUU 


AAAGAAvjl lA 


AGCAACGTGT 


^»1k Ik fTVtfVtfTV 1 Ik 

GAATt-CXaTL,A 


L.TTAAL-L.TL. I 


CTATTTCAAT 


T. IvAAAnAI 


lDOU 


TTCGTCGNCA 


AGTATGTGTA 


TTAAGCTTCT 


CTAACTNNGA 


TTTCAATTCT 


ATCAATATCT 


1620 


GTGACATTGA 


TAACATCGGA 


CATACGGTCT 


TCTTGGTAAC 


TTNNTTATCC 


AANTCAAATG 


1680 


TATACTTNCC 


ATAGTATTTC 


TTTGTGGACG 


GNAATTTNCC 


CTGTACTCAN 


TTCAACCGNN 


1740 


AAGGACCATA 


CTTATCAATA 


AGGTATTNTC 


TTTANTTTAA 


NATCAATCTC 


TTTCAATGAC 


1800 


ATCGCTCCTT 












1810 



(2) INFORMATION FOR SEQ ID NO: 1093: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1228 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093: 

GGCACGAGCG TCCAATTCAT CGAACGCCTG ATATGATTAA ACAACATGAG TATTTAGAAG 60 

ATATTACGAA AAATATAGAA TTAGGGCATT ACCAACCAAC AACGACACAA GTTTTTGAAG 120 

GTTTATCACC TGAAAACTTA TATCAAGCAC ATCAA1TGTT GGAAAAACAA TCAATGATTG 180 

GGAAATTAGT AATTAATATA TAGATTAGTA AGTAAAGTCG CCTTATGCAG AAAATACTGT 240 

ATAGGGCGAC TTATTTTTTG TGTCTAGCGA CTATTGATTC TAAACGGAAA TATGGCAACT 300 

TTACAGGTAT CAAATCAATA ATGCACTAGA AAGGCATTAC TTAAATACAA TAAAATACTT 360 
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TGTTGCTCAA 


TGATAAATAA 


TTTGAGTGTG 


TTTTATATAT 


CATTTTTAAA AGATGAGGCA 


420 


"1'1'ATTAATTG 


TGTCTATATA 


AAATTACAAA 


TGATTAAAAT 


GTTGCAGTTA 


TGGGTATTGA 


480 


ACTTACCATT 


ACACAGAAAG 


GGGTTTGTAT 


TTATGAAAAA 


TAAATCAGTA 


AGTCTAGTGT 


540 




TTTAGTGATA 


TGTACGATAT 


TTGTTGCTTT 


TGGAGCAATA 


TTTCCGAAAC 


600 


AAl I AlxAAAA 


ALTTCACCJCAA 


AACA l I ACTA 


CATTTATTGC 


GCTACATTTT 


TCTTGGTATT 


660 


ATTTATTGCT 


CGTTTTAGTT 


ATTTTATTTG 


TCTGTGTGTA 


TATTTTATTT TCGAGATACG 


720 


CAAACATTAC 


ATTAGGTGAA 


GAAGGCGAAG 


ATCCTGAATT 


CTCATTGCCG 


TCTTGGTTTG 


780 


CTATGTTATT 


TAGTGCTGGT 


ATGGGAATAG 


GACTTGTGTT 


CTGGACAACG 


GCAGAACCAA 


840 


TCAGTCATGC 


CTTCAAATTA 


ACGCCTATTC 


ATAAAGCTGG 


CACGCAATCT 


GCAATTAATG 


900 


ATGCAATGCA 


GTTTTCATTT 


TTCCATTGGG 


GGATTCATGC 


ATGGGCAGTC 


TATGGTATTG 


960 


TCGCGCTTGT 


GTTTGCATAT 


TTTAGCTTTC 


ATAAAGGATA 


TCCTGGCTTA 


GTCAGCGCTA 


1020 


CATTGACGCC 


TTTATTAGGT 


GAAAAGGCAA 


TGCGTGGACC 


TTTAGGTGGT 


GCCATTGATG 


1080 


TATTGGCAGT 


CATAGCTACA 


GTAACAGGCG 


TTGCTGCACA 


TTAGGTTTCG 


GGGGCAATTG 


1140 


CAAANTAAAC 


GAGGGNTTAC 


ATTTTTTTTA 


TTTTAATGGT 


ACCGAGTNAC 


TTCANGGANG 


1200 


CAAGTANTCT 


TGATTGGNCA 


TCGNAACC 
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(2) INFORMATION FOR SEQ ID NO: 1094: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094: 



GTTAATCAAG 


TATCGAAGCG 


GAACAATCAT 


ACTTTAATGT 


TGAAGATTTA 


TATNGCGAAC 


60 


AAGCGATGGT 


CCTAGTGCGT 


AATATTAATT 


TAGCACTGCG 


CGCACAATAT 


TTGTTNGNAT 


120 


CTNATGTCGA 


TTACTTTGTA 


TATNNTGGTG 


ATATTGTTTT 


AACTGACCNC 


ATTACAGGTC 


180 


GTNTGTTACC 


GGNAACTAAG 


TTGCAAGCTG 


GACTTCACCA 


NGCTATTGAA 


GCGAAAGAAG 


240 


GTATGGAGGT 


TTCAACAGAT 


AAAAGTGTTA 


TGCCAACCAA 


TTACCCTTCC 


AGAATTTATT 


300 


TAAACTTTTT 


GAATCAATTT 


TCAGGTATGA 


CAAGCTACAG 


GAAAATTAGG 


CGAATCAGAG 


360 


TTCTTTGATT 


TGTATTCANA 


AATAGTCGTA 


CAAGCACCCA 


ACTGATAAAG 


CGATTCAACG 


420 


TATCGATGAA 


CCAGATAAAG 


TGTTTCGTTC 


AGTTGATGAG 


AAAAACATCG 


CGATGATTCA 


480 


TTGATATAGT 


TGAACTTCAT 


GANNCGGGGC 


CGACCGGTTT 


TACCTCATAA 


CCGAGNACTG 


540 


CTGAAGCGGC 


TTGAATACTT 


TTCNGAAGTA 


TTATTCCAAA 


TGGATATTCC 


TAATAATTTA 


600 


CTCATTGCGC 


AAAATGTTCC 


AAAAGAAGCG 


CAGATGATAG 


CTGAAGCAGG 


CCAAATTGGT 


660 


TCCATGACTG 


TTGCGACTAG 


TATGGCAGGT 


CGAGGCACAG 


ATATTAAACT 


TGGTGAAGGT 


720 


GTCGAAGCAT 


TAGCTGGATT 


AGCTGTTATT 


ATTCATGAAC 


ATATGGAAAA 


TAGCCGTGTA 


780 


GACAGGCAAT 


TACGTGGTCG 


TTCTGGTAGA 


CAAGGGGATC 


CGGGATCATC 


TTGTATATAT 


840 


ATTTCACTAG 


ATGATTATTT 


AGNTAAGCGA 


TGGAGCGATA 


GTAATTTAGC 


GGAAAATAAT 


900 
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CAAl I ATA1 1 


CANTACi/VIvjC 


ALAALuAI in 


TfVSr* A A AfiT A 


AM"!"!*! ■■» *"i - 1" A A 
nX X 1\3* X Inn 


x a i 


960 


& Afa^A a atty" 

AAbwinAi IVj 






TCGGAAAGAA 


CAAGGGGTTA 


AAGCTCGGTG 


1020 


AnAluoL JL 1A 


ill XunnX 1 A Vj 


XYNft/\nn/lU^»n 


TNAGTATTCA 


GCGAAGATCT 


TNtTTATTTAC 

X XV^J X XXX 4 


1080 




nnnl\.Lunu i- 


TTTT & & & & T 
XXX Inunnnl 


TAGATTGATG 


CTGAGAATCC 


NAGATTTTTA 

xwxi>V7n x * * *■ x 


1140 




prpqtii A AfS ATT 


f2TATTTf2 A AA 


X n\3 X X X wVJVJVJ 


NAATGANGGA 




1200 


ACAAAATCGC 


GNGTTGGGCG 


AGTATATTTT 


ATCAAAAATT 


TAAGTTNCCA 


ATTTAATAAA 


1260 


GATGTGGCTT 


GTGTTAATTT 


TAAAGATAAG 


CAAGCAGNAG 


TGACATTTTT 


ATTAGAGCAA 


1320 


TTTGAAAAGC 


AATTAGCTTT 


GGANTCCGTA 


AAAACATGCA 


ANGNGCATAT 


TATTATAATA 


1380 


TTNCCGGCCA 


AAANGTCTTT 


NGGGAAAGCA 


ATTGATNCAA 


GTTGGGGTTA 


GGAAC AAGTC 


1440 


GGCTTTTNAC 


AACAANTTAA 


NAGCAAGCGN 


TAATCAAACG 


ACAAAANTGG 


CAACCT 


1496 



(2) INFORMATION FOR SEQ ID NO: 1095: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095: 



CCCAAATCCT 


TTGAAATAAA 


ACTTAATTAA 


GTTCAAGAAN 


TCGNGGTGAT 


TTTCCCCGGA 


60 


AGTTTGTTTG 


AATATTATTA 


TGGATTGATT 


TGATAGAGCT 


TATGGACCCC 


ACATTAATAT 


120 


TGAAGTCCTG 


GGGCCAAGTT 


CACNCCCCCG 


TGGGTGGAAT 


TTCCTATTTG 


ATCCCTAAAC 


180 


CCAAACTATG 


GACGTCCCAA 


ATTCTAAATA 


TGAAAAAATG 


GCTGAGCATC 


GTTTGATGAA 


240 


TCATGATTTA 


TATCCCGAAA 


AAATAGATAA 


TCGTTAAATG 


TATCATTTAA 


TAAACACACC 


300 


AATAAGTTGA 


TTTTCCTAAC 


TTATTGGTGT 


GTGTTTTTCA 


TTTAGCATAC 


ATAATAGGTT 


360 


ACATTAAAAT 


AACATTTTAT 


ACCCAAAGTA 


CACCAAAAGA 


ATATTAGTAC 


ACGAATTANA 


420 


CAACATTTTT 


ATAGAAACCT 


ATTGCACTTT 


AACGTCAATA 


AGTATATTTT 


TATATTATCT 


480 


CTAATTAATT 


GTGCGCGCTT 


AATAACAGAA 


TATTCTCAAT 


ATTTTTATTT 


TTTTGTGATT 


540 


TGTTGGAATA 


TTTAGTTGAT 


AAGGCACAAT 


CAAATTTACT 


TAAACTATTG 


TATTAGGGGA 


600 


AGAAAGGATG 


GGATGTATAC 


ATGACACAAC 


AAAACTCCCA 


TGGAAATCAA 


ATTCAAGACA 


660 


TACCTCAAAC 


AGGATTTTTC 


GGGCATCCTC 


GAGGACTAGG 


CGTACTCTTC 


TTTGTAGAGT 


720 


TCTGGGAAAG 


GTTTAGTTAT 


TATGGCATGC 


GTGCCCTACT 


CATTTTCTAC 


ATGTACTTTG 


780 


CCGTAACAGA 


TAATGGCCTT 


GGAATTGATA 


AAACAACAGC 


TATGTCAATT 


ATGTCAGTTT 


840 


ATGGTTCATT 


AATCTATATG 


ACATCCAATA 


CCAGGCGGAA 


TGGATTGCTG 


ACAGAATTAC 


900 


AGGCACTAGA 


AGCGCTACTT 


TATTAGGTGC 


AGTCTTTATT 


ATTATCCGAC 


ATATTTGTTT 


960 


AAGCTTACCA 


TTTGCATTAA 


TCCGCTTATT 


CACATCAATG 


TTCTNCATCA 


TTATTGGCTC 


1020 


AGGTTTAATG 


AAGCCAAACA 


TTCCAAATAT 


CGGTGGCCGT 


TTATATCCTG 


AAAATGATAG 


1080 


ACGTATGGAT 


GCAGGTTGTG 


TTATTTTCTA 


TATGTCAGTT 


AATATGGGTG 


CATTATTATC 


1140 


ACCTATTATT 


GTGCAACACT 


GTGTTAATGT 


TAAAAACTTC 


CACGGCGGAT 


TCTTGATTGC 


1200 



933 



WO 97/30070 



PCT/US97/02318 



AGCAGTTGGT 


ATGGCATTAG 


GTTTAGTATG 


GTATGTACTT 


NACANCCGCA 


AAAACTTAGG 


1260 


TAGCGGTGGT 


ATTGAAACCN 


ACTAACCCAT 


TGACNACCAG 


CTTGNAAAGA 


AAAAGTATGG 


1320 


TCTTTATTAT 


CGGAAGTGGT 


GTCTTTAGCA 


ANTGTATTAA 


TTATCGGTAT 


TGGGGCATTA 


1380 


ACTAACTCAA 


TATCAANTAA 


CTTAGGTAGG 


AATACTGTTT 


TAGTATTAGG 


TATTGCATTA 


1440 


CCANTCATTT 


ACTTCACTTT 


AATTATTAGA 


AGTANAGATG 


TCANNGATAC 


TGNACGTTCT 


1500 


CGTGTTAAAG 


CATTTATTCC 


ATTATTTATT 


CTTGGAATGG 


TGTTCTGGGC 


TATTCAAGAA 


1560 


CAAGGGTCTA 


ATGTATTAAA 


CATATATGGA 


ANTGNACATT 


CAGATATGAA 


ATTAAACTTA 


1620 


TTTGGTTGGA 


AAACANACTT 


TGGTGAAGCG 


ATCTTCCAAT 


CAATTAACCC 


ATTATTTATT 


1680 


TTATTATTAG 


CTCCAATTAT 


TTCACTTTTA 


TGGCAAAAGC 


NTGGAACTAA 


ACAACCTAGC 


1740 


CTGCCAGTAA 


AATTTGCAAT 


TGGTACGTTC 


TTAGCAGGTG 


CGTCATACAT 


ACTAATTGGT 


1800 


ATTGTAGGTT 


ATGCATCAGG 


TTCATCAAAC 


TTCTCAGTTA 


ACTGGGTTAT 


TTTATCGTAC 


1860 


ATTATTTGTG 


TTATCGGTGA 


GCTATGCTTA 


TCACCAACGG 


GTAATAGTGC 


TGCTGTTAAG 


1920 


TTAGCACCTA 


AAGCATTTAA 


TGCCCAAATG 


ATGAGTATTT 


GGTACTTAAC 


TAACGCTTCT 


1980 


GCACAAGCAA 


TCAATGGTAC 


TTTAGTTAAG 


TTAATAGAAC 


CACTAGGTCA 


AACAAATTAC 


2040 


TTTATTTTCC 


TAGGTGTTGT 


TGCAATTATT 


GTTACAACAA 


TTTGTATTAG 


CATTCTCACC 


2100 


TTTAATCATC 


AAAGCGATGA 


AAGGTATACG 


TTAATATTGT 


TGGCCTAATT 


CAAAAAACAG 


2160 


TAAGTCATTT 


AAATGGCTTA 


CTGTGTTTTT 


ATAGGTTTCT 


ATTAATTAAA 


TTCAAGATAT 


2220 


CAGTATAAAT 


AAAAGCTTAA 


TATGCTCGTT 


ATAGACAGCT 


ATAACTATAT 


TTTCTCGTCC 


2280 


CACTCTATAG 


ACATCAATGT 


CAGTTATTAC 


CATTTTCTCT 


ATTTAAAAAC 


ATATTTTTGG 


2340 


TATTAACAAT 


TAAAAGTGGG 


TATATATATT 


TAATGAACCA 


ATTTTTAGAA 


AGTAGGCTAA 


2400 


ACTATGGAAA 


ATAAATATAC 


ACATGGTGTC 


CTCTTTTACC 


ATGAACACAG 


TGGATTAAAA 


2460 


AATATTAATC 


AAGGTATTGG 


GGAAGTTACA 


ACAGCATTAA 


GTTCAATTTG 


TAAGCATCTT 


2520 


TCTATTCAAT 


TGAGTGAAAA 


CGAAGGTGAC 


ATTATTAAAT 


ATTGCCAAGA 


AATCAAAACT 


2580 


AAAAACTATC 


CAAAAGATGT 


AGATAG 
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(2) INFORMATION FOR SEQ ID NO: 1096: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096: 

GGCACGAGCG ATATTTGTCA GGAAGTTAAA GTTCATGGAG ATAAGGCTTT GAAAATGTAT 60 

AATCTAACAT TCGATCATAC GAAAACAGAT CATTTAGAAA TTAGTCATGA ACAAATTAAA 120 

GCAGCATTTG ACACATTAGA TGAAAAAACA AAACAAGCAT TACAACAAAG TTATGAAAGA 180 

ATTAAAGCAT ATCAAGAAAG TATTAAGCAG ACGAATCAAC AGTTAGAAGA ATCAGTGGAG 240 

TGTTATGAAA TATACCATCC ACTAGAAAGT GTGGGTATTT ATGTGCCTGG TGGCAAAGCA 300 

AGTTATCCAT CAACGGTTCT AATGACAGCG ACTTTAGCAC AAGTAGCGGG TGTAGAAAAT 360 
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ATTLiTTL*l~lV» 


rrv« » /•» a /"V"' 7* r+r* 

Tl^lALLALL 


TCAAJLL 


uuAb 1 A ILL L 


AALvluLtLi 1 A 1 


T "kflf^TW A TT* 
1 ALiL 1\jLZA lu 


A OA 


1 1A1A1 1 AL-L» 


A Af^PT^A Arrv* 
LAAbi 1AA1\. 


AALi iLr 1 1 IVA 


AUl 1\3U iUU 1 


pp/ppii a Anr a 

Va. 1 v—AAMvj 1 A 


1 ITjL IvjLAI 1 




l"*AOTTAT/3f!!A 
uAL.1 lAlVSixA 


AC* Af2A a AfYiA 


1ALL innnui 


Tfi AT AA ATT 




f2T A ArT* A ATT 
V71AALLAA1 1 




Tf2f**TM/2r , ATA 


luLLLAAAAA 


A1A1 1 inl 1 1 




VjIAI IV? 1 ILL 


AMATTWARP 
ANA1 luLAuu 


OUU 


Arv"* axi/"* ap*a a 


ATAf2r , Ar*Trn 

Al ALtLAL 1 lu 


ATT A TW A Pf! 




ATYITAI^ATY^r' 
A l\J 1 ALrAlLrL 


r* A MPPHPTHT 
LAnLunL 1 JN 1 


DOU 


M & TOTTTTTf 1 
MA lo 1111 1 Aj 


fSfSr* A Af^f^M A^2 
LjLtLAAvjLWALt 


A Ar* ATM AT/2 A 


A TT AflM A PfiT 


LL A 1 ATI IM ILft 


1 luuluAALiA 




1\jL.uL^AALt 1 V 


rwnuK A A/~» ATT 
L 1 MAAAuAX 1 


TM/2 A ATPAPM 
1 MVaAAlv.AL.rj 


TATTY2PT A AM 
1 A 1 1 oL 1 AAJN 


LrLAl ILjLL 1 A 


A rtj I\£LiALALi 


TOO 


ANALTLtAUAI 1 


GTTTCTANAA 


f»T A TW2r*W A A 
Vjl A 1LLH_IMAA 


1LAALALJNAL 


L. 1JNA ILL AJNJM 


ATNGTAATTN 


840 


W ViA 1 JN iuW3H 


uAnuLAlvnt 


AXTPTVATnA 
UnlML ILAlwl 


ATAPAATPfiP 
A 1 ALAA 1 L vjL. 


flPPTPAAPAl* 


Ltl.l> i l.lfa rTL 




AAAL. ALt 1 AAA 


ILL IvAALUA 




TTT2A APfIA AT 


AA1 1 ILIAIL 


tw^tt » fry* ta m 
ILb I 1 A IlAI 


OCA 




Al 1 A i\-AAL 1 


1 Aj IV? 11 1 L 1 1 


LtL A 1 1 IL 1 1 Vj 


1AA1 ILLtLtLA 


AO A A A/"* ir*Tvn 
ALJLAALALl 1 


1 AO A 


L. 1 IvjAiL. ILL 


ILLtA ILALLtLt 


L, 1L 1 1 ALLiAl 


ta ATikrvnv^r 
1 AA 1 ALL> L\j 1 


TTTATATTTT 


LTLxAACTTGTC 


lUaO 


TTTCAAGTTT 


ATTATTAATT 


A A STPiATir* 
AAA1VAA 1 AL 


L IxjLLtIAi AA 


ATCATCGTTT 


lTlKTTCTTCaG 


1140 


CTCTTaaCGT 


AaCATTTTTC 


aaTGLtAATTG 


TTACTTCAAT 


TTTAGTAGCT 


GAATNTGAAT 


1200 


AAGTTTTAAC 


TTTAACATGC 


GCCACTGCAT 


NTGGTACGTC 


ATTAAAATAA 


CGTTCCAACT 


1260 


TACCAATTTN 


CTCCTCAAAT 


ATAGNTGCGA 


ATAGCATCTT 


GTGATAGGGG 


AGGGNTATCT 


1320 


CCAATGAATT 


TNCAAATCTA 


TATCATANGG 


GAATCTCTCC 


CTTAAACCTC 


TTTATTGGTA 


1380 


ACTCTTTATT 


ATATTNNNAC 


ATTNNNACGC 


TATCGNGCAA 


ACGCAAACAC 


TTTGGATTCT 


1440 


CTGATATTNT 


NNAGCATATT 


AATTTACAAC 


CCTGCACGAT 


GATTG 
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(2) INFORMATION FOR SEQ ID NO: 1097: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097: 



TTTAATTTGC 


TAAACCCAAT 


TCACTAATTG 


TGGAGTTGAA 


TTGTAAAAAT 


GTGTTTTGAA 


60 


ACAAGGTACT 


CTTCTAAGCC 


TTCTTTGCCT 


AATTCTCTAC 


CGATACCTGA 


TTGTTTGTAT 


120 


CCACCCATGG 


lX^CTTGTGCA 


AAATATGGAT 


GGAAATCATT 


AATCCACACC 


GTCCCAAGTT 


180 


TCAACTTGTT 


AGCAACGCGT 


TGTGCTTTTC 


CAATATCTTT 


AGAAAATACA 


GCACCTGCTA 


240 


AACCATATAT 


AGAATCATTC 


GCTAATTGAA 


TCCX^TTCTTG 


TTCAGTTTCA 


AAGCCTTCTA 


300 


CAGTAACGAC 


AGGTCCGAAA 


ACCTCTTCTT 


GTACAATACG 


CATTGACGTA 


TCACAATTTG 


360 


TAATGACTGT 


TGGCTCGAAG 


AATAGACCAT 


CTTTTAAATC 


GTCTCTATCT 


GGACGTTTAC 


420 


CACCAACAGC 


AATTGTTGCG 


CCTTCTGCTT 


TAGCTACATC 


CATATAAGAT 


TCGATCTTAT 


480 


TACGATGTTC 


TGTTGAAATC 


ACTGGTCCCA 


TTTCAGTATC 


AGCATCAAAA 


CCATTACCTA 


540 


ATTTGATTTT 


TTTCACGCGA 


TCAATAAGTG 


CTTGCTCAAA 


TTTGTCTTTA 


ATACTGTTTT 


600 


GTACTAATAT 


TCTNGATCCT 


GCTGAACANA 


CTTGANCTGC 


ATGNAAATAT 


CCACCATTTA 


660 



935 



ACGCTTGGTC TACTGCCAAT TCANAATCAG CATCATCAAA GATAATGTTG GGATTTTTAC 
CGCCAAGTTC CAAGACAATA TTCGTAAACA TTATTAGCAG CATTTTTCAT AATATGGCTT 
ACCAGTCTCA ATGCCACCTG TAAATGATAC AAGGTCAACC CTCTTTATGA CCTGACATTA 
CGTCACCCAA CTTCAGAACC TGCACCAAGA ATAAGATTAA TTGTTCCTTT AGGGAAACCC 
AACTTCTTCC ATTAATTCAA AAACACGTAT TGTTGTTAAT GGTGTAATTT CACTTGGTTT 
CATAACTAGT GAACAACCCG TAGCAAGCGC TGGCGCAATT TTCCATGATG CTTGTAATAA 
CGGATAATTC CAAGGTGTAA TTTGTGTAAC TACACCTACT GGTTCTTTAA CAATTTTGCT 
TTCTGTATCT GGAATTGGTG AATCAATCAT TTCGCCACCA TCTTTATCTG CTAATCCAGC 
AAAATACATA AACACATTAT GAATATCATC CATATCTGCA GATGATTCTT CTAACGTTTT 
TCCAGTATCT AATGTTTCTA ATCGTGCTAA CGCTTCGCTC GTGCC 

(2) INFORMATION FOR SEQ ID NO:1098: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098: 

GAATTCNAGG TACTGCAGNT GNANTCGCAA CACATGTNGC TGTCACTGNT ATTCAGTCTT 
AATTGATGCC ACAGGNTAAA TTCGCNTATA TTAGNTNATT GCAANCGGGG TATAACAATG 
TNGCTATGAT TGCAGACGGA GTGAATGATG CNCCAGCACT TGCTGCATCT NCTGTNGGAN 
NAGCAAGGCT CGGNNCTGGG ACGGATACTG NANCAGAAAC AGCTGATATT GNATGATTGN 
GAGGTGNACG AAGCNCACTT CCCATTGCAG CANCCCTCNN TCGAAAAACT TTCTANNTCA 
TTNGAGCTAA CATCACATGN GGGNATCNGN GCCTNGAATA ATTGTCCTCC TTATCANCNT 
ATCCCCC 

(2) INFORMATION FOR SEQ ID NO: 1099: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099: 
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TACTATTAGT 


1140 


TATCCCGGGA 


r¥i/"»< %i in ii x » ****** 

TGGTTAACCC 


TTTGGATAGC 


GATTCTTTCC 


GATATGGGAG 


CTACTATTTT 


1200 


GGTAGCATTA 


AATAGTTTAC 


GACTGATGAG 


ik d miiu » ik ik m m 

AGTGAAGGAT 


aaa m a /"*^>mn /"^ 

AAATAGGTAG 


A A A 9k /**9k ^ Ik fn 

AAAACAGAAT 


1260 


m **ri\ ]4 11 poqip A 

AGTAAGGTCA 


CGv-TGTGCvaV, 


m ik fwffw^ "k ^ ****** 

AATTCAAGGG 


GGGCTTTTCA 


TinVWItPTl AO A A 

ATTTGAAGAA 


AAGTCCTACC 


1 1 O A 


CCTAAAATAT 


AAA AfTIAIIWIW '/^ 

AAAATATTGG 


AGATGGAAAA 


ik m/^ ik rw^/**^ A A 

ATGATCGCAA 


/v* Am A ^fmn a /** 
CGATACTTAC 


AGCAGCTGCG 


1 "3 O A 


GTATATGTAG 


OA AO AOOA AT 


TGATATCTCG 


TTATATTAAT 


TCTGGTTCTG 


TTGCAAAGTA 




AAA A*T*A»PAOO 

AAAA I A I Aviv- 


rpA AOO AO*PA A 
1 AALLAL 1 Aft. 


TTTATCATGT 


CAGTGTTCGC 


TTAACTTGCT 


a r*f* ik mn tk ^n/^^ 

AGCATGATGC 


1 CAA 


1 AAX 1 X VAj 1 Vj 


V?v*A 1 ajuLuAA 


AATCCGTAGA 


TCTGAAGAGA 


CCTGCGGCTC 


TTTTTATATA 


1560 


GAGTGTAAAT 


ACATTCAATA 


CCTTTTAAAG 


TATTCTTTGC 


TGTATTGATA 


<"«Jiirnf m*» ik niTk **r> 

CTTTGATACC 


1620 


TTGTCTTTCT 


TACTTTAATA 


iim'i ik ********%** ik m 

TGACGGTGAT 


CTTGCTCAAT 


GAGGTTATTC 


AGATATTTCG 


i c a a 
loot) 


ATGTACAATG 


ACAGTCAGGT 


TTAAGTTTAA 


a A^omnwnA Am 

AAGCTTTAAT 


TACTTTAGCC 


ATTGCTACCT 


1 /40 


TCGTTGAAGG 


TGCCTGATCT 


GTAATTACCT 


TTTGAGGTTT 


ACCAAATTGT 


rffWiiA Arrw^A^A^ 1 

X X AATGAGAC 


i o n a 
1800 


GTTTGATAAA 


CGCATATGCT 


** Ik Tk m ffynm m 

GAATGATTAT 


CTCGTTGCTT 


it f*r*** i> ik f*** ik ik 
ACGCAACCAA 


ik rnik nwmk fk f*w« 

ATATCTAATG 


1860 


TATGTCCCTC 


TGCATCAATG 


0^*AOOA*T»AH1A 

GCACvjATATA 


A A rp A O^TlOO A 

AATAGCTCGA 


TTTTCCTTTT 


ATTTTGATGT 


iyzU 


ACGTCTCATC 


AATATGCCAT 


TTGTAATAAG 


U"X W X'X'X"X"X'ATG 


CTTTTTCTTC 


A A A IIMllflW ' a m 

CAAATTTGAT 


1 O O A 


ACAAAATTGG 


GGCATATTCT 


rrv~»A AOOO A AO 

luAAULLAAL 


OjO*PAO AOOOO 


1*OA A t>o A m/-« A 
XuAA XviA. Ivift 


AOOTVTVTt Arf^ A/^ 

ACbl X XACAC 


O A A r\ 
ZU4U 


CACGTTCCCC 


TTAATATTTC 


AGATATATCA 


CGATAACTCA 


AIuLAIAlL X 


f*AOA1*AO*PAO 

IAuAXAXjXAIj 


01 AA 


CCAACGGCTA 


CAGTGATAAC 


ATCXTTTGTTA 


AATTGTTTAT 


ATOTOJ A A ATA' 


npfflO A *P A O A O 
VjC X V-A X AvJAv? 




AAGATTCCTT 


TTTGNTAAAA 


TTATACTATA 


AATTCAACTT 


TOW A JkO AOA A 
X\7lNAAL.ALrAA 


CCAGAATTTT 




GATATAGATC 


TTGTAGATAT 


CCTTTTAGAA 


AAGTCGAAAG 


AOATT/2f20 AT 
AuAl X\3ViV.AX 


AOAOO»pAOAT 
AL. Aviv? X Av, A X 


*)0 Q A 


CTTCAACATT CCGTTGAATC CATTGAAAAA GAACAAGGAA 


AGTTTCATGT 


GTACGCTCGA 


2340 


AAAAAAGAGG 


GCACGAGCGG 


TTTGAAGCAG 


ACATTGTTAT 


TCATGGTGCT 


GGACGGGTCC 


2400 


CTGCCTTAGA 


TATGAATCTT 


GAAAAAGGGA 


ATATAGAAAG 


GAAAAAACAT 


GGTGTCCATG 


2460 


TTAATGAGTA 


TTTGCAAAGT 


GTAAGTAACC 


CX5AATGTCTA 


TGCAGCTGGA 


GATGCTGCAG 


2520 


CAACGGATGG 


CTTGCCCCTC 


ACACCTGTAG 


CCAGTGCAGA 


TTCTCATGTC 


GTAGCATCTA 


2580 
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ATTTATTGAA AGGGAACAGC AAAAAAATTG AATATCCCGT GATTCCATCT GCTGTATTTA 2640 

CCGTACCTAA AATGGCATCG GTAGGTATGA GCGAGGAGGA AGCCAAAAAC TCTGGCCGGA 2700 

ATATTAAAGT AAAGCAGAAA AACATCTCCG ACTGGTTTAC GTATTAAACG GACAAATGAG 2760 

GACTTTGCTG CGTTTAAGTG CTGATTGACG AAGGATCATG ATCAATTTTG GGGTGCTCAT 2820 

TTGATTAGTA ATGAAGCCGA TGACTGATTA TCAATTTGGA CAGCATCCGT TTGGGATTTC 2880 

AACCAAAGAT TGAACAATGA TATTTGGCCT TCCCACC 2917 

(2) INFORMATION FOR SEQ ID NO: 1100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100: 



ATGATGTAAA 


CCCAGTTCCN 


ANCTTCATCA 


TTCAGTTGAT 


GCAATAATGT 


CACNNAAACT 


60 


CTACCACCAA 


GATGCACCAA 


TAGGTTGTCC 


CTAAATGCAA 


TAAGCGCCAC 


CCTTTAACAA 


120 


TTCACCCTTT 


TCAGGGAGGT 


NATTTTAAAT 


TCACCGATCC 


AACAGCTAAT 


GATTGTCCTN 


180 


CAAATGCTTC 


AATTTAATTC 


CAAATACATC 


AATATCGCTT 


AATTCTTTTT 


TACTACGTTT 


240 


CCAAAGCCTT 


TTCTACAGCG 


CCAACTGGTG 


CAATACCCAT 


AATAGAAGGA 


TCTACGCCAT 


300 


GACTTCCCAA 


AGCCATCCAA 


GCACTGCCAA 


TGGTTCGATA 


TTTAATTCTT 


TAGCTTTGTC 


360 


TTCTGACATG 


ACTAACATCA 


TCGCAGCACC 


ATCATTGATT 


CCTGATGCAT 


TACCTNCTGT 


420 


AACTGTCCCG 


TCTTTTTTGA 


AAGCTGGTCT 


TAAGCGACTT 


AATTTTTCGA 


CTGATACATT 


480 


TTCACGGACA 


CCTTCATCCT 


TAGTGACTAA 


GATTGGTTCA 


CCTTTACGTT 


GAGGAATCGA 


540 


TACTGGAACT 


ATTTCACTAT 


CAAATTCACC 


ATTTTGCTGT 


GCACGTACTG 


CTTTATTGTT 


600 


GTGAGATTAC 


CAGCAACATG 


TATCTTTGTT 


CTTCTCTTGA 


AATACCATAT 


TGCTCCTACT 


660 


AAATTTTCAG 


CAGTAATACC 


CATATGATAT 


TGAGTAAATA 


CATCTGTTAA 


ACCATCATAT 


720 


ACCATGCTAT 


CAACCATTGA 


GTGATGTCCC 


ATTTTAAAAC 


CGAAGCGACT 


GTTGTTGNCA 


780 


AGCATTGGTG 


NCTGAGACAT 


ATTCTCCATA 


CCGCCAGCGA 


GCACGATGCT 


CATGCCCACC 


840 


AGTCAGCNAT 


AGATTGATAT 


GCTAATTGAN 


TCGACTTTAA 


CCCAGAGCCC 


ACATACTTTA 


900 


TTCACTGTAA 


ATGCAGGTGC 


TGTTTCTGGC 


AAGCCACCTT 


TCATAGCAGC 


AATTCGTGCT 


960 


GGATTTTGTC 


CTTGTCCTGC 


TTGTAGTACG 


TTACCGATGA 


TAACTTCATC 


AATCTCACTT 


1020 


GGATTCAAAC 


CCGTCTCTTT 


AATAATATGT 


TCTATTAAAG 


TCGCACCTAA 


ATCATAGGCT 


1080 


GGCACGTCTT 


TAAACGCACC 


TCCAAAAACG 


CCAATAGGTG 


TCCTGTATGC 


TGCTGCTAAT 


1140 


ACGACTCTCG 


TCATGTCATA 


TTCATCTCCT 


TATGATATAT 


ACAATTTTAT 


TACTAATACG 


1200 


TCTAATGAAC 


CAATACGCCA 


CTAACATCCA 


ATTTATGAAA 


CCAATTGCTA 


CGTATGTTAC 


1260 


ATTTTCATTC 


TTGTTATGCA 


TTTAACAAAT 


CATATTATAC 


AATGATGTAA 


TTTGTTTGGG 


1320 


CAATCGGAAA 


CGTAAACGGT 


TTCACTTTAA 


CATTTCACAC 


TCTTAAATAT 


GCTCTATCAT 


1380 


CCTCATCTAT 


AGTACAAAAT 


TATGCTTTGA 


TTTACCACAA 


ATTATCATTA 


ATCATTTCGC 


1440 



938 



WO 97/30070 



PCT/US97/02318 



TCGTGCCTCT AATTAAATTC TTTCTATCAT TGGATCGNTA TCACAGGTGG NTTCACCACA_ 1500 
ATTCTCATCA ATTT 1514 

(2) INFORMATION FOR SEQ ID NO: 1101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101: 



CGGTATTTCA 


ATCAGCGCCT 


CGTTCGGAGT 


GCGCTCAAAC 


GGTGCGAAAT 


ATGAAGCAGA 


60 


AACCAAACTA 


TATCCGAATC 


CTAAGACATC 


AATATATCGC 


TTTAGGTTGA 


AGACTTCAAA 


120 


GCATTGTGCC 


AAGGACAATT 


AGATGCGCAC 


AGACGGGGGC 


ATTTTATTAG 


CGACCATGAT 


180 


TATCATATCG 


CCTTAAATAT 


CGCCACAATT 


TTAGCGGGTG 


GTGATTTACC 


AAGAAATACA 


240 


TTTATAAATC 


AACGTTACAT 


TCAATCGTTG 


GAGAAAATTG 


GCTTTATTGA 


CTTACTAAAA 


300 


ATCTAAAAAA 


TCATATGGAA 


AGAATTGCAC 


ATATGTTAAA 


AACTGGTAAG 


CCATTACGTA 


360 


ATTAAAAGAT 


AGTCATTAAG 


AGAGGATGAT 


AACCATGCAA 


GAAGCATACA 


TTGTAGCTTA 


420 


TGGGGCGTTC 


AGCCGCAGCG 


AAAGCAAAGC 


AAGGCGCATT 


ATTCCACGAA 


AGACCTGATG 


480 


ATGTCGCAGC 


CAAAGTATTA 


CAAGGCGTAT 


TGAAACGTAT 


TGACGGAAAA 


TTCAATAAGN 


540 


ATATGATTGA 


AGATGTCATT 


GTTGGTACGG 


CTTTTCCAGA 


AGGATTACAA 


GGACAAAACA 


600 


TTGCACGAAC 


GATTGCATTG 


CGTGCGGGAT 


TATCTGACAC 


GGTACCGGGT 


CAAACAGTGA 


660 


ATCGNTACTG 


CTCATCAGGA 


TTACAAACCA 


TCGCGATTGC 


AGCCAATCAA 


ATTATGGCTG 


720 


GTCAAGGAGA 


TATACTTGTA 


GCTGGTGGCG 


TTGAATTGAT 


GAGTGCCGTA 


CCAATGGGTG 


780 


NCAACGAGCC 


CACAAACAAT 


CCAACCTTAC 


AATATGATGA 


TATAGGTGCG 


TCATATCCTA 


840 


TGGGTTTAAC 


TGCTGAAAAT 


GTAGCATCCC 


AATTTGACGT 


ATCACGCTCG 


AAGATGCCAA 


900 


GATGCTTATG 


CTGTCAGAAG 


TCATCAACGT 


GCCTATGACG 


CACAACGTGA 


TGGTCGGTTC 


960 


AAAGATGAAA 


TTATTCCAAT 


ACGGGTAAAC 


TCAGTTGAAT 


ATACAAACGC 


AGGACCAAAA 


1020 


GTCTCGTGCC 












1030 



(2) INFORMATION FOR SEQ ID NO:1102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102: 



GGACTACTAA 


AACAAGTAAT 


CAAATCATTC 


CAAGANGTAT 


TAAGCNCCCT 


ATGATCCAAA 


60 


GCCCAACTCG 


NNAAAGAAAT 


TGCGTTTCGC 


NTCAATTTGG 


CAGGAAGTAG 


ATAGCACATT 


120 


ACAAAGGACA 


ACCGGGAGAC 


ACTTTGTNAT 


TCCCAGCATT 


CGTTTATAAG 


CGGAGACGCA 


180 


CAAGTAGTTG 


CAGAGGGCGA 


NGAAAATCCC 


TCCTGACATT 


TTAGANACGC 


NAAAACGTGA 


240 


GGCTAAAATC 


CGTAAAATTG 


CTAAAGGTAC 


ATCTATCACA 


GATGAGGCTT 


TATTAAGTGG 


300 


NTACGGAGAC 


CCTCNAGGCG 


ANCAAGTACG 


TCAACACGGT 


TTAGCACATG 


CTAACACAGT 


360 


TGATAATGAT 


GTATTAGAGG 


CTTTAATGGG 


AGCTAAACTT 


ACTGTTAATG 


CGGACATCAC 


420 


TAAGTTAAAC 


GGCTTACAAT 


CAGCAATCGA 


CAAATTTAAC 


GATGAAGACT 


TAGAACCAAT 


480 


GGTTTTATTT 


GTTAATCCAC 


TTGATGCTGG 


TAAGTTACGC 


TCGGAGATGC 


ATCAACTAAC 


540 


TTTACACGTG 


CAACCGAATT 


AGGCGATGAC 


ATCATCGTTA 


AAGGTGCGTG 


TGGCGAAGCT 


600 


CTAGGTGCTA 


TCATTGTACG 


TACTAATAAG 


TTAGAAGCTG 


GCACAGCTAT 


TTTAGCTAAA 


660 


AAAGGTGCAG 


TTAAATTAAT 


CTTGAAACGT 


GATTTCTTCT 


TAGAAGTAGC 


GCGTGACGCA 


720 


TCAACAAAAA 


CAACTGCATT 


ATACAGTGAT 


AAGCACTATG 


TAGCATATTT 


ATATGATGAA 


780 


TCTAAAGCAG 


TGAAAATCAC 


TAAAGGTTCT 


GGAAGCTTAG 


AAATGTAATA 


GGAGGTAGTG 


840 


ACATATGTAT 


AAAGTAATCG 


AACGTTTTGA 


AGATGCACAA 


GACAATGGAC 


ATGAATATCA 


900 


AGTGGGAGAC 


ATTTACCCAC 


GTGATGGGTT 


AGAAGTATCA 


GAAGAACGGT 


TCACTGAATT 


960 


ATCTACAACA 


AACAACCGCC 


GTAACTTAAT 


CGCTATCAAA 


CTTGTTGAAG 


ACGATACAAC 


1020 


AGAACAGTCT 


GAGGCGAGCG 


CTGACGAGCA 


AAAAAGTTTA 


TCTGATATGA 


AAGTAGCAGA 


1080 


ATTAANAGAA 


CTTGCTAAAA 


AGCGTGAAAT 


TAAAGGCTAT 


AGCGATATGA 


AAAAAGATGA 


1140 


GCTTATCAAA 


GCTTTAGAGG 


GTGTTAAGTT 


ATGGACGCAA 


AAGACGTCAA 


AATGATTAAT 


1200 


GGACTTTCAC 


TCAATGATCC 


GCTCTACGAT 


GAGCAGGTCC 


GATNTCTATG 


AAGATNTTTA 


1260 


GGGGGTGGAG 


GAGGATTATT 


GGNTCAGAGG 


TTGATGACAA 


GAGTGCCGCG 


GGTGTTAGAG 


1320 


GTTATGTTGA 


TGTTCAAGTG 


GGCAACTGGC 


AATTCTCAGG 






1360 



(2) INFORMATION FOR SEQ ID NO: 1103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:1103: 

GGCACGAGCT CAAGTAAAGT GCTACCATTT TTCTTACAGA CATTTGCCGA NATCTTAAAC 60 

TAAGACATAN CCCCTAAAAT ACTCNCTANA GNACCGNACA ACCNCAACAC AATTCCTGTC 120 

ATGATCTTAG GCTTTGTGCC ACCTNCCCCA GTTGNCAAAT GCTTTGTCCA TCCTTCTTGT 180 

AATGACTGAA ATCCTTCTTG ATACATNCGA AATGCAACAA ATGGAAATCC TTCATAAACT 240 
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GTTACGGGCA 


ATGATTGTGA 


AGTAGATGCA 


CTTCCTAAAG 


CAAATCCTTC 


AATAAAAATG 


300 


ACGGGTCGCA 


CTTTTATGAC 


CTCCAGTTGC 


ATAATAATCT 


TCTTTATTTG 


TTAATGTCAC 


360 


CGGTCCAAAT 


GCATGCTGGT 


TAGTCCGCAC 


CGTCGTCTAA 


GGTANAAAAT 


ACATTCATAC 


420 


CACCGACTTG 


TCATTAAATT 


AAATATCGCT 


GAAAACCCTT 


CGTAGAAACG 


CTTTGrrATA 


480 


TGATAAGGCT 


GTACGCTTAA 


CAATCCTTTT 


TCACCTTGTA 


ATTGATACTG 


ATTAATCAAC 


540 


GTTTCAACAG 


CATCTTTCCT 


TAAGAAAGTT 


ACATCAGCAT 


CTACAAAGGC 


AATACGATTC 


600 


GTACATGCAT 


GCGTCACACC 


TTGATAACAA 


GCATGTGATT 


TCCCCATACC 


ATTTGCCGTC 


660 


AGTATCATCA 


ACAACATCTA 


CTACAGTCGC 


ACCATATGAA 


CGTGCTACAC 


GTGCTGTTTC 


720 


» rp/' H 1 v/Tr»/*»/-'» » rp 


CLaj 1\JA 11 A— A 


I AALAA IajAL 


A IX. 1 AC 1 vj<jA 


ACTTGCTGTT 




/ OU 


ATGTAGTAAA 


TGACCTATTC 


TTTTTTCTTC 


ATTTCTAGCC 


GGAATAATAA 


TTGTTAATGC 


840 


TTTATGATTG 


AAGTTCAGCG 


TTTTCGTCTT 


TAGCTGATGT 


CTACGATTAA 


ATATCAATGC 


900 


ACCACACGCC 


ATAGACATGG 


TCACTATTAC 


TGTTAATATT 


CGTGATAACC 


ATTTCATAGA 


960 


TATCACTTCC 


TATTCTTCGC 


TTCTCGCGCG 


AGCCCGNGCC 






1000 



(2) INFORMATION FOR SEQ ID NO: 1104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 916 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104: 



AAATCCTGCG 


CCTTGCATCN 


CCTTNCCATA 


AAAANCCTCC 


CCGTCAGCAG 


CGGATAGCCC 


60 


CCGCCCAATC 


AATACCCAGG 


AACCAAAGAC 


NGGGCTTGTC 


AAGCGATACT 


ANCAGCGATA 


120 


TATCCAGCTA 


GTATTGGAGC 


CATAAATTTA 


AAGGCTAAAC 


TACCCAATGT 


TTTCAATGGA 


180 


TTTCCCACAA 


TGAATCATCT 


GGGATGACTA 


ATCCTTTTGA 




CCGCCTAGAG 


240 


TCAGCGCGAT 


GNCGATAAGG 


AGTCCACCAA 


CTCCGATAAA 


AGGANCCATA 


AACGATACAC 


300 


CGTTCATTAA 


ATGTTGATAC 


ACCATTTGAA 


TACCACTTTT 


AGACTTACCA 


GCGATCTTTC 


360 


GAATGATAAT 


TTGTTTCAGA 


TTGATAAATA 


GGCGCATCTT 


GATTAATGAT 


ACGTTGAATT 


420 


AGACCTCTCG 


GATTATGAAG 


CCCTACGAGC 


GAACATTTTC 


ATTAATCAAC 


CGTTTACCAA 


480 


CAAATCGGGA 


TAGATCAACT 


TGTTTATCAG 


CTGCAATTAT 


GACACCGTCA 


GCTTCTTCGA 


540 


TGTCTTGCGT 


AGTTAAAACA 


TTTTCAGCAC 


CAACACCGCC 




ACTTTAATAT 


600 


CCACACCCAT 


TTCTTTTGCT 


ACCTGCTCAA 


GCTTTTCTTG 


AGCCATATAT 


GTATGTGCAA 


660 


TGCCATTTGG 


GCATGAGGTA 


ATAGCTACAA 


TTTTCATAAA 


ATCATCTCCT 


TTTCTATATT 


720 


GTAAGCGTAT 


TCTCGATACT 


AAAAAAAGAA 


TAATTACCGT 


TACTAGTGGC 


AATTATTCTT 


780 


GTAAGTATTC 


AAATAACTGT 


TGCTTTAAAC 


TATGATCATC 


TAAACTACAT 


AAATGGTTCA 


840 


CTGAATCATC 


ATCCAAGTTA 


GCAATTAATT 


GCATCATTTG 


TTTTGTAAAA 


GCTTTGTCTT 


900 


TATGCGAAAT 


CGCTAA 










916 
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(2) INFORMATION FOR SEQ ID NO: 1105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105: 



GGCACGAGCT 


ATATCCAATT 


GGGGTAAAGC 


GGGTAAAGAC 


CAAGAGAAGA ATTTAAGAAG 


60 


ACATTAGCAG 


AAATTGAAAG 


GACACCGGCA 


CGAGCTAGCG 


CAACAAGTTT 


AGCGATTGAA 


120 


GCATTTGGTG 


CAAAAGCAGG 


TCCTGATTTA 


GCAGATGCTA 


TTAAAGGCGG 


TCGCTTTAGT 


180 


TACCAAGAGT 


TCTTAAAAAC 


TATCGAAGAT 


TCGCAAGGAA 


CGGTCAATCA GACATTTAAA 


240 


GATTCTGAAA 


GTGGCTCCGA 


AAGATTTAAA 


GTAGCAATGA 


ATAAACTTAA 


ATTAGTAGGT 


300 


GCTGATGTAT 


GGGCTTCTAT 


TGAAAGTGCG 


TTTGCTCCAG 


TCATGGAAGA 


ATTAATCAAA 


360 


AAGCTATCTG 


TAGCAGTTGA 


TTGGTTTTCA 


AGTTTAAGTG 


ATGGATTTAA 


AAGGTCGATT 


420 


GTTATATTCG 


GTGGTATTGC 


TGCTGCAATT 


GGCCCTGTAG 


TTTTTGGGAT 


TAGGTGCATT 


480 


CATAAGCACA 


ATTGGCAACG 


CAGTAACTGT 


TTTAGCTCCA 


TTATTAGCTA 


GTATTGTAAA 


540 


GGCTGGCGGA 


TTGATTAGTT 


TTTTATCAAC 


TAAAGTGCCT 


ATTTTAGGAA 


CAGTCTTCAC 


600 


AGCATTAACT 


GGTCCAATTG 


GTATCGTGTT 


AGGTGTACTG 


GCTGGTTTAG 


CAGTCGCATT 


660 


TACAATAGCT 


TATAAGAAAT 


CTGAAACATT 


CAGAAATTGT 


GTTAATGGTG 


CAATTAACAG 


720 


TGTTAAACAA 


ACGTTTAGTA 


ATTTCATTCA 


ATTTATCCAA 


CCTTACATTG 


ATTCCGTTAA 


780 


AAACGTCTTT 


AAACAAGCGG 


TTTCAGCAAT 


CGGTGATTTC 


GCTAAAGATA 


TTTGGAGTCA 


840 


AATTAATGGA 


TTCTTTAATG 


AAAACGGAAT 


CTCTATTGCA 


CAAGCGCTTC 


AAAATATATG 


900 


CAATTTTATC 


AAAGCTATAT 


TTGAATTTAT 


CTTAAAATTT 


TGTAATTAAA 


CCAATCATGT 


960 


GTGCGATTTG 


GCAAGTGATG 


CAATTTATTT 


GGCCGGCGGC 


TAAAGCCTTG 


ATCGTCAAGT 


1020 


ACTTGGGAGA 


ATATAAAAGA 


GTAATAC 








1047 



(2) INFORMATION FOR SEQ ID NO: 1106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106: 
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TGTAATACCA 


GCAl IV1X3CC 


TGAACATGGT 


f*r* x fvviswfWTi 
WjAL CGCCI I 


mx mm A A A T* A A 
TA II AAA1 AA 


f*+ x m x iv xmx x /~* 

GAT AAA i AAV? 


C f\ 

oU 


ANTGNANTTA 


TTGAAATAAC 


CTGCATATTG 


CATAATTAAC 


AATGGCGCTG 


nv^/^^wnn x jv m 

TGGCGTAAAT 




GACATGCGGG 


AATGTGATGT 


TTCTAAACTT 


TTGCCAACTA 


GACGC ACC AT 


CCATATCTGC 


180 


TGCTTCGTAC 


CAATCTGATG 


AAATACTTTG 


CAGTACTCCA 


GTGAACAGTG 


y^x Tk iv /"^ ik n iv iv 

CAAAGACAAA 


240 


TGGGAATCCA 


AGCCATACTT 


GAATGCCGAT 


TAATGCCACT 


qvrvi v/*^/*v F *»r^X X x 


ACGGATCACT 


iUU 


TAAuCATUC 1 


GGTCvCTACAC 


pms x rpx x jx r^r* 
C1AAI AAAGO 


ninv^r^ x x *. ivm x 
X I\jLAAAAfA 


TCATTATTTA 


nv^r > /^ x r^c* x a A 
IT^GCACLAAA 


JbU 


TTCATCATTA 


iv ik rr\ tv ik r*r*r*tn x 

AATAACGCTA 


/™» j\ * x mx nun a a 

CAAATATTAA 


x iv 1 1 i^i/ttv' x /"^ x 

AATTGTCACA 


x x nv^ x nv^/rn x 

AATGATGGTA 


CAGCCCAAGG 


A O 


TAGGATTAAC 


ik #~i X /*"»T»X f~*fT"* IV 

ACAGTACGGA 


TAAATTTCTT 


ACCTTTGACG 


x iv r'^VfrvN x m 

ACAGGGTGAT 


mmx iv iv miv x m 

TTACAATAAT 


480 


TGCCAAAAAC 


AGCCCTAATG 


CAATTTGAAG 


TGTCGTTGCA 


ACAAGCGTCC 


ATACTAATGT 


540 


CCAAGTAATA 


ACACTGAAAA 


ATGTTTTGCG 


CCAAACGCCA 


ATTGTGAATA 


ACGTTTTAAA 


600 


GTTATCTAAA 


CCAACCCATT 


CTAATGTGTG 


TCTCGGAGGC 


GCGTTGTATA 


AATTGTAGTT 


660 


TGTAAATGCT 


ACTCCAAACA 


rn iv iv iv mx mmx iv 

TAAATAl rAA 


TGGAAATACA 


ACTACAAATA 


TCAATAAGAA 


720 


TGTACCTGGT 


GAGATTAGTA 


AGTATGGGAA 


CGTCTTGTCC 


CATGTTGCCA 


CCATACGCGC 


780 


CTGCCGATCC 


TTTATTTCCT 


CATTGCGATT 


AAATCGTTCA 


GCATTACGAT 


ATGCATCTAA 


840 


AATATTAATG 


ACATATAGCA 


TGATGGCGAA 


AGCAACGAGT 


AAGATAGAAA 


TAATACCTTG 


900 


TGCAAGTAAG 


ACACGAGAAT 


CGTCTAACTT 


AGGTACTGTC 


CCTAATGTGA 


ACAATCCCCA 


960 


AAAACCAATA 


TTTAAAAAGC 


TATAAAATAC 


AGAAATAAAA 


CTGATGAAAA 


AGATAAAAAA 


1020 


TATCGTACCT 


TTAATGGGTC 


TTTTATTATA 


AAACTGTCCC 






1060 



(2) INFORMATION FOR SEQ ID NO: 1107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107: 



TAAAATAGTG 




CTCGTCCAAC 


ACCTAATTGC 


GCATCAATAT 


CCCATTGCNC 


60 


CCGGATGGGG 


TNCCTTCTTA 


CCTCCTACCA 


AGTTCCTTGA 


AAAATACNTA 


TTGAATCTCC 


120 


TTCTNTCGTT 


GTATCCACAC 


TTGACGGTTG 


TAATTTAAAT 


AACCGTCTCA 


TCTTTCCATT 


180 


TGCGGAATCT 


TTGATNATTT 


GTAATATTTT 


CCACAATCCA 


AAGTCAACTT 


TATGGGGTAT 


240 


CCAATCCACG 


CTTTAATATG 


GTAATTCTCC 


AGCAATCGCA 


CCTTTTAAAT 


CATTATTTTC 


300 


AATGACTTTA 


TTCTGTTTAA 


AATCAACAAG 


CAATAACTTT 


CCAGGATTCA ATTGACCTTT 


360 


AAAAGCAACA 


TTACTTTCAG 


GTACGTCCAC 


AACACCCACT 


TCAGATGAGA 


AGACAATAAA 


420 


GTTATCTTTA 


GTAATCGTAT 


AACGACCTGG 


ACGTAATCCA 


TTTCTATCTG 


TAAGCGCGCC 


480 


AAGTTTGTCA 


CCGTTACAGA 


NCGAAATCAT 


TGTAGGACCA 


TCCCACGGTT 


CCATTAAATA 


540 


ACTATAAAAT 


TCATAAAACG 


CACGTACATT 


TGCATCATTC 


GCTTCATTGT 


ATAACCAAGG 


600 


TTCAGGTATG 


AGTAGCATTC 


GCTGCCTTTT 


CTGGCTCCAT 


GGCTAACGAT 


AAGAACTCTA 


660 


GCGCATTATC 


TACAATAGCA 


GAGTCACTAC 


CATCCTCATC 


GACAATTTGA 


AACACTTTAT 


720 
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GTTGATCCTC 


GCCAAATAAT 


GTTTCGATTA 


AATTTATGTN 


GGCGTGCTCG 


CATCCAGTTT_ 


780 


ACATTACCTT 


TAATCGTGTT 


AATCTCACCC 


ATNATGCATT 


AACAATACGG 


TTAGGATGTG 


840 


CCCTTTTCCC 


AACTCGGGAA 


TGTATTCGTA 


CTAAATCTCG 


AATGCACTAA 


CCCTAGCTTT 


900 


GATTGATATA 


AATCATCCGA 


TAAATCTGNA 


TATAGTTTTT 


TAATTTGGTC 


TGATCGTAAC 


960 


CAACCTTTAT 


AAACAATTGT 


TTTGCGTGAT 


AAGCTCGTAA 


AANACAATTC 


TAAATCGCAC 


1020 


TGAGTCGAAT 


AGAACTCTAA 


TTGTTTTCTC 


GCTAAAAACA 


AACGCTTTTC 


AACATCTTCA 


1080 


ATGCCCCTAA 


TATCAATAAA 


CACTTGTTGA 


ATGACTGGCA 


TCGTATCTGC 


TACATGTTTA 


1140 


GCAATGGCAT 


CTTTATTAAC 


TGGTACATTN 


CGATAACCAA 


GAATTGATAA 


CCCTTCGCCT 


1200 


TCAAAATATT 


TTTTAAAAAC 


TGCTTCATGT 


TCAGAACCTA 


AAATGCGTTC 


TTTGGAAAAA 


1260 


AATAACCCCA 


CGGCATATTC 


ACCTTCACCT 


GGGATATCAA 


AGTCCGTTAC 


ATGTTGTTTG 


1320 


AAAAATGCAA 


AAGGTATTTC 


AGTCATAATA 


CCTGCGCCAT 


CACCAGTGAT 


GCCATCTGCG 


1380 


CCGACCCCGC 


CCCTGTGATC 


TAAGCGTCGC 


AACATTTCAA 


GCGATTTATC 


AATGATGTCG 


1440 


TGAGACCTTT 


TATTATCCAT 


ATTCGCATAA 


AAACCAATAC 


CACACGCATC 


ATGTTCCTCA 


1500 


CGATAGTCAT 


ATAAGCCTTT 


AATTAATTTC 


TCAG 






1534 



(2) INFORMATION FOR SEQ ID NO:1108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108: 



AGTTCTTCCT 


TTCGTTAATT 


GATTTCCCCT 


ANNAAATCCA 


TAAACTTTGC 


NCGGGACCCA 


60 


AAGCCANCCC 


GCGACAAGCT 


GCAACTTCGG 


ATCCCTAATC 


CCACCGTGAT 


GGGTGGTAAA 


120 


ATTCGTTTGG 


GATCGTTACT 


GCTAGCGGCT 


CTGTAATATT 


ATTTAATTCT 


ACAAANCGGT 


180 


TCACCAAAGA 


CTTAAGATGG 


TCAGGCGCAT 


CATATAACAA 


TACCATCGTA 


AACATCGCTT 


240 


TTAATAGACG 


AATAGTTCCC 


GCTCTCTAAG 


GCTTCTATTA 


AAACTTTGAT 


TTTACCTGCG 


300 


TTAAACGGTA 


CTGCAATAGC 


AGGCTCTCCA 


AATGTAACAG 


CATGTTCTCC 


TATTAAAATA 


360 


ATCTTACCTG 


TCGATTCCCC 


ATATCCTTTT 


CTTGTCATGT 


CAATATCACC 


TTTTATATTT 


420 


ATCCTATACT 


TGATTCATTA 


TTTTTATTTA 


TTAGTAAAAG 


ACATCATATT 


CTAAGTTGCA 


480 


TACGCATTCG 


CGTTAAATTT 


CATTGCAGTC 


TTTATCTCAC 


ATTATTCATA 


TTATGTATAA 


540 


TCTTTATTTT 


GAATTTATAT 


TTGACTTAAC 


TTGATTAGTA 


TAAAACTAAC 


TTTCGTTTAC 


600 


TTCAAAGTTT 


AAATCTTATC 


GAGTGATATT 


TCAGATTCTT 


TATCTTTTTA 


TAAAATAGCC 


660 


CTACAATTTA 


TAATTTTCCA 


CCCTAACTAT 


AATACTACAA 


ATAATAATTG 


GAATATATAG 


720 


ATTTACTACT 


AAAGTATTAG 


AACATTTCAA 


TAGAAGGTCG 


TTTCTTTCAT 


AGTCATACGC 


780 


ATTATATATA 


CCCTATTCTC 


AATCTATTTA 


ATACGTAAAA 


CATGAAATTT 


TCTTATTAAA 


840 


TTTATTATTT 


CCATCATATC 


ATTACTTTTA 


ATTTAATGAT 


GTTCAATTTA 


AATATGAGGG 


900 


TCAATAACAT 


ATTTATGCCT 


TTTTATGGAT 


ACTTTTCAAA 


AATAACAGCC 


CCAAACGATA 


960 
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ACTTGAAAGG GGGCTGTGAA ATATTTAACT ATTGCATTTG GGTCTATCAT TTNCTTGTTT_ 1020 

CTATCAATCA TTTTATCAAA ATACGTATCG TATCTTTGGG CATTCTTCTT GAGTAATGGG 1080 

CGTCATATTT TAATACAACC GCCAAGATCT TTGATTGGAT ATTAATAGGC AAAAACATTA 1140 

CATCTTGGAC AGTAATCGTT TGGTTAAGCA ATGTCTCTAA TG 1182 

(2) INFORMATION FOR SEQ ID NO: 1109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109: 



CAGTTCATCA 


TGGATGCTTT 


GAGCGGTATC 


CAAGTGTGGA 


TTTATTACAC 


ATATTGTATA 


60 


AGACATGTCA 


CCATTATTTA 


TTATCACACA 


TCACTAAGTC 


ATGTATTGAA 


ATATTGTCAG 


120 


TACATTTATG 


TTTAAAGAAG 


TTCAAATCAT 


TGAACGAGGT 


AATATTAATC 


ATTTCAAGTA 


180 


TGAGCATTTG 


CATCCGTATA 


CTGAACGTCT 


AATTAAATAT 


AGAACACAAT 


TAAAGAGGGA 


240 


TTACTATGAT 


TGAGTTAAAA 


CATGTGACTT 


TTGGTTATAA 


TAAAAAGCAG 


ATGGTGCTAC 


300 


AAGATATCAA 


TATTACTATA 


CCTGATGGAG 


AAAATGTTGG 


TATTTTAGGC 


GAAAGTGCCT 


360 


GTGGTAAAAG 


TACGCTCGCT 


TCATTGGTTC 


TTGGCTTGTT 


TAAACCTGTT 


AAAGGAGAAA 


420 


TTTACTTAAG 


TGATAATGCT 


GTGTTAACGA 


TTTTCCAACA 


CCCTTTAACT 


AGCTTTAACC 


480 


CTGATTGGAC 


GATTGAGACC 


TCATTAAAAG 


AAGCGTTATA 


TTATTACAGA 


GGCCTAACTG 


540 


ATAATACTGC 


TCAGGATCAA 


TTATTATTAC 


AACATTTATC 


TACTTTTGAG 


TTAAACGCGC 


600 


AATTATTGAC 


TAAATTACCA 


AGCGAAGTGA 


GTGGCGGACA 


ATNACAAAGA 


TTTAATGTCA 


660 


TGCGTTCGTT 


ATTAGCAGAG 


CCTCGCGTTT 


TAATATGTGA 


TGAGATAACT 


TCAAATTTAG 


720 


ATGTTATAGC 


TGANCAAAAT 


GTAATCAATA 


TANTAAAAGC 


GCAAACGGTT 


NCGGAACTTA 


780 


AATCATTTTA 


TCGTTATTTC 


TCATGATTTA 


ATCCGTGTTA 


CAACGCTTAG 


TTAATAGAAT 


840 


TATCGTTCTT 


AAGGATGGCA 


TGATAGTCGA 


TGATTTTGCA 


ATAGAGGAAT 


TATTTAATGT 


900 


TGATAGACAC 


CCTTATACAA 


AAGAATTAGT 


GCAAACATTT 


TCATATTAGC 


TATTTAAGAA 


960 


TGCGATAATT 


CTAGACTTGT 


TATAAAATAT 


AGATGAATCA 


AGTATATTAA 


TCTAGACACT 


1020 


TATCGATTTT 


ATTTTCTTTA 


TTTAAAAATA 


ATAATAAAAA 


GGAGTATCAT 


TAATGGGATT 


1080 


ACTTTGATAT 


TGCCAAGTAT 


TCGTTCTAAA 


GAAAGAGGCT 


TTAATTATTA 


TCCAAAGTGA 


1140 


ATGCGTCATT 


AACTTAAAAT 


CATTTTCAGA 


AACGCAGCAG 


GAGGCTGAAG 


TAAAGGNCAG 


1200 


TGGCAACAAA 


GTATATCGTT 


GTTATATTGA 


TATGGAACAT 


CCTAGAAAAT 


CCAAATGTAA 


1260 


TTGTCCTCAT 


GCTGATGGAA 


GACGAGTGAT 


ATGTAAACAT 


ATGATTGCAT 


TACTCTTTAC 


1320 


AGCTAGTCCA 


GAAGCAGCAA 


ATAAACATAT 


AATGATGTTA 


AACGAAGTTG 


AAGAAGACTA 


1380 


TCAATTACGC 


AGAAATATGT 


GGATTG 








1406 



(2) INFORMATION FOR SEQ ID NO: 1110: 

945 



WO 97/30070 



PCT/US97/02318 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1167 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110: 



ATATGATATT 


AAAATGCAGG 


AACGTATTTA 


GTACGAACGT 


AAAATTAATG 


ATTTAAAATG 


60 


CTAGTATGTA 


TATGATTTTG 


ATAAATAAAT 


GCTTTTTAAC 


GTAAATCAAG 


TTTGATACAG 


120 


AAAGGACTAA 


ATCAAAACAT 


TTATTCGTTG 


TAATAACGTT 


TAAATAACTT 


TATTAAAAAG 


180 


TCATAATAGT 


GTTAAAATGT 


ATTGACGAAT 


AAAAAGTTAG 


TTAAAACTGG 


GATTAGATAT 


240 


TCTATCCGTT 


AAATTAATTA 


TTATAAGGAG 


TTATCTTAAC 


ATGTTAAATC 


TTGAAAACAA 


300 


AACATATGTC 


ATCATGGGAA 


TCGCTAATAA 


GCGTAGTATT 


GCTTTTGGTG 


TCGCTAAAGT 


360 


TTTAAGATCA 


ATTAGGTGCT 


AAATTAGTAT 


TTACTTACCG 


TAAAGAACGT 


AGCCGTAAAG 


420 


AGCTTGAAAA 


ATTATTAGAA 


CAATTAAATC 


AACCAGAAGC 


GCACTTATAT 


CAAATTGATG 


480 


TTCAAAGCGA 


TGAAGAGGTT 


ATTAATGGTT 


TTGAGCAAAT 


TGGTAAAGAT 


GTTGGCAATA 


540 


TTGATGGTGT 


ATATCATTCA 


ATCGCATTTG 


CTAATATGGA 


AGACTTACGC 


GGACGCTTTT 


600 


CTGAAACTTC 


ACGTGAAGGC 


TTCTTGTTAG 


CTCAAGACAT 


TAGTTCTTAC 


TCATTAACAA 


660 


TTGTGGCTCA 


TGAAGCTAAA 


AAATTAATGC 


CAGAAGGTGG 


TAGCATTGTT 


GCAACAACAT 


720 


ATTTAGGTGG 


CGAATTCGCA 


GTTCAAAATT 


ATAATGTGAT 


GGGTGTTGCT 


AAAGCGAGCT 


780 


TAGAAGCAAA 


TGTTAAATAT 


TTAGCATTAG 


ACTTAGGTCC 


TGATAATATT 


CGCGTTAATG 


840 


CAATTTCAGC 


TGGTCCAATC 


CGTACATTAA 


GTGCAAAAGG 


TGTGGGTGGT 


TTCAATACAA 


900 


TTCTTAAAGA 


AATCGAAGAG 


CGTGCACCTT 


TAAAACGTAA 


CGTTGATCAA 


GTAGAAGTAG 


960 


GTAAAACAGC 


GGCTTACTTA 


TTAAGTGACT 


TATCAAGTGG 


CGTTACAGGT 


GAAAATATTC 


1020 


ATGTAGATAG 


CGGATTCCAC 


GCAATTAATA 


NTATCATTCA 


ACAGCTTGGG 


TTCACCGTTT 


1080 


ATCATATATT 


GTGGAGCAAA 


AGCTTTNTGG 


GTTTTATTAA 


TAATCNGGCT 


GATGGAAAAT 


1140 


TATTGGATAT 


TTCAACCTGA 


CTTGATT 








1167 



(2) INFORMATION FOR SEQ ID NO: 1111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111: 

946 



WO 97/30070 



PCT/US97/02318 





PGGPTAAATC 


ATTACGTGCT 


CACAAATGGA 


GATATTATCT 


TCTCAGGTTG 


60 


CAGAGCATCG 


CACAGCTCAA 


AATTATGCTG 


AAGTACAAGT 


TAAGATTAAG 


ATAATCAATT . 


120 


CTCNAAAGCT 


CAGTGTTGTT 


GTAAACCGAA 


GTTATTGTAA 


CCAAGAAGAT 


TGTATCGAAG 


180 


TGGTGTAAGT 


GAGTAPTACA 

v*x»^j x n^* x «m^xx 


TAAATAATGN 


CCGTGCAAGA 


NTNAAAGATA 


TTGCCGATTT 


240 


ATTTTTAGAT 


TCTGGATTGG 


GAAAAGAAGC 


GTATAGCATT 


ATCTCNCAAG 


GTAGAGTTGA 


300 


TGAAATACTA 


AATGCTAAAC 


CAATTGATAG 

^^*X*» x i\jni«tw 


ACGTCACATT ATTGNAGAAT 


CGGCTGGTGT 


360 


APTTANATAT 


AAAAAACGTA 


AGGCTGAATC 


ATTAAATAAA 


CTTGACCANA 


CAGAAGATAA 


420 


TTTMAPfiAGA 


(TTAGAAfSAPA 
vj x auaaua\_a 


TTTTATATGA 

X X X X f*. X f & X VJXm 


TTTGGCACGA 


GGTCGCGTAG 


AACCTCTAAA 


480 


AGAGGAGGCA 


f5PT A TARCTA 
x x\ ± run* x a 


AAGAATATAA 


GACACTTTCA 


CATCAAATGA 


AACATAGTGA 


540 


P A 'I'll ."^A^l"!' 
LAI IVJlnul X 


APAGTGPAPG 


ATATTGATCA 


ATATACAAAT 


GACAATAGAC 


AATTAGATCA 

X X Xl\M& X Vi* 


600 


APf SH^PTA A AT 
nV>Ul X XAAAX 


fSATTTAPAAG 

Utn XXX ALAAU 


GTPAACAAGC 


AAATAAAGAA 


GCTGACAAGC 


AACGTTTAAG 


660 




PAAPAATATA 
LAnUmlnl A 


AAGGTAAAPG 

AAUUlAAAvw 


ZkPATPA APTT 


AT A A Pf2 ATf2 
ua X nnLbn lu 


TTGAATCGCT 

X X WU) X \«UV X 


720 


Tfc ATTATPAA 


TTAf2TAAAAf2 
X 1AU1AAAAV3 


PTAPGGAAGP 


CTTTGAAAAA 


TATACGGGAC 


AATTAAATGT 

nn .X xn^mxux 


780 




LulliIwuAnn 


AATPAATPTG 
An IVJUil V- XV 


AAACAAATGC 


ACGATATGAA 


GAAGAAPAAG 


RAO 


111 ftTTTA AT 


VVAVal* 11 11A 


d A A A ATATAT 

VsAAAAl nlA X 


CAAATGAGAT 


TTCTGAAGCT 


CNAGATAPTT 


900 


ATAAGTCTCT 
ax aau iviui 


G A AAAGT A AA 
waaaavj x nnn 


PAPANAAGAA 


CTCAATGCTG 


TCATTCGTGA 


ACTTGAAGAA 


960 


PA ATPTATAT 


GTTTPAGAPfi 
ox x luiunuvj 


AAGCACATGA 


TGNNAAATTG 


GAAGAAATTN 


AAAACGAATA 


1020 


CTATACATTA 


ATGTCAGAGC 


AATCAGATGT 


TAACAATGAT 


ATTCGTTTTT 


TAAAGCATAC 


1080 


TATAGAAGAG 


AATGAGGCTA 


AAAAATCAAG 


ACTAGATTCT 


CGATTAGTTG 


AAGTTTTTGA 


1140 


GCAATTGAAA 


GATATTCAGG 


GTCAAATAAA 


AACGACCCAA 


AAAGAATATC 


AACAGACCAA 


1200 


CAAAGAACTT 


TCTTCTGTAG 


GGNAAGAAAT 


TAAAAATAGA 


GAAAGAGATC 


TCTCTGACAC 


1260 


ACAAAAAGCA 


CAATATGAAT 


GCGNAGAGAA 


TTTGTATCAA 


GCATATCGAT 


ATNCCGAAAA 


1320 


AATGAAAACA 


CGTATTGATA 


GTTTGGCAAC 


GCAAGAGGAA 


GAATACACTT 


G 


1371 



(2) INFORMATION FOR SEQ ID NO:1112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112: 



GATGTGTGAN 


TCTATTATNN 


TTTNNAAGAT 


AATTCATAGT 


TATAATTTTN 


ATTNCTGCTA 


60 


TCAGTCATAA 


TCTTAAATAA 


TTTGTGGAAA 


NTTGAACAAG 


TGTNCGNAAG 


TAATTAATAA 


120 


CAGATCGTTG 


AAGTGCAAGT 


AATACCTTTA 


TTTCTAATCC 


CANAAAATAA 


TAGCCACTCT 


180 


AANAAGTAAT 


ATACTCTTAG 


AATGGCTGTN 


AGTTTAAAAT 


TGTCGATTCA 


AAGATAATAT 


240 


TCTACTTCAA 


CATGACATAT 


TAGTTAAGTA 


GTCTATAGTT 


TCCTTGGAAT 


ATTATTTTAA 


300 


CTTTTTAGCG 


TTATATAAAT 


CAGCTAAAGG 


CTTTAATTCA 


TCATAAAATG 


CTTGGAANTC 


360 
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ATCTAAATCC 


ATTTGTTGAC 


CCGCATCACT 


AAGTGCAACA 


GATGGATCTG 


GATGCACCTC. 


420 


AGCCATAACT 


CCATCAGCAC 


CAACTGCTAA 


TGCTGCTTTC 


GCAGTTGGTA 


ACATGATATC 


480 


TTTACGACCT 


GTACTATGCG 


TAACATCTAC 


CATGACTGGT 


AAGTGTGTAC 


CTTGTTTTAA 


540 


AATTGGTACT 


GCTGAAATAT 


CTAAAGTGTT 


ACGTGTCGCC 


TTTTCATAAG 


TTCGGATTCC 


600 


ACGTTCACAt 


AAAATAATGT 


TGTGATTACC 


TTGTGAAGCA 


ATGTATTCAG 


CTGCATAAAC 


660 


AAACTCTTCG 


ATTGGAGCAG 


ATAAACCACG 


TTTTAATAGA 


ATAGGCTTTT 


TCGGTACGGC 


720 


CAGCTTCTTT 


TAATAACTCG 


AAGTTTTGNA 


TATTTACGTG 


CACCAATTTG 


GGAATACGTC 


780 


TTTAAATACT 


CATCAGCCAC 


TTCCAAAAAT 


CAATTTGGGA 


TTTTACGGAT 


TTCGNTTGAC 


840 


AAACAATNTA 


AATCAATATT 


TATCTTTAAT 


CTGTTTAAGT 


ATTTTAAGTC 


CTTCAACACC 


900 


TAGGCCTTGG 


AAATCATATG 


GTGATGTACG 


TGGTTTAAAT 


GCACCGCCAC 


GAATAAATTT 


960 


TTCACCTTTA 


GCATGTGAGT 


TTTTAGGAAC 


AGCTTCAACT 


TGTTCAAATG 


ATTCAACTGA 


1020 


ACATGGCCCA 


AATACAAATG 


ATTTATTGCC 


GTCTCCAATA 


ATGCCCCCAT 


TATCAAATGT 


1080 


TACAATCGTA 


TCTTCAGGTT 


TCAACTTACG 


TGATACGTAT 


AAGTGTTTTC 


ATTTCAGATT 


1140 


TTGTAAATCT 


GTAGAGCTTT 


AAAATTC 








1167 



(2) INFORMATION FOR SEQ ID NO: 1113: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113: 



CTAGCATATA 


TCTANTTTGT 


CAGNTGGGGG 


GATATCAATG 


ATGTATTTAA 


CCCAGTCTAT 


60 


TTTAATCNAT 


TGTAAGATTA 


CCCGAGATTA 


TNAGAAGCAT 


TTGTTGACNA 


GGTATGATAT 


120 


TAACAGTTGC 


AGGTCTTATA 


TTTCAAACCA 


GTTTTAAATA 


ATGCATTGGC 


AGATANCTTT 


180 


ACATTNGGAT 


TGNCAAGCGG 


TGCTACATTT 


GGNTCAGGAT 


TAGCATTATT 


TTTAGGTTTA 


240 


ACAACGTTAT 


GGATTCCTGT 


ATTTTCAATA 


ACATTTAGTT 


TGATAACATT 


AATAACTGTA 


300 


TTAGTCATTA 


CGTCGGTATT 


GAGCCAAGGC 


TATCCAGTTA 


GAATCTTAAT 


ATTAAGTGGT 


360 


TTAATGATTG 


GTGCATTATT 


CAATTCACTT 


CTATATTTTT 


TGATTTTATT 


AAAACCTCGC 


420 


AAATTAAATA 


CAATTGCCAA 


TTATTTGTTT 


GGTGGCTTTG 


GTGATGCAGA 


ATACTCAAAT 


480 


GTATCCATAA 


TAGCAATCAC 


ATTTATCATT 


GCACTGTTTG 


GTATATTTAT 


CATTCTTAAA 


540 


TCNACTAAAG 


TTATTGCAAT 


TAGGAGAACT 


NAAAAGTCAG 


TCACTAGGCT 


TAAATGTTCA 


600 


ATTGATTACA 


TATATCGCGT 


TATGTATAGC 


TTCCNTGATN 


NCGGCGANAA 


ATGTCCCAAA 


660 


TGTTGCCATC 


ATTGGATTCA 


TNGGTATGGG 


TGATACCNCC 


AACTCATAAG 


NAAATGCCAG 


720 


TGGAAACCAA 


TCATTAGGGA 


AGACNATTGC 


CTTTGAATAT 


TGTAATTGGA 


GGACCAAATA 


780 


ATGGTTATGG 


CAAGATTTTA 


ATTGGGTAAC 


CCAATATACT 


TGTCACCCAG 


TACCAAATAC 


840 


CCGGCAAGCA 


TTATTATTTG 


CATTAATTTG 


GTATACCAGG 


TTATTTTACA 


TGCTAATATC 


900 


TCAGTCGAAA 


CGGTTACACT 


AGCACATGAC 


ATTTGCTAAA 


ATAAAAATAA 


CTATAAACAT 


960 
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AAAGAGGGCA TAAGCGATGG ATTTGAATCA AATTNAAGCA GTTGTATTTG ACTTAGAAGG. 1020 

TACGTTGTTG GACAGAGTTA AATCTCGAGA GAAATTNATC GAAGAGCAAT ATGAACGANT 1080 

NCATGACTAC TTANTNCATG TNCAACTGGC AGATTTTAAA AAAGCATTTA TTGAGCTAGA 1140 

TGACGATGAA GATAATGATA AACCTGATTT AGATAAAGAA ATCATTAAAC GTTTCCATGT 1200 

CAGATAGGTT AACTTGGAAA GACTTATTTA CTGATTG 1237 

(2) INFORMATION FOR SEQ ID NO: 1114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114: 



GGCACGAGAC 


AATATATV*TA 


fWITfiA ATA A 




AGTGAAGTTG 


TAGCCGCCAT 


60 


ACLiCATTTCA 


TTTTCATCAG 


TTCTGCCATA 


AATCAATGCA 


TAGTCGGCAA 


TTTGAGCCAC 


120 


GTGCCGATTA 


TTCGGCACGA 


GCTGTACTAG 


ATATAGTTAT 


GATGGGAATA 


CTGTAATGTG 


180 


TGGCCACCTG 


TGCAATTGAC 


TGCAATTCAC 


TATGACTACC 


TTGATTCGTC 


ACAAAAATCA 


240 


TGCAATCTCT 


ATCATCATGC 


GTCGCAAATG 


TTGACACAAG 


TAAATGCGTT 


TCATGTAATA 


300 


ACCTGACATT 


TAAGCCAATA 


CGAGATAACT 


TTTGAAAAAG 


ATCACCAATA 


GTCAAACTCG 


360 


ATGCGCCAAA 


TCCAAATAAA 


AATATTGTCC 


TGGCATTTTT 


CAACACATCA 


CAAATTGCAT 


420 


CAATTTGCGC 


ATCCATAATA 


TTAGTAGCTA 


CAAATCGCAT 


CCGTATTCCG 


TTGCTCTAGC 


480 


AATCATTTTA 


TTTTTCAAAG 


TTTCTACAGA 


TTCATTTTCA 


ATCAATTCTA 


AATGTGGATT 


540 


GGTTGCAATA 


TCTTCGGGTA 


AGTATCGAGA 


TATCGCAATC 


TTTAGCTCTT 


GAAAACCTTG 


600 


ATGTGTCATT 


TTCCGACTAA 


ATCTAACAAT 


TGATGCTGTA 


CTAACATTCG 


TAACATCTGC 


660 


CAAATCATTC 


ACCGTCATAT 


CAATGATTTT 


ATGTGGATTC 


TTTAAAATGT 


AATCAGCGAT 


720 


TATCTTTTCT 


GTCTTCGTAA 


AATCACTCAA 


CTGTTTATCA 


ATGCGATATT 


AAAATATTTG 


780 


TCATCATTAA 


TCACCCAACA 


AATCTGTCTT 


GTCGCATCGC 


CTTTGTCGGT 


CCAAATAAAT 


840 


ATGTACAAAC 


GAATCCACCA 


GCATATGCAG 


CAAGTAATCC 


TGCAATATAA 


CCTAAATACA 


900 


TATTATCTGA 


GATTAATGGT 


AAGAGTGACA 


CACCACTTGG 


GCCTATTGCT 


TTGGCACCAA 


960 


TATGTCCAAT 


TCCACCTATT 


ACAGCGCCAC 


CAATACCACC 


ACCAATACAA 


GCAGTTAAGA 


1020 


AAGGTCGAAC 


TAATGGGCAA 


AGTCACACCA 


TAGATTAATG 


GTTCTCCGAT 


ACCTAGGAAA 


1080 


CCAACTGGCA 


ATGCACCTTT 


TAAAGTATTA 


CGTAATGTTG 


TGTTGCGTTT 


ACATCTTACC 


1140 


CAAAGTGCTA 


ATGCGGCACC 


TACTTGTCCA 


GCACCCAGCC 


ATCGCTGCAA 


TTGGCAATAA 


1200 


GTAAGTAGCA 


CCAGATTGGT 


TAATCAATTT 


CTATATGAAT 


TGGCGTNAAA 


AATATGATGA 


1260 


AGGCCCTAAC 


AATAACTTAA 


CGGTAGGAAG 


CTTGGTCCAA 


ATGATAAATC 


CACTTAAATT 


1320 


ACGNCACCCA 


ATACTTAATA 


ATCCCCGTTA 


CTACTGAAAC 


TTAANTTGTC 


TTTGAAACAA 


1380 


A 












1381 
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(2) INFORMATION FOR SEQ ID NO: 1115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115: 



GGCACGAGAC 


CGCATAGACA 


TCTAAACCAT 


GTGTCTCTTC 


TTTGTATTTA 


TTTGCATCTT 


60 


TTCCCAATAA 


AAATACTTGG 


AACTTTTGAT 


TTTGTTGGTT 


AAAAACGCAA 


AACGTTAGAG 


120 


TCATAATAAC 


CTTGACTGTA 


GGCCACTAAG 


ATTTGTTAAA 


TCAAAACTAG 


GTCCTGAATA 


180 


GTATTGATGT 


AACTGGTTTG 


TATCATATAA 


ACGGTACATT 


TTTTCATACT 


TTCCTTTTGC 


240 


ATTAACCGAT 


TGATTTTCTG 


TTAGCATTAC 


ACTTGTTGTT 


AATATGCCTA 


ATACAAATAT 


300 


CGTCTTTAGC 


TATCGCTGTA 


AATTTCATAT 


TCATATGCAT 


CCAATCTTAA 


TATATTGGAT 


360 


CGCTTTTATT 


ACGTACCTTT 


GAATAATCAC 


ATTTATTATA 


AAGGGCATTG 


AGACCGACTG 


420 


ACATTAACAC 


ACGTTNTATT 


TAACATTAAA 


TCATGGTTAA 


TGAA 




464 



(2) INFORMATION FOR SEQ ID NO:1116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116: 



GGCACGAGCG 


TGAAGAAAAA 


GATGTCTTAT 


ACGTTGTTAT 


TTCTAAAACA 


CAAGTTTCAA 


60 


AAGCAAAGCG 


ATTAATTAAA 


CAAATCGATA 


AAGATGCATT 


CCTCGTAATT 


CATGATGTAA 


120 


GAGATGTCTA 


TGGTAATGGC 


TTTCTTGCAG 


ATGAATAAAT 


AAATGGTATG 


AGCACACATA 


180 


CTTAAATAGA 


AGTCCACGGA 


CAAGTTTTTG 


AACTATGAAG 


ACTTATCTGT 


GGGCGTTTTT 


240 


TATTTTATAA 


AAGTAATATA 


CAAGACATGA 


CAAATCGAGC 


TATCCAATTT 


AAAAAGTAAT 


300 


GTTAGTCAAT 


AAGATTGAAA 


AATGTTATAA 


TGATGTTCAT 


GATAATCATT 


ATCAATTGGG 


360 


ATGCCTTTGA 


AAATTGATAA 


TTTAAAAATA 


GAAATTATTT 


TTTATAAACA 


GAAAGAATTT 


420 


TATTGAAAGT 


AGGGAAATTA 


TGAATCGTTT 


GCATGGACNA 


CNAGTTANAA 


TTGGTTACGG 


480 


GGATAACACG 


ATTATAAATA 


AATTAGATGT 


NGAANTNCCA 


NATGGCAAAG 


TGACGTCAAT 


540 


CATTGGTCCT 


AACGGCTGCG 


GGAAATCTAC 


TTTGCTAAAG 


GCATTGTCAC 


GTTTATTGGC 


600 
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AGCTAANNAA GGCGAAGTAT TTTTAGATGG GTGGAAAATA TTCAATACAC AATCTACGAA 660 
AGAGATTGCA AAAAAAATAG NCATTTTACC TCAATCACCT GAAGTAGCAA GATGGCTTAA 720 
CTTGTTGGGG AATTAAGNNC ATTGNCCGGC NTCCACATCA AAANGGTTGG 770 

(2} INFORMATION FOR SEQ ID NO: 1117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117: 



GGCAAGAGCG 


AGATGATTCC 


AGTTCGATTA 


TCCATTGGCG 


CTAAAAACAT 


TTATTTACCG 


60 


TTCATTCTTC 


GTTAAACAAT 


TTTGGAATAC 


AGTACGGATA 


CAATATGGAG 


ATGTAAAAAA 


120 


CTAATAACCT 


TTTACAAATT 


TGTTTATCAA 


AATATTTTAA 


GTTTTGCAAA 


GCTTTTTATT 


180 


GTGATTATTT 


TCAACAAAAT 


ACTATAATGA 


GGATAGTAAA 


TAGAGAGGAG 


TCCTTAAGTT 


240 


GACGAAACGA 


CAAATGGGTA 


TATTCATTTA 


TGCTGGAATT 


ATCGGTGGCT 


TGTTATCTGG 


300 


AATTGTAAAA 


TTAGGTTGGG 


AGNTCATGTT 


TCCCACCTCG 


CACACCAGAA 


CGTAATGCNA 


360 


CCGAACCCAC 


CTCAAGAGTT 


ATTGCAACAA 


TTAGGATTTA 


GTAGTGAGTT 


TACACATCAA 


420 


ACATATACAT 


TTTCAAATAT 


GGAATTGCCT 


TGGGTAAGCT 


TTATTGTCCA 


CTTTAGTTTT 


480 


TCTATCGTCA 


TTGCAATTAT 


TTACTGCATA 


TTAGTTAAAA 


AATACGCTTA 


CTTAGCAATT 


540 


GGACAAGGTG 




TATTGCTATT 


TGGGTATTAT 


TCCACCTTAT 


CATTATGCCA 


600 


ATCATGCATA 


CTGTACCTGC 


TGTGTGGGAT 


CAACCATACC 


AAGAGCATCT 


GTCAGAATTC 


660 


TTTGGCCATA 


TCGTCTGGAT 


GATGACAATT 


GAATTAGTGC 


GACAACATTT 


TGTCTATCGC 


720 


TATAAATTAA 


ATTAATACAC 


TGACTAACAT 


TAACGTGAGT 


TTTAAATCAT 


CGTTTGAGTA 


780 


TGATGATTGA 


TGCTCACGTT 


ATTTTATTAA 


CTGACATGAT 


ATGATTCCAG 


CCAACTTACG 


840 


TGAGCATTAA 


AGTCTCAAAT 


GCGTCGTAAC 


AAGATATTAT 


TTTTGGCAAT 


TTCAATATTG 


900 


CTCAGTATAT 


TTTCACCTTA 


TCACTTACAT 


TAATCTCGTC 


ATGATTTTGA 


ATGATGCCAT 


960 


CGTGTATTCA 


CCTTTCATTT 


TTCCCAATAA 


AAATA 






995 



{2) INFORMATION FOR SEQ ID NO: 1118: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118: 



L. 1 1 AAAACA 


CCAAAGCAAC 


TCTATCTAAT 


TAAAATTAAT 


TTTATCATCA 


TTATATATTG 


60 


ACjlACCAGTG 


TATTTAATAT 


AACATAGTGA 


GACTTTGTTT 


TTATNTTATT 


AATATCATNT 


120 


NACGTTTGTG 


CTATAAATTA 


TTTCTAGAGA 


CACACANAAN 


• CCGATGCATA 


CGCATCGGCT 


180 


TATTTGTAAA 


TACAGTATTT 


ATTTATCTAA 


TCCCATTTTA 


TCTTGAACCA 


CATCAGCTAT 


240 


TTGTTGTGCA 


AATCTTTCAG 


CATCTTCATC 


AGTTGCTGCT 


TCAACCATGA 


CACGAACTAA 


300 


TGGTTCTGTT 


CCAGAAGGTC 


TTACTAAAAT 


TCGACCTTCT 


CCATTCATTT 


CTACTTCTAC 


360 


TTTAGTCATA 


ACTTCTTTAA 


CGTCAACATT 


TTCTTCAACA 


CGATATTTAT 


CTGTTACGCG 


420 


TACGTTAATT 


AATGATTGTG 


GATATTTTTT 


CATTTGTCCA 


GCTAATTCAC 


TTAGTGATTT 


480 


ACCAGTCATT 


TTTATTACAG 


AAGCTAATTG 


AATACCAGTT 


AATAAACCAT 


CACCAGTTGT 


540 


ATTGTAATCC 


ATCATAACGA 


TATGCCCAGA 


TTGTTCTCCA 


CCTAAGTTAT 


AATTACCGCG 


600 


AGCATNTCGT 


GCTATGCTAC 


ATATCTGTCG 


GCANCTTTAG 


TTTTATTAGA 


TTTAATTCCC 


660 


TCTTGGTCAA 


GCGCTTNGGT 


AA 








682 



(2) INFORMATION FOR SEQ ID NO: 1119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119: 



TGTATAGGCC 


TATTTGTATA 


NAAAGTAATA 


TGATGTTTAG 


TGGCTATGTA AGTGAAGCAA 


60 


TGTATAAATA 


ATGATGAGTG 


GTTTGTTACT 


AATGATAATG 


GCTATGTAAA AGAGCAGTAT 


120 


TTATATTTNG 


CGGGACGTCA 


ACAGGATATG 


TTAATTATTG 


GTGGGCGAAA 


TATATATCCA 


180 


GCACATGTTG 


TNCGCCTTTT 


AACGCAATCT 


TCGAGCATTG 


ATGAAGCAAT 


TATCATCGGT 


240 


ATTCCACATG 


AGCGTTTTGG 


TCANATAGGC 


GTATTGCTTT 


ATTCTGGTGA 


TGTGACACTT 


300 


ACACATAAAA 


ATGNGAAACA 


ATTTTTNAAA 


AAGAAAGTGA 


AACGCCATGA 


AATTCCATTC 


360 


GATGATTCAT 


CATGTAGAAA 


AGATGTATTA 


CNCTGCAAGT 


GGTAAAATTG 


CTAGAGAAAA 


420 


AATGATGTCG 


ATGTATTTGA 


GAGGTGAATT 


ATAATATGAA 


TCAAGCAGTC 


ATAGTTGCAG 


480 


CTAAACCGAA 


CTGCATTTGG 


GNAATATGGT 


GGCACTTTAA 


AACATTTAGA 


GCCAGAACAA 


540 


TTGCTTAAAC 


CTTTATTCCA 


ACATTTTAAA 


GAGAAGTATC 


CAGAGGTAAT 


ATCTAAAATA 


600 


GATGATGTAG 


TTTTAGGTAA 


TGTTGTTGGG 


ANTGGTGGCA 


ATATTGCAAG 


AAAAGCATTG 


660 


CTTGAAGCGG 


GGCTTAAAGA 


TTCAATACCT 


GGCGTCACAA 


TCGATCGGCA 


ATGTGGGTCT 


720 


GGACTTNGAA 


AGTGTTCAAT 


ATGCATGTCG 


CATGATCCAA 


GCCGGAGCTG 


GCAAGGTATA 


780 


TNTTGCAGGT 


GGTGTTTGAN 


AGTACAAGTC 


GAGCACCTTG 


GAAAATCAAA 


CGACCGCANT 


840 


CTGTGTACGA 


AACAGCATTA 


CCNGAGTTTT 


TATGAGCGTG 


CATCATTTGC 


ACCTGAAATG 


900 
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AGCGACCCAT CACATGATTC AAGGTGCTGA AAATGTGGCC AAGATGTATG ATGTTTCAAG_ 960 

AGAATTACAA GATGAATTTG CTTATCGAAG TCATCAACTG ACAGCGGAAA ATGTAAAGAA 1020 

TGGAAATATT TCTCAGGAAA TATTACCTAT AACCGTTAAA GGAGAAATAT TCAATACTGA 1080 

TGAAAGTCTA AAATCACATA TTCCGAAAGA TAACTTTGGC CGATTTAAGC CCGTAATCAA 1140 

AGGTGGGACC GTTACCGCTG CGAATAGTTG TATGAAAAAT GATGGTGCAG TTTTATTGCT 1200 

TATTATGGAA AAAGATATGG CATACGAATT AGGTTTCGAG CATGGTTTAT TATTTAAAGA 1260 

ATGGTGTTAC GGTAGGTGTT GATTCTAATT T 1291 

(2) INFORMATION FOR SEQ ID NO: 1120: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:1120: 



GCGTGCATTA 


GATAGTATTT 


TACCATTTTT 


AGGAAAAGGG 


AATACTATTA 


TTGTAGAGTC 


60 


GACAATTGCG 


CCTAAAACGA 


TGGATGATTT 


TGCAAAACCA 


GTCATTGAAA 


ATTTAGGATT 


120 


TACAATAGGT 


GAAGATATTT 


GTTTAGTGCA 


TTGTCCAGAA 


CGTGTACTGC 


CAGGAAAAAT 


180 


TTTAGAAGAA 


TTAGTTCATA 


ACAATCGTAT 


CATTGGCGGT 


GTGACTAAAG 


CTTGTATTGA 


240 


AGCGGGTAAA 


TATGTCTATC 


GCACATTCGT 


TCAGGGAGAA 


ATGATTGAAA 


CAGATGCACG 


300 


TACTGCTGAA 


ATGAGTAAGC 


TAATGGAAAA 


CACATATAGA 


GACGTGAATA 


TTGCTTTAGC 


360 


TAATGAATTA 


ACAAAAANTT 


GCAATAACTT 


AAATATTAAT 


GTATTAAGAT 


GTGATTGAAA 


420 


TGNCAAACAN 


ACAATCCGCT 


CGTGCCAATA 


TCCATCAACC 


TGTCCAGGTG 


TAGGCGGTCA 


480 


TTGTTTAGCT 


GTTGATCCGT 


ACTTTATTAT 


TGCTAAAAGA 


CCCTGAAAAT 


GCAAAGTTAA 


540 


TTCAAACTGG 


ACGTGAAATT 


AATAATTCAA 


TGCCGGCCTA 


TGTTGTTGAT 


ACAACGAAGC 


600 


AAATCATCAA 


AGCGTTGAGC 


GGGAATAAAG 


TCACAGTATT 


TGGTTTAACT 


TATAAAGGTG 


660 


ATGTTGATGA 


TATAAGAGAA 


TCGCCAGCAT 


TTGATATTTA 


TGAGCTATTA AATCNAGNAC 


720 


CAGACATAGN 


AGTATGTGCT 


TATGATCCAC 


ATGTTGNATT 


AGATTTTGTG 


GAACATGATA 


780 


TGTCACATGC 


TGTCAAAGAC 


GCATCGCTAG 


TATTGATTTT 


AAGTGACCAC 


TCAGNANTTA 


840 


AAAATTTATC 


GGACAGTCAT 


TTTGATAAAA 


TGAAGCATAA 


AGTGATTTTT 


GATACAANAA 


900 


ATGTTGTGAA 


ATCATCATTT 


GNAGATGTAT 


CGTATTATAA 


TTATGGCAAT 


ATANTTAATT 


960 


TTATCGACAA 


ATAAAATGTG 


TCNAACTAGG 


GCATACATGA 


TTAAGGNAAG 


ATAAGCTCGT 


1020 


GCCATGTGTT 


TGNACNTCAG 


AGAGGATAAT 


GTTATGAAAA 


AAATTATGGT 


TATTTTCGGT 


1080 


ACGAGACCCG 


NAGCAAAAAA 


ATGGCACCAG 








1110 



(2) INFORMATION FOR SEQ ID NO: 1121: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121: 



GGGAGGAACA 


GATATTCAAT 


TGTAATGATA 


TAAATATCTC 


GGCACAATGA 


TGCTGGGATA 


60 


TTTTTTAACA 


AATAAGGCAA 


TAAAAGGTAT 


GGGAAATGCG 


TATCTAGCGC 


TAATGATAAC 


120 


TTAACATGCA 


AAGAAGATAC 


TATTTAAATA 


TTTCAATTTC 


AGAAATTTTA 


AAAATAAATT 


180 


CACTTTAAAA 


ATATTGCATT 


TATAGAGAAT 


AGAAATGTTA 


TATTAAGATA 


AATACATATT 


240 


TTTTTAGGAG 


GCAAAGTATG 


GGACTTGTAA 


TTGAACATGT 


TACGAAACGT 


TTTGGCAAGA 


300 


TGACGGCAGT 


AAATGATATT 


TCATTAGAGT 


TAGAATCAGG 


CAAAATGTTA 


GGTTTTCTAG 


360 


GAAGAAATGG 


TGCAGGTAAA 


ACAACGACAT 


TTCGTATGAT 


TTTAGGCTTA 


AGTGAACCAA 


420 


CTGAAGGACA 


CATTACGTAT 


AACGGTAAAA 


AACTAGATAA 


AACAATGTAC 


AATCGTATTG 


480 


GTTATTTACC 


GGAAGAACGC 


GGTTTACATG 


CCGGAAATTG 


ACAGTTGAAG 


AAGAATTGAA 


540 


ATATTTGGCA 


ACTTTAAAGG 


GAATGTCAAA 


AACAGAAATA 


CAACAGCAAA 


TATCGTATTG 


600 


GCTTGAGCGT 


TTTGATATTA 


CTGAAAACCG 


CAAAAANCGA 


ATTGATAGTC 


TATCAAAAGG 


660 


GAATCAACAA 


NAAATTCAGC 


TGTTAGCAAG 


TATGTTACAT 


AAACCAGAAT 


TACTAATATT 


720 


AGATGAGCCT 


TTCAGTGGTC 


TAGACCCAGT 


TAATGTTGAG 


TTATTAAAAG 


AAGCAGTTAA 


780 


AGACTTAAAT 


GATTGGGGCA 


GTACAATCGT 


TTATAGTTCT 


CACCGTATGG 


AGCATGTTGA 


840 


AGAATTATGT 


GATGATGTTT 


GTAG 








864 



(2) INFORMATION FOR SEQ ID NO:1122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1122: 

AAACCCNCCT TGTCTCATTA AGTAAGAAAT CGTTAAAAGA TACATCCTTC CNAATTNNAA 60 

AACCCCGNGC CTTTTGTATT GTTTGTCACA CGGATCTACG TTTANCAATG TTGTCATATC 120 

CATGCTCCTC TTTGTNTAAT TTTAATAAAA ACGTCTTCCC TCCAATAAAA TAGCTANAGG 180 

GCGTAAATAT ATGNNAAATA CCNATTGATA AGTACCCNAA CACTAAAGCT TTTGANTAAT 240 

TCCTGTACCA ATTTGAATAA AGACCCTATA AAAATAATGG CTAGTATCAA TATTGATAAG 300 

TTGNTCGCAT ATGTTTTAAA TAATGCAATT AATTCTAATA TGGCGTTCAC CTTCATCCAA 360 
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T A TTTPP A AT 


Pf^IPTTTTPAf^ 


TY2TAATTPTT 


ATPTGP AATT 


TTTGGCATTP. 


TCyrCATCAAA 


420 


XXX 1\. 1 rVl X r\ 


A AP AfTTY^T A A 
AnLAU X\J X t\e\ 


ATTf2f2f2TATT 


GANNATAGCA 


CTANCAAAAA 


ATGGTAGTAA 


480 


AAAGAATAHP 


ATOfSPATTTG 

MXUU^AX X XV7 


OTTPAnfTTfi 

^» X X \»AVA» 1 1U 


CCCTACCACA 


AAAATTAGTA 


ACACTACAAA 


540 


fZf^P A AT A ATT 


fiTPKHTATAA 


PflPTTA ATAT 


APTTPPATTT 


AAAAPATAAP 


GATTPGPAAG 


600 


PA ATTVATAT 


X\3/\X\,X\jX X Vj 


PATPCJATATfS 
LAiVunlnlVj 


TAPPTPTTPA 


TPTAPAARTP 


OA— 1 1A1 Al^ 1 1 


DOU 




iNll IVxAl XV.A 


TPAAAAPAAT 
xa« aaaav* aa x 


f2f2PTTPTAPP 


AATATP-ATfJA 


tp at a riTpnp 


/ ZU 


A A P AATPfiAT 


ATTAP A APfS A 


PATTYTTTATA 
v»-A l xwx xaxa 


m\\j x v»o\^AAA\* 


GTCAAtTTTCT 
xj x unnu x x ^> x 


PAATAPPPAA 




TPPATPAAAC5 


TTAPMTA A A A 
X lALHlnnnn 


TTPPAPPAAT 


GATAPPCCCT 


ACAATTPPAP 


PP AflTA A A Ad 




ATATVPRATA 


TATMTTAPTA 

X £\ X XV X X n!L3 X A 


TTTTPATOPT 

XXX X V~*V J. U\j X. 


TPAPPPTPPT 


PAATAATAAA 


A APPP/PTTPP 
AAV«i,o X X lew 


^uu 


& PPfSTTTP AT 


Tf2A A 11 ATY^Pf2 
X UAAAA IuCVj 


a/y A ATPTTT 

AVtV^AA X X X X 


ATTTJPPfyrTA 

AX XOV»V*V7X 1A 


ATAPTY2ATf2P- 


P ATA A A ATTP 
LA1AAAA1 1\j 


joU 


rnqwiyvyPP A A 
1 1 X\-\jt_ iLAA 


1 IAIj^AjAAAI 


UV7 X X lO IV- 1 1 


f2A AAPAPPAP 


I— lAul 1 X\jV,V~ 






TGATTTAAGC 


CATCTCGTGC 


TCGTAACTCT 


TTCAATCGAT 


TACGCACATC 


GCATCAACTC 


1080 


CTTAGTCCTA 


ATGTAAATGA 


TATTTGTCTT 


TGTGACAACT 


ATATTCGTCA 


AATCTACACA 


1140 


AAAAAATATG 


ATTGTCCACC 


TATGTATGAC 


ATTGTNGAAA 


CAACACCTCA 


ACGCCTACAA 


1200 


GTCATAATTG 


TTTACTTTCG 


GNTACACCTT 


CCTGCATAAT 


TAACAGCATT 


CTAATTTTAG 


1260 


TATGATGCAC 


GCATTTTCAC 


TAAATCAAAC 








1290 



(2) INFORMATION FOR SEQ ID NO: 1123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123: 



GCCCATCCAT 


ATCTAATGTT 


ATGTTAATAG 


TLTlTriTAC 


TTTnTTTCCA 


TTTTTAATTT 


60 


CTGTATCTTT 


TACCGACTNA 


ATTGTGATTA 


GNGGTACAGN 


ATCGTATGTG 


TTGTCTTTTC 


120 


TATTTAACTT 


CCTTTTATAA 


CCNACATGGC 


TGTCTATTAA 


AGCATCTAAC 


CTGGGCACAC 


180 


CANCACTAAT 


GTTAACGCGT 


TTTCCTGGCA 


TATCTTTGAT 


GAATGGATCT 


TGTAACCATG 


240 


TTAATAAATC 


GTAGGTACTA 


TTAAAACTAA 


TCATATTATC 


AAAGCGTGGT 


CTTAGCAAAT 


300 


TTCTGNCAAG 


CAGCATAAGG 


TACCATTGCT 


GGGTCAAGTA 


GCAACAACTT 


TTTCATTTGG 


360 


ATGTTTCGCT 


CCTTGATATT 


TTGCTCCTGC 


ACCGTCTTCC 


GAATTACCGC 


CATCCGCCAC 


420 


AATGGTTTTG 


TTTTTGTAAT 


TATTTGGACT 


AACACCATAT 


TTTTGTGTAA 


AGTTATACTT 


480 


ACTTAATTTA 


TTAGCGTCAT 


TTAGTTTGTC 


TCTATATAAA 


TCTGCAAATT 


CGTCTGATTG 


540 


CTTAAGATAA 


TCCGTTGACT 


TATTACTATT 


ATCCATTAAT 


TTCGCATTTT 


GTTAACCAAT 


600 


CATCTCCCGA 


TATCTAAAGA 


TTTTAATGGA 


TTATTAGGGT 


TTATTGCCTC 


CATTAGATGT 


660 


TCCTTGATAA 


ATCATGGTTT 


GTTCACCAGT 


TGGTTTTCCT 


TTTTCATCCA 


ACAATTCATA 


720 


AATTTTTAAA 


TCTGAGGCAC 


cm-riTATT 


TTTATTTCCA 


TTATCATTAT 


ATTCATCAAC 


780 


CTGCTTAAAT 


CTTTTTCCGT 


TAACTGTAA 








809 
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(2) INFORMATION FOR SEQ ID NO: 1124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124: 



GGCACGAGCG 


ATCGAATTTT 


AATAAAATAT 


CATTGTTTAA 


CGGTCTTGGC 


GGACGCGTTT 


60 


GATGAATATC 


AGGAATCGCA 


TCTATTAAGC 


GTTTTGTATA 


GGTATGTTGT GGCGATTTAA 


120 


AAATACTTCA 


ACCGTGCCAC 


TTTCAACGAC 


ACTTCCATCT 


TTCATTACAA 


TCACATCGTC 


180 


GCAAAATTGA 


TACACAGCGC 


CTAAATCGTG 


AGTGATAAAA 


ATAATAGATG 


TTTCTGTGTA 


240 


CTCATAAAGG 


GACTTCATTA 


ACTGCAGTAA 


TTGATTTTGT 


GTACTGGCAT CTAATGCCGT 


300 


TGTTGGTTCA 


TCTGCGATTA 


AAATTTGTGG 


CTTTAAAATC 


AATGCCATTG 


CTATCATGAC 


360 


ACGTTGACGC 


ATACCACCAG 


AAAGTTCATG 


TGGATAAGCA 


TCAAATTGTC 


GAGTTGCATG 


420 


TTTTTATACC 


TACTTTTTCT 


AAAATGTTTT 


ATTGTCATCG 


ACTTTGCTTC 


AGATTTAGAT 


480 


ACACGTTTTA 


TGTTTGAAAT 


AGTACTTCCT 


GTAATTTGTT 


TGCCAATCGT 


TAATCCTTGG 


540 


ATTCANCGAA 


GAGAGTGGAT 


CTTGAAAAG 








569 



(2) INFORMATION FOR SEQ ID NO: 1125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125: 



CTAGTTCTAC 


AGTGCCTAAG 


TTATATCACC 


TCCATCAAAA 


GTACGCGCTT 


ACATCATTGA 


60 


TACACAAGAC 


ATCATGACAT 


AGTTGTATAC 


TGTCTAATCT 


TCTACTATAC 


TAAGATAAAA 


120 


GGAGTGATCT 


TTTATGGAAA 


AAGTTTATGT 


TGCTGGTGCA 


ATACCAGAAG 


TAGGGTTAAA 


180 


ACTTTTACCA 


AAGAACAATT 


TTGNAGTTGN 


AATTGTATGA 


AGGTAAAGGA 


TTAAGTCGAT 


240 


AAAAGACCAC 


NTTANTTAAA 


GGTTGTAAAA 


AACCGCGACT 


GCCTTAATTA 


GTTTATTAAT 


300 


CTACAAACGT 


TGATAAAGAT 


GTTATCGATN 


CTGGTAAAAG 


ACNTAAAAAT 


CATTGCCAAC 


360 


TATGGCGCTG 


GNTTTAATAA 


TATTGATATC 


GAGTATNNCC 


GAGANAAAAG 


TATAGATGTT 


420 



956 



WO 97/30070 



PCT/US97/02318 



AGAAACAGAN 


\-Tl AAACjCA IV 




WL. 1 VjV^ IvxAl 1 


nth Ar*A ATTY^T* 


Al lAuiALl 1 


Ann 








r* a a tta tc a r* 


fTM A C A A rTY2f2 
uNAV* AAV- 1 VjVj 


Al 1 IuAUjuA 




7* ^**/ *f|V ft 

lufouCAlA* 1 1 


1A1 111 iuAu 


AfSfTAnAfiA & 

nub 1 Aununn 


uln 1\. 1 Own 


a a ara atyw* 


PBTTaTYVyT 
Uil lAlVbul 




I IALAjIvjAAA 


1 1 \iO 1 Av» 1 


Avr 1 ALA. 1\A3 1 




LAI 1 IVjALAI 


vnjAlVj 1\tL. 1 A 


reft 

oou 


TACACTGGAC 


CTAATCGCAA 


7\ /""» A Af* A A A XT A 

AvjAAVjAAANA 


{jAALLxALxAAA 




A T* A TVVTVp A /"* A 

ATA rtJTT AGA 


Ton 


TTTAGATACA 


CTATTAAAGA 


ATGCAGATTT 


TATCACTATC 


AACGCTGCTT 


ATAATCCCTA 


780 


AAATGCATCC 


ATTTAATTGA 


TACAGAACAA 


TTTAAAATGA 


TGAAATCTAC 


GGCGTTATTT 


840 


AATCAATGCC 


TCCTCCGTTG 


GTCCCAATCC 


GTGCACGAAA 


CAAGCACTAG 


TTGCAAGCCA 


900 


TTGAAAGATA 


ATGAAATTGA 


AGTGCTGCAC 


TTGATGTTAT 


ATGAATTTTG 


AACCCAGATA 


960 


TTACCGATGA 


ACTTTAAAAT 


CACTTAATAA 


TGT 






993 



(2) INFORMATION FOR SEQ ID NO: 1126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126: 



TTCCTGAAAA 


TACAATTAAT 


GTTTATACGA 


ATGTCATGCT 


GTTCAAAGTC 


TTGCAGCATA 


60 


TGCACCGAAA 


GGCTTAATGA 


TTCTCATACT 


GGAGATGCGA 


ATGACTATGT 


TGGTAAAGGA 


120 


TTATCTGGTG 


GTACGTCATT 


GTCAAAGCAC 


CTTTTGAAGA 


ACGACAAAAT 


GAAATTATTG 


180 


CTGGTAACGT 


CTCATTCTAT 


GGTGCGACAG 


GTGGTAAGGC 


ATTTATTAAC 


GGTAGTGCAG 


240 


GAGAAAGATT 


CTGTATTAGA 


AATAGTGGTG 


TAGATGTTGT 


CGTTGAAGGT 


ATCGGTGACC 


300 


ATGGATTAGA 


GTACATGACT 


GGTGGACATG 


TCATTAATTT 


AAGGTGATGT 


AGGTAAAGAA 


360 


CTTCGGTCAA 


GGGTATGAGT 


GGTGGTATTG 


CTNACGTTAT 


CCCGTCTGAT 


GTAGAAGCTT 


420 


TTGTTGAAAA 


TAACCAACTA 


GATACGCTTT 


CGTTTACAAA 


GATTAAACAC 


CAAGAAGAAA 


480 


AAGCATTCAT 


TAAGCAAATG 


CTGGAAGAAC 


ATGTGTCACA 


CACGAATAGT 


ACGAGAGCGA 


540 


TTCATGTGTT 


AAAACATTTT 


GATCGCATTG 


AAGATGTCGT 


CGTTAAAGTT 


ATTCCTAAAG 


600 


ATTATCAATT 


AATGATNCAA 


AAAATTCATT 


TGCACAAATC 


ATTACATGAC 


AATGAAGATG 


660 


AAGCGATGTT 


AGCTGCATTT 


TACGATGACA 


GTAAAACAAT 


CGACGCTAAA 


CATAAACCAG 


720 


CCGTTGTGTA 


TTAAGGAAAG 


GGGGAGATAC 


GATGGGTGAA 


TTTAAAGGAT 


TTATGAAGTA 


780 


TGACAAACAG 


TACTTAGGTG 


AATTATCANT 


GGTAGACCGT 


TTGAAGCATC 


ATAAAGCATA 


840 


TCAACAACGA 


TTTACTAAAG 


AAGATGCCTC 


TATCCAAGGT 


GCGCTCGTGC 


C 


891 



(2) INFORMATION FOR SEQ ID NO: 1127: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 691 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127: 



GTCCATCTGT 


TTTAATGCAT 


TGATATAGTT 


ATCAAGTTCC 


ACATATCTTT 


TGGGGATTTA 


60 


TAATCTGGTA 


ACATTTTATC 


CTTCGCTGTA 


TTGTGCTCGT 


CAAATAATGA 


ATTTTGTTTT 


120 


TCAATTAAAG 


TCGAAACATN 


ATAATCTGTG 


TTCACCCTGA 


ACGTATNTGA 


ATTCGCACTC 


180 


AATAATGATT 


TATTGTATGT 


TTGGAACCAT 


TTTGTAATGT 


CTTTGTTTGC 


AGAAATTGAA 


240 


TTTACAAGCG 


TATCTGTAAA 


TAATTCCGGG 


AAGTCGTTAA 


TTGGATTTAA 


TAAGTAATTC 


300 


GAGAATTTAC 


TATTCACACC 


ATGTTCACGC 


GTCATAATAG 


CGCCAACATT 


TTTTTGTGCA 


360 


TTATGTAAAT 


TATCAATGAT 


GCTTGTTAAA 


TAAATTTCGA 


CTAAGTTTTT 


GTTAAAGTCG 


420 


TTAAGTACAT 


TACTTACAAC 


TTTTTCTGTG 


TTTTTAGCTA 


CTTCTTCTTT 


TTGTCCTACA 


480 


GCTGTTTTAT 


ACTGTAGCGA 


TATTTTCGAT 


GGTGTTTTAG 


CGTCTAATTG 


CATTGCCAAT 


540 


TTTGGAAAAG 


TTTTCTTGGG 


ATAACAATCA 


TGAATNGGGC 


ATCCACCATT 


TNTCAANCCA 


600 


GACTCAGCAA 


CGTTTCTTGG 


TTACTGTTTC 


AAATTTATAG 


GNCTNCTCAA 


CTGGTTAACC 


660 


CTNNTAATAA 


ATTGCTTGGA 


CCCAGCNCAA 


C 






691 



(2) INFORMATION FOR SEQ ID NO: 1128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128: 



TCCAGCAATA 


TATTGCAATT 


ACCCTAAAAA 


CACGTTACTT 


TAATATTTAG 


TTAAACCAAA 


60 


TAAGCTGATG 


AATAGAATGG 


AAGATAATAC 


CTGAAACGGA 


AATAATCGTT 


TCTAATAATG 


120 


ACCATGTTAA 


GGAATGTTTC 


TTTTACAAGT 


TAAACCCCAA 


AATATTCTTT 


AAACAATCCA 


180 


AAATCCTGCG 


TCAATTTACA 


TGAGACAAAA 


TCACACTACC 


TGCACCTATC 


GCAAGTACAA 


240 


CTAATGCAAC 


ATTTACATCT 


GATGATTGTA ATAATGGTAA 


GACAATACCT 


GTAGTTGAAA 


300 


TCGCAGCTAC 


TGTAGCCGAA 


CCTAATGCGA 


TACGTAGCAC 


GAGCTGCAAC 


AANCCATGCT 


360 


AGTAAAATTG 


GAGACATCTC 


TGTACCTTCA 


AACATTTTAG 


CAATTGTATT 


TCCGACACCA 


420 


CCGTCAANTA 


ATACTTGTTT 


AAATGTACCG 


CCACCGCCNA 


TAATCAATAA 


CATCATTCCG 


480 


ATTGGATAAA 


TCGCATTCGT 


CACTGATTCC 


ATAATATGAT 


TCATCTTACG 


TTTTCTCATT 


540 


NATCCCATCG 


TAACGATTGC 


AAATAATACT 


GCTATTAGCA 


TGGCTGTCCC 


TGCTGTTCCT 


600 
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ATCATATAAA 


xvaAxAutAx AV. 


A A ATAfSATTT 

AM/1 X/vUXlX X X 


OTAI^^TTTV^'F 

w X AUU XXX U X 




TACAAGTTGC 


660 






T A AT A TCZ APT 


GGTAATGTTG 


CAGTTAATAA 


ACTCATACCA 


720 


AA, lT-t. l\iv»\^A 




Ptf^'FA A A'!"! *PT 


X X X X VJ X UWiV* 


PTAACGPTGA 


AATATCGCCT 


780 


TCTCGTGTAT 


ACGCAGACGG 


AATCATTTTT 


TGTGCAACTT 


TGTTAAATAT 


AGGTCCTGCA 




ATGAGTGTAA 


CTGGAATGGC 


AATAATCATA 


CCGTACAGTA 


ATACATCTCC 


AACATTTGCC 


900 


TTTAATTCTT 


TTGCGATGAC 


TACCGGTCCT 


GGATGTGGTG 


GTAAAAAGCC 


ATGTGTCACA 


960 


GATAAAGCTG 


ttaccatagg 


TAGTCCTAGT 


TTTAACACTG 


AAACATTTGC 


GCGGCACGAG 


1020 


CTCGTGCC 












1028 



(2) INFORMATION FOR SEQ ID NO:1129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129: 



AGTATTTTTT 


TGACCGAGAC 


ATGCAAGCCC 


TCCTGGTTGA 


TTATCATTTA 


CATATCGTAT 


60 


CATAACGAAT 


CATAAACAAT 


AGTTTTATTA 


ACTTGTAGAC 


ACAATGTTTG 


CTAAAGTCAT 


120 


TTTTAAAAAA 


TATAGCCAAA 


AAATTAGCTA 


TATATTATAA 


AAGCGTGATA 


TAAATGTTTT 


180 


ATATAACAAA 


GAAATAAAAA 


TCATTTTTTA 


CAAATGGTTG 


TAAGAAAAAG 


ACATGCAGAT 


240 


GTTGTTAAAA 


TTTTAATAAG 


AATCAAGGAG 


GCTATATTAT 


ATGGCTAAAC 


TAAATGTAGA 


300 


AGTATTTGCG 


GACGGTGCAG 


ATATTGAAGA 


AATGAAAGCA 


GCTTATAAAA 


ACAAACAAGT 


360 


GGATGGTTTT 


ACAACAAATC 


CTAGTTTAAT 


GCCGAAAGCG 


GCCGTAAGCA 


GATTACAAAG 


420 


CTTTTGCTGA 


AGAAGCTCGT 


GAAAGAAATT 


CCAGATGCTT 


CAATTTCATT 


TGAAGTATTT 


480 


GCAGACGATT 


TAGAAACTAT 


GGAAAAAGAA 


GCAGCAATTT 


TAAAACAATA 


TGGCGAAAAT 


540 


GTATTTGTTA 


AAATTCCTAT 


TGTAAATACA 


AAAGGTGAAT 


CAACGATTCC 


TTTAATTAAA 


600 


AAACTTTCAG 


CTGACAATGT 


GAGATTAAAC 


GTTNCGGCTG 


TTTACACAAT 


TGAACAAGGT 


660 


AAAGAAATAA 


CTGAAGCAGT 


AACTGAAGGT 


GTGCCCAACA 


TATGTTTCAG 


TATTTGCAGG 


720 


ACGTATTGCA 


GATACAGGCG 


TAGATCCATT 


ACCATTAATG 


AAAGAGGCTG 


TAAAAGTTAC 


780 


GCATAGTAAA 


GACGGCGTTA 


AATTATTATG 


GGCAAGTTGC 


CGCTCGTGCC 




830 



(2) INFORMATION FOR SEQ ID NO: 1130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 1130: 




CCGGANGNTG 


GAAGTTATCC 


AAGCTCCAAG 


TTCGGGTTGC NGGAATTNGG TAAATTCAAG 


60 


ACGTGGTAGT 


TGTCGAGAGN 


ACTATCACAT 


TTAACCGTTG ANNAAAGTTT GTTTAAGCAT 


120 


TGATTCCATC 


TTGANACCNA 


AATTACACTA 


TATTGTTGAA TTTCAGGAAT AAAGAANAAA 


180 


CTATTATGGA 


TCGTGCGAAA 


CCAATGGACG 


AGNCNATANN AAATGGTGAA GATTATGCGA 


240 


GTCTGATTGA 


NNAAGCTAAA 


GNNAAAGGTC 


TATCAGATAT TCCAAATACC AAAATCTTCA 


300 


AGTATAGATG 


AATTAAAGCA 


ACTTGCTAAT 


AGCCATATAT CTGATTTGGA AAATAAAGCG 


360 


CAAGCATATT 


CAAGAACATT 


CGATGANCAA 


AAGCGATATA TGGATGAGAA ACATGAAGCC 


420 


TTCAAGCAGT 


CAGTGAATAG 


TGGTGGTTTA 


GTCACAAGTG GTTCTACTTC AAATTGGCAA 


480 


AAAGCTAAGA 


TTACTAAAGA 


TGATGGTAAG 


ATAATGCAGA TTACTGGATT TGATTTTAAT 


540 


AATCCAGAAC 


AAAGAATAGG 


TGATTCAACC 


CAATTTATTT ATGTTTCGCA AGCTATAAAT 


600 


TATCCAAGAG 


GTGTTAGTAC 


GTGCC 




625 



(2) INFORMATION FOR SEQ ID NO: 1131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131: 



GGCACGAGCG 


ATTAATAGAA 


ACGCTAGGAA 


ACCAACGATT 


CTCCATTTAA 


ATACTTTTAA 


60 


AAAATTAATC 


ACTTGTTGTG 


TAGAGTCTTG 


TCCGTTTTGG 


TTATGATTGT 


TAGCCATGAT 


120 


ATACCTCCCT 


TACAACACTC 


GTGGACCAGA 


AGTTTTCTGA 


TCTCTCACAT 


TAACTTCTAA 


180 


CTTACGTACT 


GGCATTTCTG 


TGAAATATTC 


TACATTCTTT 


TTAATATCCG 


AACGAATTGC 


240 


TTCAGTTAAA 


GATTGAACTT 


GAACATTATT 


TGGTACGAAA 


AAGTCAGTTT 


TAATGTCGAT 


300 


ATAAGATTTA 


TTTTTTTTGT 


TATATAGGTG 


CGCAACTACA 


TGTGGGTGTC 


TTACTTGCAT 


360 


CATATTTTGC 


AACCGTATCG 


AATGCCGTCT 


TTACAACAGC 


TTTACGAGAT 


ACGTAAACAT 


420 


GCACCATCAT 


CGAAGTCTTT 


GTATAATCCA 


GGTTTTCGAT 


GCGTAGGTTT 


GAAGATACTA 


480 


AATACTAATA 


TAAGACCTAT 


TAATATCAAT 


AGAGCAGCAA 


GTGAAATAAG 


TAATGGGTTG 


540 


GAACCACACA 


AATTGAAGGA 


AGTAGCTTTG 


AT 






572 



(2) INFORMATION FOR SEQ ID NO: 1132: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132: 



GGCACGAGCG 


GAATTATGAA 


AATAAGGAAG 


GTCAACGTGT 


ATATGTTACG 


GAAGTTATTG 


60 


CTGATAGTAT 


TCAATTTTTA 


GAACCGAAAA 


ACTCAAATGA 


CACTCAACAA 


GATTTATACC 


120 


AACAACAAGT 


ACAACAAACA 


CGTGGACAAT 


CGCAATATTC 


AAATAACAAA 


CCAGTAAAAG 


180 


ATAATCCGTG 


TGCGAATGCA 


AATGGTCCGA 


TTGAAATAGA 


TGACAATGAT 


TTACCANTCT 


240 


AATTTANCCG 


GTTTGAAAGT 


GAGGTGTGTA 


TATGACTGGT 


TGGATAAGTA 


TTGATCGCTC 


300 


AATTCAAAAT 


CATTGGCTAT 


TTAAAGAAAA 


GAGAACATTT 


TCAAAGTTTG 


AAGCATGGAT 


360 


ATATTTACTC 


ATGGAAGCGA 


ATCATTCAAA 


GGCAAAAGTG 


CCTATTGGAA 


ACCAAATTGT 


420 


AACCGTAGAA 


AGAGGACAAA 


GATTAACATC 


TATTTTGNCC 


TTGTCTTGAC 


CTTTTTAACT 


480 


GGTCACGATT 


TAAAGTGNAA 


ACCTTCCTTG 


ACTTACTCGA 


GAGTGATGGA 


ATGTTAGAAG 


540 


TCAAAACAAC 


ATCANAATAT 


ACCCTTATAA 


CCATTGTCAA 


TTATGACTTT 


TATCAAAGTG 


600 


AGCAGGGCAG 


GAACCAACAT 


CAAAACGACA 


TCAAACCAAC 


ATCAAAACAA 


CATCAGTCAA 


660 


ACATCAACCC 


AACATCAAAA 


CAACATCAAA 


CCG 






693 



(2) INFORMATION FOR SEQ ID NO: 1133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133: 



ACTTGATCAA 


GTTGGATTAT 


TGAATTTGAA 


ACNCCGTTGT 


ATCACCTTCT 


GGGCACGAGC 


60 


GTTGATGATT 


TAAGAAGATG 


CTGGTTCAAA 


TTGTTTATGC 


TGAAGACAGT 


ACCGTAAAGT 


120 


AAGAAGAAAG 


AAATCCAATT 


TACAACCGAT 


TGAAATTTTA 


CCATCAGATA 


TGGGCGGGGA 


180 


CTATGATGCG 


TTAGAAACAT 


TAATGGCGTC 


AAGACCTGAC 


ATTTTAAACC 


ATAATATTGA 


240 


AACTGTTCGT 


CGCTTAACAC 


CGAGAGTTCG 


TGCGCGTGCG 


ACTTACGACA 


GAACATTAGA 


300 


GTTTTTACGT 


CGTTCANAAG 


AATTACAACC 


AGATATCCCA 


ACTAAATCAA 


GTATTATGGT 


360 


TGGATTAGGT 


GAAACTATAG 


AAGAAATTTA 


TGAANCGATG 


GATGATTTAC 


GTGCGAATGA 


420 


TGTAGATATT 


TTAACGATTG 


GTCAATATTT 


ACAACCTTCA 


CGTAAACATT 


TAAAGGTTCN 


480 


AAAATATTAC 


ACGCCTTTAG 


AGTTTGGTAA 


ATTAAGAAAA 


GTGGCAATGG 


ATAAAGGGTT 


540 
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TAAACAGTGC CAACTGGACC TTTAGTACTT AGTTCTTATC ATGCGATGAG CAAGTAAATG 600 
AGCTGCTAAA GAAAAGCACG CC 622 



(2) INFORMATION FOR SEQ ID NO: 1134: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134: 



GGCACGAGAC 


ATCAAGTTGG 


TTTAAGTCTA 


TTAAAGTTGA 


ATCAACATGT 


GGCATTTTGT 


60 


GATGAAGATT 


CACTAACGCA 


TTCTCTCATT 


TTGGTATTGG 


AGTGCCATTC 


GACAATTCAT 


120 


TCACACTTAT 


CTGTATTACA 


AAATTTAGCA 


ACCGTACTGG 


GCGATAACCA 


AACAGTCCAG 


180 


CAACTATTAA 


CTGCAAAAAA 


CGCACAAGAC 


ATTAAAAACA 


TTTTAAAGGA 


GCATGATTAA 


240 


TATGAAAATT 


TTAGTAGTAT 


GTGGCCACGG 


TTTAGGAAGT 


AGTTTTATGG 


TAGAAATGAA 


300 


CGCACAAGAA 


GCACTTAGGC 


AACTTAATGC 


ACCATCTGAT 


ATCGAAGTTG 


AACATAGTGA 


360 


CATTATGACA 


GCAAGTCCAG 


AGATGGCTGA 


CTTGTTTATT 


TGTGGTAGAG 


ATTTAGCTCG 


420 


AAAATGCCGA 


ACGCCTAAGG 


GATGTCTTAG 


TTNNTGATAA 


CATTTTAGAT 


AAAGCTGAAT 


480 


AACAACAAAA 


GCTCTCAGAA 


ANATNACAAC 


AACTTAACAT 


GATTTAAAGA 


NGGTCTACCG 


540 


TGCC 












544 



(2) INFORMATION FOR SEQ ID NO: 1135: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135: 



CAATTTAAAA TGGNATAAAG AGTCNNTCCC AAGGATTAAA ATAGGAATCC ATCCTTGGTT 60 

GACTTTCTNT TGTTTGTATT GTTTTNACAC GATGCTGCGG NATACGTTTC ATTACCACTC 120 

GTTCAAGCAA TTGCTAATGT AACCTTTTAT TTTTCGTTCT AATCGGCAAT TACTATTTCA 180 

ATGGCAGTGG TTGATTTATA TCAATACCTA AAACTATAGG CATTACATAA TACACGACTA 240 

AAATAATAAT TATTGCACTG ATTAAGTTTA CCCAGAATCC TACTGATGCC ATTTGTTTGA 300 
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TAGATATTTT ACCTGAACCA AAGATAATTG CATTCGGTGG TGTCCCTACT GGTAACATGT_ 360 

ATGCACAGTT AGCCGCCATA GCTGCAGGTG CCATAAGTAG TAATGGATGC ACTCCAACAG 420 

CAACAGACAA CGTTGCTAAA ATCGGTAAAA TCATCGTTGC AGTTGCAGTA TTAGATGTCA 480 

CTTCAGTTAA AAATAAGACA AAGATTGTTA TGACAATTAC AATAAGAATC GGACTAACAC 540 

CATTTAATGA TTTCAACTGT TCGCCTAACC ATTTTGCTAA ACCACTTTCA GAAATACCTT 600 

TCGCTAATGC TAAACCGCCA CCAAATAAAA TTAATACACC CCAAGGGAGC TCGTGCC 657 

(2) INFORMATION FOR SEQ ID NO: 1136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136: 



GGCACGAGCG 


CCAGAACTTT 


ACCTTTTGCC 


TTTTTCAAAT 


CACACCAAGT 


AACATATTCA 


60 


AAGCGTTGTT 


GGTTTATAGT 


TACCTTCAGT 


TGTTAAAAGT 


TGTGTTTGTT 


TAACTTTAGA 


120 


AACTGATTTT 


TTCGGTAAAA 


TCATATCTTC 


AATTTTACGT 


AAAAGCAAAT 


CAAATAAATA 


180 


CGTATCATCA 


TGATGCCATA 


ACGGCAATAT 


CGCTAATGCT 


TGATAGTCGC 


CCGCTAATTC 


240 


TAAATAGATA 


TGTGCATTCA 


TAAACCAATT 


GTGAATATCA 


TGACCTGTAA 


AACCTTCATA 


300 


TTTTATCAGC 


AACTTCAACG 


GATCATCAAC 


TTTGGAAGCA 


TTTCAAAAAC 


CCTTCTTCTC 


360 


CAAACATTCC 


GATTAATTGC 


GCTCTCTTTA 


TCAAAAAACA 


CGGTACTATC 


ATATGTTTTA 


420 


TAGAACTCGG 


CAGCTGACTC 


TAAACTAGCC 


ATAATCAAAT 


ACGAAGGACT 


AGATGTTTGG 


480 


AAGTAGCTTA 


GATATTCTAT 


AATAGTTTCT 


CTATAAGGTG 


CATTTTTATG 


AATATAAAGT 


540 


ACCGAGCCCA 


TCGTTAAAGC 


TGGTAACGTT 








570 



(2) INFORMATION FOR SEQ ID NO: 1137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137: 
TACATGTAAA AGNANATGAA TATATACATC TTTAGTCTAG TATTATACCT ATAACTAGCA 60 
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TGTCTTTAAA 


TGAATATGCT 


ATCATGATAT 


GGTAATTTTG 


ATATTTAAAT 


AAATTAAATT 


120 


GAGTAATGAN 


AGTGGGTACA 


TTATGTTTAT 


CATTGAATTA 


ATAAAAGGTA 


TTATCTTAGG 


180 


GGTCGTCGAA 


GGANTAACAG 


AATTTGCACC 


TGTTTCCTCT 


ACTGGACATA 


TGATTCTAGT 


240 


TGATGATATG 


TGGTTAAAAT 


CATCTGAATT 


TTTAGGTTCT 


CAATCAGCAT 


TTACATTTAA 


300 


AATCGTCATC 


CAATTAGGAT 


CCGTCTTTGC 


AGCAGCATGG 


GCATTCCGAG 


CGAAACGCTT 


360 


CTTAGAGATT 


TTACATATTG 


GTAAACACAA 


ACATGTTGAA 


GGAGAAAACG 


ATCAACAAAG 


420 


ACGTTCAAAG 


CCAAGACGTT 


TAAATTTATT 


ACATGTATTA 


GTGGGTATGG 


TGCCAGCAGG 


480 


TATTTTAGGC 


TTACTATTTG 


ATGATTTCAT 


CGAAGAACAT 


TTATTTAGTG 


TGCCAACTGT 


540 


TATGATTGGC 


TTATTCGTAG 


GTGCTATCTA 


TATGATTATT 


GCTGATAAGT 


ATTCAGTTAA 


600 


AGTTAAAAAC 


CCACAAACAG 


TGGATCAAAT 


CAATTATTTC 


CAAGCGTTTG 


TAATTGGTAT 


660 


CTCTCAAGCA 


GTAGCTATGT 


GGCCTGGTTT 


CAGCCGTTCA 


GGCTCAACAA 


TTTCAACTGG 


720 


GGTTATTAAT 


GAAATTAAAT 


CATAAAGCAG 


CATCGGACTT 


CACATTTATT 


ATGGCTGTTC 


780 


CAATTATGTT 


TAGCAGCTAG 


TGGTTTTCTT 


TATTAAAACA 


TTATCAAGAT 


ATTCCAGATT 


840 


GCTGATATAC 


CTTTTTATAT 


TTTAGCTTTT 


TAGCTGCATT 


TACTGTTGGA 


CTAATTGCAT 


900 


CAAACATCCT 


TGCATCTAAT 


AACAAATC 








928 



(2) INFORMATION FOR SEQ ID NO:1138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138: 



GGCACGAGCT 


CATTAACAAT 


TAGTCTATAT 


AACCCTTGCT 


ATATTTTCAA 


AAACAAAACC 


60 


CAATTACGTT 


TTCATGTCAA 


ATATCATCTT 


GCATGAAATC 


GTAACTGGGT 


CAATTGTATG 


120 


TTATTAGTTA 


TTTTGTGTTA 


CATCCTCATC 


TATCGATTTG 


GCAATTTGTT 


TAATAGCTTT 


180 


ATGTGATTGT 


CTAATTGGAT 


AAATTGGAAA 


ATCATGTACC 


ATCTTAGGAT 


AATCATAAAA 


240 


CTCAATGTAT 


TGATGATGTT 


GCAACATCAT 


TTGTTCGAAT 


AGCTTCATAT 


CAGGATGTGT 


300 


CATTTCACGT 


CCACCACCAA 


ACATATAAAC 


TGGTGGCAAT 


CCTTCTATTG 


TGCCATTAAT 


360 


TGGCGATATG 


CGCTTATCTG 


TTAATGGTAG 


GCCATTCGCC 


CATTTTTTCA 


TAATCTCATT 


420 


GACACCAAAC 


TGACTTAGAA 


CCGCATCTTG 


TTCGATTAAG 


GCGTCCGAAA 


TATCTTTATT 


480 


AGATAGTGTT 


GCATCTAAAA 


TTGGTGAGAT 


TAAAATACAA 


TTTATTCCGG 


TAATGGGCTG 


540 


TTGATTATCT 


AAAAGAGATT 


GTACAAAGGA 


TAATGCCAGT 


GCACACCTGA 


ACANTC 


596 



(2) INFORMATION FOR SEQ ID NO: 1139: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 517 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1139: 



GGCACGAGCG 


AGGGTTAATA 


TTATTAATTG 


CTATATGTTT 


GCTTTATCTA GCTTTTTATT 


60 


ATTTTTCACC 


AGATAAGGAA 


GAGAAACTAG 


CTAAAAATAA 


TATTCTATTC 


TTTTTAACAT 


120 


GGGCAGTTCC 


AGCGGCAATT 


AGTGGTATTT 


ATATTAAATA 


TATAAACAAG 


GCTACGGTAG 


180 


AAAGATTTTT 


TAAATTAGTA 


TTTTTCATAT 


TTTCTATTTC 


ATTTATTTTT 


GTAATTTTAA 


240 


TACCAAAACT 


TACAGGTGAG 


ATACCTAGCT 


ATATCAATTT 


TGGACTTATG 


AACTATCAAA 


300 


ACGCTTCGTA 


CCTTTCAGCA 


TTTACTGCCG 


GATTAGGCAT 


TTATTTCATT 


ATGAAAGGTT 


360 


CAGTGAAACA 


TAAGTGGATA 


TATGTTCTAT 


TTACAATAAT 


TGATATCCCT 


ATTGTGTTTA 


420 


TACCAGGAAG 


GCGTGGAAGT 


GCTATTTTAT 


TAATTCTTTA 


CGGCTTTATT 


TGCATTTATA 


480 


CTTATTACGT 


TTAAAAGAAG 


AATACCTATT 


GCAGTAA 






517 



(2) INFORMATION FOR SEQ ID NO: 1140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140: 



GGCACGAGCT 


GTTAAAGTAA 


ACTATACAAT 


TTAATTTCAA 


GGAAATTTCT 


CTTCAAACCA 


60 


TCTTGTTGCA 


AATTCATTTT 


CAAATAAGAA 


TACTAAATCG 


TCATATCTAT 


CTTTCACTAA 


120 


AATCGATCTT 


GATGTGTTCA 


TCTTATCTGT 


AATTTGGTCT 


TCATTTTCAA 


TCCAACGTGC 


180 


AATTTTACGG 


CCTACTGGCT 


CCATAACAAC 


ATCAACATTA 


TATTCGTTTT 


TCATTCTATG 


240 


TTCGAAAACT 


TCAAATTGTA 


ACTGACCAAC 


AGCACCTAAA 


ATAATTTGGT 


TTGTGTGTAA 


300 


TGTTTTATAG 


TATTGAATCG 


CACCTTCTTG 


TACTAATTGT 


TCAATACCTT 


TATGGAAATG 


360 


CTTCTGTTTC 


ATGACGTTTT 


TAGCAGAAAC 


TTTCATAAAA 


ATTTCTGGCG 


TAAATTGTGG 


420 


TAAATCTTGG 


AAACTGTANG 


TTTGTTTTCC 


ACCAACTAAA 


GTATCTCCAA 


TTTGATAATT 


480 


ACCAGTATCA 


TATAGTCCAA 


TGATATCGCC 


TGCTACAGCA 


TGATCACAGT 


TTCTTTATCG 


540 


TCTGCCATAA 


ATGACTTGAA 


CGTGTGATCT 


T 






571 



(2) INFORMATION FOR SEQ ID NO: 1141: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141: 



GAATTCCATC 


CATTCGTTAC 


ATCAAGTAAT 


CCAATTGCAG 


GTAACGTTAC 


TGTTGGTACA 


60 


GGTGTAGGTC 


CTACATTCGT 


TTCGAAGGTA 


ATTGGTGTAT 


GTAAAGCTTA 


TACATCACGT 


120 


GTTGGTGATG 


GTCCATTCCC 


TACTGAATTA 


TTCGATGAAG 


AAAGAAACTA ACAACTGTAA 


180 


TCAATAATGC 


AAATAAAAAT 


CCAGCGTGAT 


AACCATGCAA 


CATCGTATCA 


ACTTTAATTT 


240 


GATTCATCAT 


ATCAATTTTG 


TTCATGCCAT 


TATAGTCTGA 


CATAGTTGGA 


GAAAAGTTTT 


300 


TGGTTGCAAT 


GAGATAAAAT 


AGTAATTAAT 


GCAGGTTGGT 


CCCGATAGAA 


CCAGCAATTG 


360 


GTGGTNCTGT 


GTTTNGGCAA 


TAGGATGAAC 








390 



(2) INFORMATION FOR SEQ ID NO: 1142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142: 



CTACACTTCA 


CGTTGATAAG 


CAGCTTCTAA 


TAAATGGGTA 


ACCATATTAT 


GATAATATGT 


60 


TCTAGATAAA 


ATTGAGCCAA 


TAACCCAACG 


CTTAAATCTT 


GGATTATCAC 


TTGTGTTAAA 


120 


TAAATAATTG 


GCCATCAACA 


TTTCATTCAT 


TGTAGAAGGG 


GCTTCAACAA 


AAGTACATTG 


180 


ATGCTTCTGA 


TTCAAGATAT 


GGTTGATGTT 


TTTGAGCTAA 


TGTAAAATGA CCTGCATGAC 


240 


CTAATTCATG 


TGCTAAGACA 


AATGCTTCAG 


CCATTTTACC 


AGTCCAAGAA 


ATAAACACAT 


300 


ATGAATGTGT 


AAAGTATGGA 


CTTGNACAAA 


ATGCGCCTGT 


ATCTTTACCT 


TTATTTCGTG 


360 


CAAAATCAAT 


CCATCGCTGA 


TCGTATGCTT 


CACGTAACAT 


GTTTGTATAG 


TCATCACCTA 


420 


AAACAACTTA 


ACGCACCCGA 


AAAATATAAT 


TTTTTTGAGT 


CTTCAATTGA AATCTCTGGG 


480 


TTCATAATCA 


GGGTCTACAG 


AAATCTTCAA 


GTCTTCAAAA 


CGCATGTTAT 


CTAATCCGTG 


540 


AATACGTTGT 


AAAAGTTTAG 


CATATTTCTG 


CATAACTGGT 


GCCAAGTCAC 


GCATAATCAT 


600 


ATCGATTTGA 


CGGTCAAACA 


TATCACGCGT 


TACTTCTTGA 


CTATGTAATA 


AATAATCGAT 


660 


GACTGATTCA 


AATCCACGTA 


AATCAGCTTC 


AATTTTTTCT 


TGTTGTACTT 


GCATATTAGA 


720 


TGTAGCCGCT 


CGTGCC 










736 
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(2) INFORMATION FOR SEQ ID NO:1143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143: 



AGNCAACTTA 


ATTGTGTTAA 


ATACAGTTTN 


GNATGCCAAA 


TGATGTGGTG 


GATGTTAATT 


60 


ATAACATATT 


AAGCCCACTC 


AATCGTTCCC 


CAATATTNTT 


CATTTNTAAT 


CTTCTGTTCT 


120 


TTATCTGNGA 


TTCTACACAC 


TGCACAATAG 


AAATCGTTAG 


TACTAGAGCC 


CTCACGTNTG 


180 


ATATTNGAAT 


CTANTCCACC 


ANGTAGCCAT 


CTTTGTTANT 


GACTTGGTCG 


AATTTNACCC 


240 


AATCANCTTT 


NGTGTATAGC 


CACGAATCTT 


CTTCAACGAC 


TGTACCGGTT 


AATCCNGNTG 


300 


TTTTTCTGAC 


TCTTATAGCT 


TTTTCTGGAT 


TAGGATAAAA 


TACACCTTTC 


CAATTCCATG 


360 


TTATTTAGTC 


AGCACTTGGA 


TTACTACTTG 


GCGCATCAAT 


TTGTCTGCCG 


ATAATAGCTT 


420 


CAGCAATCCG 


CTTCGNGAAG 


CTATCTAAAT 


TATTTTNAAT 


GTAGTTTAAA 


TCTTTCGTCG 


480 


ATGTGATAAA 


ACCTAATTCG 


ATTAAACGAT 


AATTAAGATT 


AAGGTCAGCG 


GACACGTTAG 


540 


CGTTCAATAA 


ATCCCCTCTA 


GGTGTCACAC 


CTCTTATTTT 


ACCTACTGTT 


TTATCTAATG 


600 


CGCTACTTAA 


TGCCTTGTCA 


ATGTCATCAG 


CTGGGAAACG 


GTCGCTAATG 


ATTACATGCC 


660 


CGCCACTTGC 


TTGCGGACTA 


GCAGAATCTA 


AATGAAACTC 


TATGATTGCA 


TCCGGTTTAA 


720 


CTTCGTTTTT 


AATCCAGTAC 


ATGCCATAGT 


CTTTATAGTT 


TCCAACACGT 


TGACCGCACA 


780 


AAGTATCTTG 


ATATAAGTCT 


TGATTCATCG 


AGTTGCCACC 


GTATAACAAT 


ACTGTGTTGC 


840 


CTACTGACTC 


AAGATACTTT 


TACACTCTAG 


GTATAATATT 


TTTACGGNTA 


AAATCTCTTT 


900 


CGTTTTCTCC 


ATTCGCAACG 


GCACCTGGGT 


CGGTAGAGTA 


TGCACCAATA 


CCATGACCAG 


960 


CCACAAGCAT 


GATTTTTTTA 


CCTTGTGATA 


ACTTATCTTG 


TTTAACTGGC 


GTCACTGCGC 


1020 


TTCTTAGCTT 


ATTAGCGGTC 


GTTTCTTTTG 


CGTAGAATGG 


ACGGATAAAC 


CACATAGGGA 


1080 


AGTCGNAGCC 


GTGTGTACGT 


CTTGTAGTAA 


CTTCTGGTGG 


ACTCCAGTAA 


GCACCGCCTA 


1140 


GCCAGTTCTG 


CTCTAAAATA 


GTTATAGAAT 


CTAACGTAGC 


GCTTATTACA 


ATACCTACAT 


1200 


GACCATAACC 


ACCGNGATAA 


TTACGGTTAA 


AAATAACGAC 


GTCGNCCCGG 


CAATGCTTGA 


1260 


AACGACACAG 


TATTTTCGGT 


AAACGGTTGC 


TTCGTTAGTG 


AAATCATTCC 


ATGTAGGAAT 


1320 


GTCCGCAGCG 


CCCACACCTT 


TCAACCTATG 


ATTAAATAAG 


TAAAGCCAAT 


ATTGGTTGGC 


1380 


AGTATCGAAG 


CATTGACATC 


CAAATGCATT 


GTCTGGATTC 


CACGCCTTAC 


CCTCTAGGCT 


1440 


TTTAAGGTAG 


CTAATAGCTT 


GACTGTATGT 


TCTAACCGAC 


GGCATTGTTA 


TCATCTCCGC 


1500 


TCACTTTAGG 


TGCGCCACCA 


GTTGACTGAA 


TGOCAGCTTT 


TACTTCATAA 


ATTTTGTGTT 


1560 


GCCCTTTCTT 


AGATGCGTGA 


GTAAAGNTGG 


GNTATTCTTC 


CACCACGCCC 


AAATTGAAAC 


1620 


AATCCCAGTA 


ACGACTGTGC 


TTATAAACAC 


TTCGNNCAAC 


TGGGATTTGG 


AGAAATATTG 


1680 


TTTNGGGTTT 


GGTTAAAAAC 


TTGATTGGAT 


CCCCAAGCGA 
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(2) INFORMATION FOR SEQ ID NO: 1144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144: 



ATCATGTTAC 


AAATCCATTA 


GGTGTAGGAG 


GTATCAATGA 


TGCAACAACT ATCTCTGAAA 


60 


CATAGATTAA 


ACAATGGTGA 


TTCAGTTTAT 


GGCATTTTTA 


ATTCTATACC 


GGACCCATTG 


120 


ATGATCGAGG 


TTATCGCAGC 


AAGCGGGTAT 


GACTTTGTTG 


TGATTGATAC 


AGAACACGTG 


180 


GCGATTAATG 


ATGAGACACT 


AGCGCATTTA 


ATTCGTGCAG 


CTGAAGCAGC 


GCATATTATA 


240 


CCAATTGTAC 


GTGTCACTGC 


AGTGATAGAT 


AGAGATATCA 


TTAAAGTGTT 


AGATATGGGT 


300 


GCGAGAGGTA 


TTATTGTGCC 


ACACGTTAAA 


GATCGTGAGA 


CAGTTGGAGC 


ATATTGGTGA 


360 


AATTAAGTCG 


GTTATTACCC 


GCAAGGATTA 


AGAAGTTTGG 


ATGGTGGTCG 


CATGGCAAGA 


420 


TTTGGACGGT 


ACAACCATTA 


CTTGGATGCA 


ATGGAGATGG 


TTAATGAGCA 


TTATTATGGG 


480 


GGATTGCCAT 


GATAGAAGAT 


TGTTGGAGGG 


GGTATTGGCC 


ATGGACGATA 


TAGCTCAAAG 


540 


TCGAAGGGGT 


TTNGACAATG 


ATAGCCGAAG 


GTGCCNGCAG 


ATTTATCGCA 


GTCACTTGGC 


600 


ATACCATGGC 


AAACGCTCGT 


GCCGAATTCG 


GCACGAGCGG 


CACGAGCGGC 


ACGAGCTCGT 


660 


GCCGCTCGTG 


CCGCGGAAAA 


ATATATCGCC 


ATTTAAGTGC 


ATCTCTAGCG 


ACGTCTAAAC 


720 


AGAAAGGGGA 


TGATGGCTAA 


TGCGTATAGC 


TCANCCTGNT 


ATTGAACAAT 


TAAAAGCACA 


780 


ATCTCATCCA 


GTGTGNCATT 


ATATCTATGA 


TTTAGTCGGA 


CTGGAACATC 


ATGTGCAACA 


840 


TATTACATCG 


NCATTGGCGA 


GTAATTGTCA 


AATGTACT AT 


GCAATGAAAG 


CAAATAGTGA 


900 


ACGAACAATC 


CTAGATACAA 


TTAGNCAGTA 


TGTNGANGGA 


NTCGAAGTNG 


CATCTCAAGG 


960 


TGAAAATAGC 


AAAAGGTCTT 


GCTTTNANCC 


AGCAACNCAT 


ATTAT 




1005 



(2) INFORMATION FOR SEQ ID NO: 1145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145: 



ACATTTTCAG CATCAGCAAT CGCTTGTTTA TAAACCTTTG GCTTATCTTG AATCTGCTTG 

968 
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ffVViywtK TA flVT 

L\j 1 vj i AA 1 V- X 


ATAGTTTGCT 


LTLJAAUAj X 1/lvj 


CATGTTCATT AACTGCTTGT TGTAATTGAT 




UL-AX XviAl X\j 


Cil XvAAXIvtC 


lj 1 XviL, X X\iA X 


CTACTAATTC 


AGCGATTGGN 


TGAAGTTTCG 


1 OA 

loU 


TCGCTTGATC 


AATATTTTGT 


TTAGCAGTTG 


CAATTTGATC 


AGCATTTAAA 


TGTGTTAATT 




UC»Xa_AAXAIjX 


TTGTTTCGCT 


'L\S i\S IT l~L"rA\j 


CTTCAGCGAC 


TCTCTCATTA 


CCATTTAACT 


T Aft 

300 


LAi 1VAL 111 


1 1 IV 1 lvxAAL? 


CTTAGTTAAT 


GCTTGTTCTA 


CAGCGTCTTT 


ATTCGCATTT 


1 C rt 


wVACwiX lAt? 


1 1\AjA1V 1 \j 1 


AAluLMl lui 


GCAGCTCGTA 


ACGCTTGATC 


CTACTGCTTC 


420 


11111 1A1\,A 


GTTGACGCTT 


CAGTGTAATT 


TGGTTGAGCC 


TTATCTGTAT 


TCACTTGATC 


480 


AACTTTATTT 


TTCAATGTTT 


CCATCGCGTG 


Al\.nnui J. v_/\ 


VI 1 A\J\-J\\3 1 X X 


UAAL, ATC» lib 


540 


TGCAACCTCT 


GTTCTTGTAG 


GCGCTGCATT 


TATAGCATCA 


GTTAATGTTT 


GTTGTTGTGC 


600 


ATGATTTAAG 


TTTGGCAATG 


CATTTACAGT 


TGTTACTGCT 


TGTTGTTGAT 


CACGAGCAAG 


660 


TTTTTGATCA 


CCATGTAGAT 


TATCTTTTGG 


AGNTGGTACT 


TGCTTGTGTT 


AATTGTTCTA 


720 


CTTGGTGATT 


TGTCGAGTGT 


TGGATTACCT 


GTTTGGTTAA 


TTAAATCTTT 


TGGATTTNGG 


780 


GCTTGCTGCT 


TGGGNAAGCA 


TCTTTTGG 








808 



. (2) INFORMATION FOR SEQ ID NO: 1146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146: 



CTATTAAGGA 


ATCCTTGNNA 


AATTCACATT 


AGCAAGTTGG 


ATTGTCCACC 


TTATTAATTG 


60 


ATAAAGTATG 


CAAATAATTT 


CACAGTGTTT 


CCAGCGGGGC 


CAANTTATGA 


TTGCAAGTAA 


120 


CCTATAGCAA 


TCGTATTCTT 


GTTCTTGCCA 


ACGCTATTTA 


AGTATCAGGT 


TTAACCAACA 


180 


GGTGCGACAA 


AAGGTTAGTT 


TGAAATTAGG 


GAGTGGGGCA 


GAATTGATAA 


AGAACCACTA 


240 


ATGACGATAA 


AGATTAAAAG 


GGAGGACGTT 


ATGATGACGG 


ATTAAAGTTG 


GAATCATTGG 


300 


GTGTGGTGGT 


ATTGCGAATG 


GCAAGCAACA 


TGCCAAGTTT 


ACNAAAAGTT 


GAAAATGTTG 


360 


AAATGATCGC 


An u ini?iX3AC 


GTAGACATTT 


CGAAAGCAGC 


GAGTGCGGCA 


GAAGCATACG 


420 


GAACTGACAA 


TGCAAAGGTT 


TATGATGATT 


ACAAAGCATT 


GTTAAAAGAT 


GACACGATTG 


480 


ATGTTATCCA 


TGTTTGTACG 


CCAAATGACT 


CGCATTGTGA 


AATTACTGTA 


GCAGGGTTGC 


540 


ATGCCGGTAA 


GCATGTGATG 


TGTGAAAAAC 


CAATGGCTAA 


AACGACAGCA 


GAAGCTCAAA 


600 


AAATGATAGA 


TACAGCTAAA 


TCAACAGGTA 


AAAAATTAAC 


AATAGGTTAT 


CAAAATCGCT 


660 


TCAGACCAGA 


TAGTCAATTT 


TTACATCAAG 


CAGCGCAACG 


TGGCGACTTA 


GGAGACATTT 


720 


ACTTCGGAAA 


GGCACATGCC 


ATTCGTCGTC 


GTGCAGTACC 


GACATGGGGT 


GTCTTTCTAA 


780 


ACGAAGAAGC 


CCAAGGTGGA 


GGACCTTTAA 


TCGATATTGG 


TACGCACGCA 


TTAGATTTAA 


840 


CGTTATGGAT 


GATGGATAAT 


TATGAACCAG 


AATCAGTGAT 


GGGTTCAACA 


TTCCATAAAT 


900 


TAAATAAGCA 


GCATGATGCG 


CCAAACGCTT 


GGGGTTCATG 


GAATCCAGAT 


GAATTAACAG 


960 


TTGAAGACTC 


TGCGTTTGGC 


TTTATTAAAT 


GAAGAACGGA 


GCGACGATCA 


TTTTAGATCC 


1020 



969 



WO 97/30070 



PCT/US97/02318 



GCTTGGGCGA 


TTAATTCTTT 


AGAAGTGGAT 


GAGGCAAAAT 


GTTCATTATT AGGAACGAA\ 


1080 


GCAGGTGCTG 


ATATGAAAGA 


TGTTCTACGT 


ATTCATGGTG 


AAGATATGGG 


CACACTTTAT 


1140 


ACCAAACACG 


TTGAATTNGA 


AAACAAAGGC 


GTCGACTTTT 


ATGAAGGTAA 


TGAAGTGGAT 


1200 


GAAGCTGAAG 


AAGAANCAAA 


AGCTTGGATT 


GATGCAGTTG 


TAAATGATAC 


TGAACCAGTT 


1260 


GTGAAACCGG 


AACAAGCAAT 


GGTAGTTACA 


AAAATTCTTG 


AAGCGATTTA 


TCAGTCTGCA 


1320 


AAATCAGGCA 


AAGCAATTTA 


CTTTGAATAA 


CATCATACGG 


TAAGGAGGCA 


CATCATGACA 


1380 


AAATTAAAAG 


TTGGTGTGAT 


AGGTGTTGGT 


GGCATTGCAC 


AAGACCGTCA 


TATTCCAGCA 


1440 


TTGCTGAAAC 


TCAAAGACAC 


AGTCTCATTA 


GTTGCAGTAC 


AAGATATTAA 


TACAGTGCAG 


1500 


ATGATTGATG 


TTGCGAAGCG 


CTTTAATATA 


CCTCAGGCAG 


TTGAGACACC 


TAGCGAGCTG 


1560 


TTTAAACTTG 


TTGATGCGGT 


GGTCATTTGT 


ACACCCAATA 


AATTCCATGC 


TGATCTTTCT 


1620 


ATAGAAGCAT 


TGAACCATGG 


TGTCCATGTC 


TTATGCGAAA 


AGCCAATGNC 


AATGACGACG 


1680 


GAAGAGTGTG 


ATCGCATGAT 


TGAAGCGGCT 


AATAAAAATC 


ACAAATTATT 


AACTGTCGCT 


1740 


TATCATTATC 


GTCACACAGA 


TGTCGCTATG 


ACTGCTAAAA 


AAGCAATTGA 


AGCAGGTGTG 


1800 


GTTGGTAAAC 


CATTAGTAGC 


ACCGTTGTCC 


AANCGATGCG 


TAGGCGTAAA 


GNACCTGGGT 


1860 


GGGGCGTTTT 


TACCCAATAA 


AACCGTTGCA 


AGGTGGCGGT 


AGTTTAATCG 


ATTATGGTTT 


1920 


CCCACTTGTT 


AAGACTTATC 


TTTGTGGCTA 


TTAGGTAAAG 


ATATGGTGCC 


GCATGAAGTG 


1980 


CTAGGAAAAA 


CATATAATCA 


ATTGAGCAAA 


CAACCGAATC 


AAATTAATGA 


TTGGGGAACA 


2040 


TTTGATCACA 


CTAAATTTGA 


TGTCGATGAT 


CATGTTACTA 


GTTATATGAC 


ATTTGCCAAT 


2100 


CGAGCAAGCA 


TGCAGTTTGA 


ATGTTCGTGG 


TCTGCAAATA 


TAAAGGAAGA 


TAAGGTACAC 


2160 


GTTAGTTTAT 


CAGGAGAAGA 


TGGCGGTATC 


AATTTATTTC 


CATTTGAAAT 


ATATGAGCCC 


2220 


CGCTTTGGAA 


CTATTTTTGA 


AAGCAAAGCT 


AATGTTGAGC 


ATAACGAAGA 


CATTGCTGGT 


2280 


GAGAGACAGG 


CGCGTAACTT 


TGTCAATGCG 


TGTTTAGGTA 


TAGAAGAGAT 


TGTGGTGAAA 


2340 


CCGGAAGAAG 


CACGCAATGT 


AAATGCCCTT 


ATAGAAGCGA 


TTTATCGTAG 


CGATCTTGAT 


2400 


AACAAGAGCA 


TACAACTTTA 


ATGATTATTA 


TATATAATAC 


AAAATTCTCA 


ATATAAAAAG 


2460 


ANGGAGTGCT 


TTTCAATGAA 


AATAGGTGTA 


TTTTCAGTAT 


TATTTTACGA 


TAAAAATTTT 


2520 


GAAGATATGT 


TAGATTATGT 


CGCAGAATCT 


GGATTGGATA 


TGATTGAAGT 


TGGAACAGGT 


2580 


GGTAACCCAG 


GAGATAAATT 


TTGTAAGTTA 


GATGAGTTGT 


TAGAAAATGA 


AGACAAGCGA 


2640 


CAAGCATTTA 


TGAAGTCAAT 


CACAGACAGA 


GGCTTACAAA 


TAAGTGGTTT 


CAGTTGTCAT 


2700 


AACAATCCAA 


TTTCTCCAGA 


TCCGATAGAA 


GCGAAAGAAG 


CCGATGAAAC 


GTTACGTAAA 


2760 


ACAATCCGTT 


TAGCAAATCT 


ATTAGACGTG 


CCAGTTGTTA 


ATACATTTTC 


TGNCATTGCA 


2820 


GGGTCAGATG 


ATACCGCTAA 


AAAGCCTAAT 


TGGCCTGTTA 


CACCTTGGCC 


AACAGCCTAC 


2880 


TCTGAAATTT 


ATGATTATCA 


GTGGAATGAA 


AAGTTGATAC 


CATATTGGCA 


AGATTTAGCT 


2940 


GAGTTTCCNC 


AAGAGCAAGA 


TGTNNAAATT 


GCCATNGAGT 


TACATNCAGG 


ATTTTTAGCT 


3000 


CANACACCAA 


ATACGATGTT 


NAAGTTACGT 


GAGCCAACAA 


ATGAATATAT 


CGGTGCTAAC 


3060 


TTAGATCCTA 


GTCATTTATG 


GTGNCAAGGT 


ATTGACCCAA 


NTCCTGCGAN 


TCGCATATTA 


3120 


GGCCCNANCA 


AATNCAATTC 


ATCACTTCCA 


TGGAATTCCG 


AAGAAACGTA 


TGTTANTCNA 


3180 


GGGNATGTAA 


ANATGTATGG 


TCTAGCTGAT 


NTCCAGCCAT 


ATGGTNACGT 


TGCGACANGN 


3240 


GCATGGACAT 


TCCNTACAGN 


TGGNTATGGA 


CATAGTCCAT 


ATGNATGGGC 


AGATNTCATA 


3300 


AGTCAACTTA 


NTATTAGATG 


GAG 








3323 



(2) INFORMATION FOR SEQ ID NO:1147: 

(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147: 



GGTCTGCATA 


AAACCTTGGA 


AGTACTTAAG 


AAATCCCAAA 


GCCAATACAT 


✓*v^i m Attn ^ ^tnvn^ 

GGATTACTTG 


60 


CACCCACCGT 


ATCGANACAA 


TGTTTGCGAA 


ACATAACGTA 


AAATATGGAG 


ACAATCCTTT 


120 


GATGCGTTGG 


TTTACTAATA 


ATGTTGCAGT 


AAAGGTTAAA 


CCCCGATGGT 


AATAAAGGAA 


180 


TATATTANNN 


NANGATGNAA 


ATAGAAGAAA 


AACCCGATGG 


GTTCATGGCT 


TTTGTTCACG 


240 


CATTATATAG 


AGCAGACGAT 


ATAGTAGACA 


AAGACATGTC 


TAAAGCGCTT 


GATGCATTAA 


300 


TGAGTATAGA 


TTTCTAATAG 


AGGAGGTGAG 


ACATGAGTAT 


TCTAGAAAAG 


ATATTTAAAA 


360 


CTAGGAAAGA 


TATAACATAT 


ATGCTTGATT 


TAGATATGAT 


AGAAGATCTA 


TCACAACAAG 


420 


CGTATGTGAA 


ACGTTTAGCG 


ATTGATAGTT 


GTATTGAATT 


TGTTGCGCGA 


GCTGTCGCTC 


480 


AAAGTCATTT 


TAAAGTATTG 


GAAGGTAATA 


ACTGGCTCTG 


GGAATTCCTG 


CAGCCCGGGG 


540 


GATCCACTAG 


TTCTAGAGCG 


GCCGCCACCG 


CGGTGGAGCT 


CCAGCTTTTG 


TTCCCTTTAG 


600 


TGAGGGTTAA 


TTTCGAGCTT 


GGCGTAATCA 


TGGTCATAGC 


TGTTTCCTGT 


GTGAAATTGT 


660 


TATCCGCTCA 


CAATTCCACA 


CAACATACGA 


GCCGGAAGCA 


TAAAGTGTAA 


AGCCTGGGGG 


720 


TGCCTAATGA 


GTGAGCTAAC 


TCACAATAAT 


TGCGTTGGCG 


CTCACTGCCC 


GCTTTCCAGT 


780 


CNGGGAAACC 


TGTCGTGCCC 


AGCTTGCAAT 


TAATGAATCC 


GGCCAACGCG 


CGGGGAGAGG 


840 


NGGGNTTGGN 


GTATTGGGCG 


CTCTTCCCGC 


TTCCTCGCTC 


ACTTGACTCG 


CTGCGCTCGG 


900 


TCGTCCGGNT 


TGNGGNGAGC 


GGGNATCAAG 


NTCAACTCAA 


AGGNGGGNAA 


TACGGGNTAT 


960 


CCACAGAATC 


AAGGGGGATA 


ACGCAGGAAA 


GAACATTGTT 


GAGCAAAAGG 


CCAGNAAAAG 


1020 


GCCAGAACCC 


NNAAAAGGCG 


CGNNNNGCTG 


GNGNNNNTTC 


CCCATAGGCT 


CCCCNCCCCC 


1080 


CTGGACGANN 


ATCAAAAAAT 


CGNNCGGTCA 


AATNNAGANG 


GNNGGGGGAA 


ANCCCGCNAG 


1140 


GATTATTNAG 


GNTCCAGGNG 


GNNTCCCCCC 


G 






1171 



(2) INFORMATION FOR SEQ ID NO: 1148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148: 
CTGCTTAAAT TCGAACGTGT CATTGACATT TTCAATAAGA AAAACAACGG AGCATGGAAT 60 
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TTCTGTCCGC 


AAAACGCCGG 


TTATTGGGAA 


CATATCCCTA 


AGAGTATTAC AAAATTATCA 


120 


GATTTAAAAA 


TCGTTGGTTT 


AGATTTCTAT 


ATCACTACTG 


AAGAATCAAA 


ACGATTTACT 


180 


GATTTTCCTA 


AAGACTTTAA 


AGGTATTGCA 


GGTTGGATAT 


TAGAAGTAAA ATCGAATACA 


240 


CCAGGTAACA 


CAACACAAGT 


ATTAAGACGT 


AATAACTTCC 


CGTCTGCACA 


TCAATTTNTA 


300 


GTTAGAAACT 


TTGGCACGAG 


ACTGGTGGCG 


TTGGTAAATG 


GGAGTTTATT 


CGAAGGAAAG 


360 


GTGGTTGAAT 


AATGATAGTA 


GATAATTTTT 


CAAAAGACGA 


TAACTTAATC 


GAGTTACAAA 


420 


CAACATCACA 


ATATAATCCA 


ATTATTGACA 


CAAACATCAG 


TTTCTATGAA 


TCAGATAGAG 


480 


GAACTGGTGT 


TTTAAATTTT 


GCAGTAACTA 


AGAATAACAG 


ACCGTTATCT 


ATAAGTTCTG 


540 


AACATGTTAA 


GACATCTATC 


GTGTAAAAAC 


CCGATGATTA 


TAACGTAGAT 


AGAGGCGCTT 


600 


ATATTACAGA 


CGAATTAACG 


ATAGTAGACG 


CAATTAATGG 


GCGTTTGCAG 


TATGTGATAC 


660 


CGAATGAATT 


TTTAAAACAT 


TCAGGCAAGG 


TGCATGCTCA 


GGCATTCTTT 


ACACAAAACG 


720 


GGAGTAATAA 


TGTTGTTGTT 


GAACGTCAAT 


TTAGCTTCAA 


TATTGAAAAT 


GATTTAGTTA 


780 


GTGGGTNTGA 


TGGTATAACA 


AAGCTTGTTT 


ATATCAAATC 


TATTCAAGAT 


ACTATCGAAG 


840 


CTGTCGGTAA 


AGACTTTAAC 


CAATTAAAGC 


AAAATATGGC 


TGATACACAA ACGTTAATAG 


900 


CAAAAGTGAA 


TGATAGTGCG 


ACAAAAGGCA 


TTCAACAAAT 


CGAAATCAAG 


CAAAACGAAG 


960 


CTATACAAGC 


TATTACTGCG 


ACGCAAACTA 


GTGCAACACA 


AGCTGTTACA 


GCTGAAGTCG 


1020 


GATAAANTAG 


GTGGGAAAAA 


GAGCAAGCGA 


TTGGGGACCG 


TTTTAACGAA 


GTNGGACAAC 


1080 


AAATCAATGC 


GCCTGGCCTT 


GTNAAGGTAA 


TNCAACAACA 


AATTGGGAAA 




1130 



(2) INFORMATION FOR SEQ ID NO: 1149: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149: 



AACCCTAATT 


GTTCCAGTTT 


TGTNCCATTG 


ATAATAATAA 


ATTTNCGCCT 


NTGAACANCT 


60 


TGATCAAAGT 


TTTGTTCCCC 


NGGTTGACCT 


TCGTTGAAAT 


ATTTCCAATT 


TCCTTAAGTT 


120 


CCTCCGTAAT 


CTTGAATCCC 


CTGNCTNAAC 


CCATCCATNG 


NTTGGTTAAC 


CGAATNCCCA 


180 


TTTTGTCCCA 


ATTTGTTGAC 


CTCCAGTTNC 


TAAAGTTCCA 


CCTGTAACTT 


GGTCTTTTAA 


240 


TCCTGTTTTT 


TGAGGATCAT 


TTAAGCTTGT 


TAAATCATCA 


ATTGCTTNCG 


TCGCATTTGT 


300 


TTTAGCAACT 


TGTAAATTAT 


GATCTCCATT 


TAACGCTTGA 


AGTGTTGTTT 


GAACTTTAGT 


360 


AATAGCTTGT 


TCTACTTCTG 


TTTGGTTCGC 


ATTTTGACCA 


GTTTGTTTGT 


TAATCATTGC 


420 


TGCTGCTTCA 


TTTACCGCTG 


TATTATAAGC 


AGTTTGTTTA 


TCAGTATCAG 


CATTGATGAA 


480 


GTTACCACGT 


TGTTCAACGG 


CTTGATGATC 


TGCAATCGCA 


TTAATTAAGT 


TACCCATCGC 


540 


AGTATTTAAA 


GATTCTGCTG 


TTTGTTTGTG 


ACCATTAACT 


GCAGCAATAT 


CAGATGATTG 


600 


CGTAATTTGT 


GATTGTAACT 


GTTGCTTTTG 


CGCATTATTC 


AAATGATCTA 


ATTGATCAAG 


660 


TCTGTGTTTG 


CTTTGGTGGT 


TTAGCTTGCT 


TGCAAGGTTT 


TCCATCACCA 


TCTTAATTGC 


720 
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AACTTTCCGG ATTTATTCAC TTGGNGTC _ 748 

(2) INFORMATION FOR SEQ ID NO: 1150: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150: 



GGCACGAGCT 


CGTGCCGCTC 


GGCATTTTTT 


ATCTACATGT 


TTGCATTCTG 


GTTCATTTTT 


60 


AGTTCTATTG 


CAGGATTATT 


TACGTTTACG 


GGTAGTGGTA 


GCTTAAAACT 


AATTTCAGTG 


120 


ATTTTTAATT 


TATTAGGTAT 


TGTTTTCGGT 


GTCATTTTAT 


TATTTAATCC 


ATTAATGGGT 


180 


ATCGTCTTTA 


TTTCGACGAT 


GATTGCTATG 


CATTGTATTC 


GTAGGTGTCA 


TTTATGTTGT 


240 


AGATGCACTT 


GCTTAAGTAA 


AATGAAGCGG 


TTCAAAAGAA 


GGGTGTGACA 


TGAAGTTTGT 


300 


GTCATATCCT 




TTTATGAAGC 


ATAAAAAAGG 


GGCGCTACCT 


ACAATAAGTA 


360 


AGATACGCCC 


ATATTTTTAT 


ATTTTACTAT 


TATTGTTTTT 


CAATACGATT 


AATAGTTACA 


420 


TTTAGTCCAA 


AATATTTTTC 


TAAAAAATGT 


TTATAGTTAT 


CTTTAGTGAC 


ATCAAATTTT 


480 


TCTGAGCTAC 


CATTCCTTGT 


TAAAGTTAAA 


TGATTTTCAG 


ACATTGTAGC 


ACGGCCAAAT 


540 


GATTGTGGCA 


TTGTAATTAA 


TAAATGCTGT 


ACAAATATTG 


AATCTGGATG 


CGTTTGATTA 


600 


TATTCGATGT 


TGCTATTAAA 


ATCTTCTATT 


TGTTTAGGTT 


TGAATTCAGC 


TTCATATTTT 


660 


GTATGCCAAT 


GATCATTTTC 


GAATTTTTGA 


ACATAGAAAA 


TATCCTTGTC 


TTCGTTGTTA 


720 


AAAATAGCGC 


GGAATGTACC 


ACTGATATCA 


GTAATTGGTT 


GTGTATGCTC 


AGATGAAGTA 


780 


ATAGGAATGG 


CATGTAGAGG 


TAAGTCTCCA 


AAGCCAACAT 


CAGTTACATA 


GAATACATCA 


840 


TTTATAGAAA 


CAACAAGTGA 


AGCATGTGAA 


CCGTTCAGAC 


TACGACCGCC 


ACCGGGTGTG 


900 


TGAATAGTAG 


CTGACATTAA 


TTCAGGATTA 


AATCCTTTTT 


GGTGTAAATA 


TGCTTTGGAA 


960 


AAATGTATTT 


AACTCATAAC 


AAAAAACCAC 


CACGTTTATC 


ATGAACAATT 


TNATTAAAAA 


1020 


GTGCATCGAT 


ATTTATAGAT 


ATCGGCTTAC 


TATTTTGGAC 


ATCAATATTT 


TCAAAAGGGT 


1080 


ACAAGTTAAC 


ATAAAACGTN 


GTTTGGATTA 


ATAATTTNAA 


TGCTTCAATA 


CTCGGNCGGA 


1140 


TTATTACGAG 


ATTGATCANT 


TGTNAATTAT 


TCTCTTACTT 


CCGCAATATT 


CAATAAGCAT 


1200 


TAGCGCCTCC 


TGGTATTAAA 


GATTATTATT 


AA 






1232 



(2) INFORMATION FOR SEQ ID NO: 1151: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 543 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151: 



GGCACGAGCG 


ACATTCATTA 


TGCCCAAATT 


TTTAAAATCT TTCTTTAACG AGCATCCGTT 


60 


TATCGATGTT 


TCTCTCCATG 


TCAAAAATGA 


AAATATTGAA 


AAAGATATTA 


ATAATCATAC 


120 


GTATGACATC 


GGGATTAGCA 


GAAATCAACC 


TAAACTAAGA 


GAAGTTCATT 


CTGAAAAAGT 


180 


ATGCGAAGGT 


AAAATTGTAT 


TGATTGCTCC 


CAATAAAGAG 


AATAATCATC 


TTCTAACTGA 


240 


GGCATCTTTA 


TTTGAAAAAT 


ACAAAATAAT 


AAGTGATAAC 


CATCCAGAAT 


ATTGGTCATC 


300 


ACTTAAAAAT 


AATATTTTAA 


ATATTTATGG 


AAAAGCACAA 


TTTCTATCTA 


TCAATGATGT 


360 


GCATACTTCA 


ATTAAATTAA 


TCGAAATGAA 


TCAAGGTATA 


TCATTCCTGC 


CTATTTATAT 


420 


TAACAACAGA 


TAGTGATTAC 


AATATTTCGG 


TTATTAACAC 


GAAAATTTTA 


CAAGCGCCTA 


480 


TTTCATTTAC 


ATATATATAC 


AGCAAAAAGA 


AGGCCAGGAA 


TATGGCTTTT 


ATTAATCATT 


540 


TAA 












543 



(2) INFORMATION FOR SEQ ID NO: 1152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152: 



GGCACGAGCT 


CGAAACAATA 


AATTGTTCCA 


GCTCCTTTAA 


GCACCGATGT 


CTTATTTATA 


60 


ATATGTTTGT 


CTAATATCAT 


CTTTCTTTAT 


ACCAAACTGC 


TTATAGGATT 


TATCCATTAA 


120 


TTTGGCAATT 


TTATCTGCCC 


ATCGAATGTC 


ACTCGCATAT 


TGATGTTGCG 


CAGGATTTTC 


180 


TGGATTCCAT 


CGCATTTGAT 


ATAAATTCAG 


TTGATTGTTT 


TCAAAATATT 


CATTACGAAT 


240 


GAACTTTGCA 


CCACCAATAA 


TCGCCTTATC 


TGGTGATGTC 


CATTGTTCCT 


TTTCAGCATA 


300 


ACTTTTCCCA 


CTACGAACAG 


CACTACTATC 


GAATGCTCCT 


ATACCAAAAA 


AGTTGTAATA 


360 


GCGTTTTTTC 


CCATCTTTAA 


TGCCTTTTGC 


TAATTCTGAT 


TTGCCGTTAC 


CTGTTTCTAC 


420 


TAATGCATGG 


CTAACAAGAT 


AAATGACNTT 


AACCTCATAT 


TTTTCTTGAG 


CTTCTAGAAA 


480 


AACTTTCCCT 


CGATTTTCTA 


AAATCCCCTT 


ACCTTTTAGC 


AATTGGGTAA 


CTTCAGATTC 


540 


CGACATTGGC 


ACTTTTTCTG 


TGATATCCAT 


ATACTTCAGT 


ATCATGTCTT 


ATGTGAAATA 


600 


GTCATGCTTT 


AGC 










613 



(2) INFORMATION FOR SEQ ID NO:1153: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153: 



GGCACGAGAC 


ACAAGTTAAT 


ATTTGGTAAC 


TAACTTGTGT 


ACACGTGATA 


CGAAACTAAT 


60 


TTTTATTATT 


CATTTGATAC 


ATGTCTATTT 


AAATAATTTT 


GCATATACGT 


ATTCTGCCAT 


120 


ATCATTTGAG 


GCAGTTAAGA 


TTAATGTAGT 


GTCATTTCCA 


CTCACCGTAC 


CTAATATCTC 


180 


TTCTATATTT 


AGCTGATCGA 


TAAAATAGTT 


TATGCCTTGG 


GCGAAACCAG 


GTGAGGTTTT 


240 


AACGATAATG 


TATGAACCAT 


TGATGATTGA 


ACTTAGAACT 


TCATGTTCAC 


AATAGTGTCN 


300 


AAACTTTTCT 


CTCATCTCCT 


GTTCCGTTTG 


AATATTAATA 


GCTTTATAAA 


TCCATGTTTC 


360 


ACAATCGATA 


GGGACCCCGA 


TATATATTAA 


GTTCTTTCAA 


GTCTTTAGC 




409 



(2) INFORMATION FOR SEQ ID NO: 1154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1154: 



AGAATTTTAC 


AAAGTTGCAC 


AAGCGCTATT 


AATCAATATC 


GGTACTTTAA 


CAGCACAAAA 


60 


TGAACAAGAT 


ATTATTGCGA 


TTGCTCAAAC 


GGCAAATTGA 


GGCAGGCTTA 


CCCTATTTGT 


120 


ATTTTGACCC 


CTGTACCTGT 


TGGTGCTTCT 


ACATATCGAA 


AGCAATTTTG 


TAAATTATTA 


180 


TTGAAATCAG 


CGAAAGTATC 


AGTAATTAAA 


GGCAATGCAT 


CTGAAATATT 


AGCGTTGATT 


240 


GATGATACAG 


CAACTATGAA 


AGGTACAGAT 


AGTGATGCTA 


ATCTTGATGC 


GGTTGCAATA 


300 


GCGAAAAAGG 


CTTACGCAAT 


ATATAAAACT 


GCAATAGTAA 


TCACAGGTAA 


AGAGGACGTT 


360 


ATTGTTCAAG 


ATAATAAAGC 


CATCGTATTA 


GCTAATGGAT 


CTCCATTATT 


AGCACGAGTA 


420 


ACTGGAGCTG 


GTTGTTTATT 


AGGAGGCGTT 


ATTGCTGGAT 


TTTTATTTAG 


AGAAACAGAA 


480 


CCAGACATAG 


AAGCGTTAAT 


TGAAGCGGTA 


AGCGTATTTA 


ATATTGCTGC 


TGAGGTAGCT 


540 


GCTGAAAATG 


AAAATTGTGG 


TGGTCCTGGT 


AACGTTTTCA 


CCATTGTTGC 


TTGATACGTT 


600 


ATATCATTTA 


AATGAAACAA 


CTTATCAACA 


ACGGATTCGT 


ATTCAAGAGG 


TGGAATAATA 


660 


TGTTTAACCA 


TTCGTATCTA 


AATGTGTATT 


TTATTTGTGG 


GACCTCCGAT 


GTTCCGAGTC 


720 


ATCGAACTAT 


TCATGAAGTA 


TTAGAAGCAG 


CATTAAAGGC 


AGGGATTACA 


CTTTTCCAAT 


780 


TTCGAGAAAA 


GGGTGAATCA 


GCTTTAAAAG 


GAAACGATAA 


ACTTGTATTA 


GCTAAAGAAT 


840 
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TACAGCATCT CTGTCATCAA TATGATGTGC CTTTTATCGT TAATGATGAT GTATCCTTGG^ 900 

CAAAAGAAAT TAACGCGGAT GGTATACATG TCGGTCAAGA TGATGCGAAA GTCAAAGAGA 960 

TAGCACAATA TTTCACTGAT AAAATTATTG GACTTAGTAT TAGTGATTTA GATGAGTATG 1020 

CAAAATCTGA TTTAACTCAT GTTGATTATA TAGGTGTTGG ACCAATCTAT CCTACGCCGT 1080 

CGAAGCATGA CGCGCATATA CCAGTAGGTC CAGAAATGAT TGCAACGTTT AAAGAAATGA 1140 

ATCCGCAACT ACCGATTGTA GCTCGTGCC 1169 

(2) INFORMATION FOR SEQ ID NO:1155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155: 



CCGCTTCGCA 


CGAGCNAAGG 


CTTTTATTGC ANCCCTCGCA TCTTCTTTAG CTTACACTAA 


60 


ATCTTTGATG 


TCTTGTAATG 


CTTGTGCAAG 


TTGTGCTTTG 


GCTTGTTCAA 




120 


AGTCATCGCA 


TTGTTAATGT 


CGTTATGACC 


TTGTTGAAGT 


ATTTGANTAA 


TTCGATCTTT 


180 


AAGCGCTTGT 


TTNTCCTTAT 


CTGTTAGATT 


TGGATTGTGA 


TCGATTTCGN 


CAATTAATGC 


240 


TTGAACTTGT 


TTATCAACAT 


CTTGTTTCGC 


ATCTTCTTTA 


GCTTACACTA 


AATCTTTAAT 


300 


GTCTTGTAAT 


GCTTGCGCAA 


GTTGTGCTTT 


AGCTTGTTCA 


ATTTCTTCTT 


TAGTCAGCGC 


360 


ATTGTTAATG 


TCGTTATGAC 


CTTGTTGAAG 


TATTTGATTA 


ATACGATCTT 


TTAAGTGCTT 


420 


GTTTTTCCTT 


ATCTGTTAGA 


TTTGGATTTC 


GATCAATTTC 


GTCAATTAAT 


GCTTGAACTT 


480 


GCTTATCAAC 


GTCTTTATTG 


GCATCAATTT 


TNGCTTTNGG 


TATTTCATTA 


GCANGCACTN 


540 


GTTCAATCGC 


GTGGTTGCCT 


GCTGTTTGAA 


CTTGAGATAC 


AGCCTGATTA 


CTTGTTGCTT 


600 


TATTAATGTT 


GTTGATGATG 


ATTTGAAGCT 


AATTCTTCTT 


GCTTTATTTT 


TTCGGCAATA 


660 


AGCTTGTCTT 


GATCCGTCGC 


TCGATTTCTT 


TTAGCTTATT 


AGCTAAAGCT 


720 


TGATTAATTG 


ATTGAATTGC 


CTTGTCTTTA 


GCATCTTGTA 


GTCGTTGATC 


ACCATTAAGA 


780 


TTATGGATTG 


CATCATTGAC 


TGCTTGGATT 


GCGCCATTGA 


TATCATTCAC 


ATTTGTGTTA 


840 


TCACTATTTA GCAATGTATT 


TGCTAGACGT 


TTGGCATCAT 


CGAAGTTTGT 


TTTAGCATTA 


900 


TCGTCAGCGT 


TTTGGTAATT 


GACAGTTTGC 


TCTGCATTTG 


GAATTTCATT 


GTCAACTAAA 


960 


TGTTTCAATG 


TTTCCATTGC 


ATCATTTAAG 


TCAATTTGAT 


TATTAACAAT 


ATCTGTTACA 


1020 


TC TGAT AC AG 


TATCGGCATT 


GTTAATTGCT 


TTATGTGCAA 


GATCTTGTTG 


CTGTTGATTT 


1080 


AATCCATTTA 


ACGAATTAAC 


AAACGCATTT 


GCTTTATCCT 


TTGGCATTTG 


CAAGGTTTTG 


1140 


GTCTCCATTT 


AATGCATTTT 


GAGCATCGAT 


AATATTTTGT 


TTCAATTGCT 


CTGCTTCAGC 


1200 


TTTTGCAATT 


GCATTACTTG 


CACTTTATCT 


AACTCATGTG 


CTGCATTCGC 


AATGCATATC 


1260 


ATAATTTGCT 


TTCAATGTCA 


TCTGCATGAT 


GTATTGTGCT 






1300 



(2) INFORMATION FOR SEQ ID NO: 1156 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156: 



TTGGAGGATG 


GAACATATGT 


GGGGCGAATC 


CCCTGGGTGA 


AGATAATAAT 


CCATATGCCT 


60 


ATGTGATCAA 


GCACTTAGAA 


AAACATTGAA 


ATTATGTTTG 


ACATCACCAC 


AAGGAGTGAA 


120 


ACACCATAAG 


CTTAATTCTC 


CAGTTCTATA 


TGGGGTAAAT 


TTCCGTACAN 


ACTTGTATGT 


180 


TGTAGTTAGC 


GTCTAACACT 


GCTGAATAAA 


TCGCAGGTCA 


ACTCATTGCA 


CGATATATCT 


240 


ATAGGCGTTG 


TGTGTGCCGT 


GTCCGCCATT 


TTTAACTAGC 


AGTCTATCAA 


TAAATTGTCC 


300 


GTTAGGCTTC 


AATCTAGATA 


ACATGTAATG 


ATTGCCTGGA 


CGCGCTTGNG 


TCATGTAAAT 


360 


AATTTTTGTT 


CTAGGGTCTA 


CCCAACAATG 


ATTGCATTAC 


TGCGTTAGTA 


TATGGCGATA 


420 


AATCGGGGAG 


GAATTCCTGC 


AGCCCGGGGG 


ATCCACTAGT 


TCTAGAGCGN 


CCGCCACCGC 


480 


GGTGGAGCTC 


CAGCTNGTGT 


TCCCGGTAGT 


GNGATGGGTC 


GACGGNCAA 




529 



(2) INFORMATION FOR SEQ ID NO: 1157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157: 



GATTCGAGCT 


CGGTACCGGG 


GATCCTCTAG 


AGTCGACCTG 


CAAGCTATAC 


CAAGCTAGAG 


60 


TACTGCGACG 


CAAACTAGTG 


CAACACCTAA 


GGCTGTTACA 


GCTTGGAAGT 


CGATAAAATA 


120 


GTTGAAAAAG 


AGCAAGCGAT 


TTTTGAACGT 


GTTAACGAAG 


TTGAACAACA AATCAATGGC 


180 


GCTGACCTTG 


TTAAAGGTAA 


TTCAACAACA 


AATTGGCAAA 


AGTCTAAACT 


TACAGATGAT 


240 


TACGGTAAAG 


CAATCGAATC 


GTCTGAGCAG 


TCCATAGATA 


GCGTTTTAAG 


CACAGTTAAC 


300 


ACATCTAGGA 


TTATTCATAT 


TACTAATGCA 


ACAGATGCGC 


CAGAAAAGAC 


GGATATAGGC 


360 


ACGTTAGAGA 


AGCCTGGACA 


AGATGGTGTT 


GATGACGGTT 




TGAATCAACT 


420 


TATACATCAA 


GCAAATCTGG 


TGTGTTAGTT 


GTTTATGTTG 


TTGATAATAA 


TACTGCTCGG 


480 


TGCAACATGG 


TACCCAGACG 


ATTCAAACGA 


TGAGTTCACA 


AAATACAAAN 


TCTTACGGCA 


540 


CATGGTACCC 


GTTTTATTAA 


AAAGAATGAT 


GGAAACTTAA 


CTAGCAATTT 


GGTTGGAGGA 


600 
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ACGTCTTACA 


ACGCTTTAAA 


TCAAGCTAAG 


CAGTATGTAG 


ATGATAAATT 


CGGAACAACG. 


660 


AGCTGGCAAC 


AACATAAGAT 


GACAGAGCCG 


AACGGTCAAT 


CAATTCAAGT 


TAACTTAAAT 


720 


AATGCGCAAG 


GCGATTTGGG 


ATATTTAACT 


GCTGGTAATT 


ACTATGCAAC 


AAGAGTGCCG 


780 


GATTTACCAG 


GTAGCGTTGA 


AAGTTATGAG 


GGTTATTTAT 


CGGTATTCGT 


TAAAGATGAT 


840 


ACAAACAAGC 


TATTTAACTT 


CACACCTTAT 


AACTCTAAAA 


AGATTTACAC 


ACGATCAATC 


900 


ACAAACGGAA 


GACTTGAGCA 


ACAGTGGACA 


GTTCCTAATG 


AACATAAGTC 


AACGGTATTG 


960 


TTCGACGGTG 


GAGCAAATGG 


TGTAGGTACA 


ACAATCAATC 


TAACTGAACC 


GTACACAAAC 


1020 


TATTCTATTT 


TGTTGGTAAG 


TGGAACTTAT 


CCAGGTGGCG 


TTATTGAGGG 


ATTCGGACTA 


1080 


ACCGCATTAC 


CTAATGCAAT 


TCAATTAACC 


AAACCGAATG 


TAGTTGACTC 


AGACGGTANC 


1140 


GGTGGCGGTA 


TTTATGAGTG 


TTTACTATCA 


AAAACAAGTA 


GTACCCACTT 


TAAGAATAGA 


1200 


CAACGATGTG 


TATTTCGATT 


TAGGCAAAAC 


ATCAGGTTCT 


GGAACGAATG 


CCAACAAAGT 


1260 


TACTATAACT 


AAAATTATGG 


GGTGGAAATA 


ATGAAAATAA 


CAGTAAACGA 


TAAAAACGAA 


1320 


GTTATCGGAT 


ACGTTAATAC 


TGGCGGTTTA 


CGCAATAGTT 


TAGATGTAGA 


TGATAACAAT 


1380 


GTGCCTATCA 


AATTCAAAGA 


AGAGTTTGAA 


CCTAGAAAGT 


TTGTTTTCAC 


TAACGGCGAA 


1440 


ATTAAATATA 


ACAGCAATTT 


TGAAAAAGAA 


GACGTACCGA 


ATGCATCAAG 


CCAACAAAGT 


1500 


GAATCAGATT 


TGAGTGATGA 


AGAACTTCGC 


GGAATGGTTG 


CGAGTATGCA 


AATGCAGGTG 


1560 


GCACAAGTAA 


ACGTATTAAC 


AATGGAATTA 


GCTCAACAAA 


ACGCTATGTT 


AACACAACAG 


1620 


TTGACTGAAC 


TGAAAACTAA 


CAAAACAAGT 


ACTGAGGGGG 


ACGTTTAATG 


ATGAAGATGA 


1680 


TTTATCCGAC 


TTTTAAAGAC 


ATCAAAACTT 


TTTATGTTTG 


GGGTTACTAT 


AAAAACGAGC 


1740 


AAATTAAGTG 


GTACGTAGAC 


AAGGGTTTAA 


TCGATAAAGA 


AGAATACGCT 


TTAATCACTG 


1800 


GAGAAAAATA 


TCCAGAAACA 


AAAGATGAAA 


AGTCACAGGT 


GTAATGCTTG 


TGGCTTTTTA 


1860 


ATTTAACAAA 


AAGTAGGTGG 


CGTAATGTTT 


GGTTTTACCA 


AACGACATGA 


ACAAGATTGG 


1920 


CGTTTAACGC 


GATTAGAAGA 


AAATGATAAG 


ACTATGTTTG 


AAAAATTCGA 


CAGAATAGAA 


1980 


GATAGTCTTA 


GAGCGCAAGA 


AAAGATTTAT 


GACAAATTAG 


ATAGAAATTT 


TGGAAGAATT 


2040 


AAAGCGCGAC 


AAGGTAGAAG 


ATGAAAAGAA 


TAAAGAAAAG 


AATGCCAAGA 


ATATTAGAGA 


2100 


CATAAAAATG 


TGGATTCTCG 


GTTTGATAGG 


GACTATCTTC 


AGTACGATTG 


TCATAGCTTT 


2160 


ACTAAGAACT 


ATTTTTGGTA 


TTTAAAGGAG 


GTGATTACCA 


TGCTTAAAGG 


GGATTTTAGG 


2220 


ATATAGCTNC 


TGGGCGGTCC 


TCCTGGTTGG 


GGGTAAATGT 


AAATAACAGT 


TAAGAGTCAG 


2280 


TGCTCCGGCA 


CTGGCTTTTT 


ATTTTGATTG 


AAATGAGGTG 


CATACATGGG 


ATTACCTAAT 


2340 


CCAAAGACTA 


GAAAGCCTAC 


AGCTAGTGAA 


GTGGTGGAGT 


GGGCAAAGTC 


GAATATTGGT 


2400 


AAGAGGATTA 


ATATAGATAA 


TTATCGGGGC 


AGTCAATGTT 


GGGATACACC 


TAACTTTATT 


2460 


TTTAAAAGAT 


ATTGGGGTTT 


TGTAACATGG 


GGCAATGCTA 


AGGATATGGC 


TAATTACAGA 


2520 


TATCCTAAGG 


GTTTCCGATT 


CTATCGTTAT 


TCATCTGGAT 


TTGTTCCGGA 


ACCCGGAGAC 


2580 


ATAGCAGTTT 


GGCACCCTGG 


CAACGGAATA 


GGTTCGGACG 


GACACACCGC 


AATAGTAGTA 


2640 


GGACCATCTA 


ATAAAAGTTA 


TTTTTATAGC 


GTTGACCAAA 


ACTGGGTTAA 


TTCTAATAGT 


2700 


TGGACAGGTT 


CTCCAGGAAG 


ATTAGTAAGA 


CACCCTTATG 


TAAGTGTTAC 


AGGCTTTGTT 


2760 


AGGCCTCCGT 


ACTCAAAAGA 


TACTAGCAAA 


CCTAGTAGTA 


CTGATACAAG 


TTCAGCATCA 


2820 


AAAAGCCAAT 


GACTCAACAA 


TTACTGGCGA 


AGCGAAGAAA 


CCGCAATTTA 


AAGAAGTTAA 


2880 


AACAGTAAAA 


TACACTGCTT 


ACAGCAATGT 


TTAGATAAAG 


AAGAGCACTT 


CATTGATCAT 


2940 


ATAGTTGTAA 


TGGGTGATGA 


ACGCTCAGAT 


ATTCAAGGAT 


TATATATAAA 


AGAATCAATG 


3000 


CATATGCGTT 


CTGTAGACGA 


ACTTTATACG 


CAAAGAAATA 


AGTTTATAAG 


CGATTATGAA 


3060 


ATACCGCATT 


TATATGTCGA 


TAGAGAGGCT 


ACAGGAATTC 


CGGAATTCCG 




3110 
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(2) INFORMATION FOR SEQ ID NO:1158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 1158: 



GGTTAAACCG 


TACCCCAANA 


ATAAAGCNTG 


GTCAACCCNT 


CCAAATATAA 


ATTTGAAANC 


60 


CTTGCCCAGC 


TGTTTTATTA 


AGGGANCGAT 


CCAATTAACA 


CCCACATATT 


AATAATACAT 


120 


CAGCAAAGCC 


TTCAGGTTTT 


TGAATATAAC 


CTAAGTGCCC 


GCCTGGAATA 


TCTACAATAG 


180 


GTATGCCAGT 


TTCTTTGTTT 


ATATAAAAGT 


TAACATCTTG 


TGGGAAGGAG 


CCTCTAGAAT 


240 


CTGTCCCATT 


TAGTAGGGTG 


ATTTTATCGC 


TGTATTTTGT 


GAAATCATCC 


AAAGTAATAT 


300 


CTGAATGCGT 


ATATTGTCTA 


ATTTCAAATT 


CTGACCAGAA 


CATCGTACGT 


TTGTACTGTT 


360 


CTATACGTCC 


TTCTTCAGTA 


TCAGCAGGTT 


GAGACATCAT 


TTTTGCATCA 


ATTGGTGCGA 


420 


TATTTAATGT 


TTCACCAAAT 


GTTTTCATAC 


CTTTTTCTAA 


GCCTTCTGTT 


AAAATTTGAT 


480 


GCACAATGTC 


ATCATTTTTA 


TCTTTCCAAT 


AAGTACTGTC 


TGGTAAAAAT 


GTATTAATTG 


540 


GTGGTTCGTG 


AAATGCAATC 


TTTTTAACGA 


CTTCAGGGTA 


ATCTTTTAAC 


ACATGCATCG 


600 


CAACGATTGA 


ACCTGAACTT 


GAACCTAATA 


TATAGACAGG 


TTCATCACTT 


AATGACTTTG 


660 


CAAGTTCGGC 


AATGTCCTGT 


GCGTCGCGTT 


TGACACGATA 


ATCACTGTCA 


GGGTTTGAAG 


720 


CGGAATCAGG 


GAGTGGTTCA 


GTTAACTCGC 


TTTCTCCATA 


ATCACGCCGA 


TCCACGGCTA 


780 


CAACAGTAAA 


ATGGTCTTTT 


AACTGTTCTG 


CAAGAGGCAG 


AAAAATGTTT 


CCGGTNCCGT 


840 


TTGCACCAGG 


AATAAAGATG 


AGCACGGGTC 


CTTGTCCGAC 


TTGGTGGTAT 


CGTAATTTAG 


900 


CGCCTTGTAA 


TTCTAAAGTT 


TCCATATTCA 


ATGACCTCCA 


TTTGTTAATT 


GTTAGATGAT 


960 


AAACCTAATA 


ATTTAGCACC 


ATTTGTATAA 


CTTATTTTCT 


cnrrrrarric 


ATCTGTTAAA 


1020 


CCCAGTTCAT 


CTAAAAATAC 


ACCTAATTTT 


TCAGGCTCAA 


TATATGGATA 


ATCAGCAGCA 


1080 


TAAAGAATTC 


TATCAATACC 


TACTTCTTTC 


TTGACTAAAT 


CAAACTGTGG 


CTTCGTTAAC 


1140 


ATGCCACTTG 


GTGTGATATA 


AAAACTATTT 


TTAAAGTAAT 


AGCTTACAGA 


GTGGTTCAAA 


1200 


TGTTCAGCGA 


ATAAAGCTTC 


ATCCATACGT 


TCTAAGAAGA 


ATGGGATAAA 


CTCACCCCAA 


1260 


TGTCCAATAA 


TCATATTTAA 


CTTTGGATAA 


CGATCAAAAA 


TGCCAGATAA 


TACTAGATGT 


1320 


ATTGCATGAA 


TTCCGACATC 


AATGTGCCAA 


CCATAACCAA 


AACAAGCAAA 


TGTTGCCGCA 


1380 


GTTACTTCAG 


GATAATTTCC 


TTTATAGTAT 


GATTGATAAA 


TGTCACTGTT 


AACTGGCGCG 


1440 


GGATGTAGAT 


AAATCGGTAT 


CGTGCCTAAA 


TTTTCAGCTG 


TTTTGAAAAT 


AATGTCATAT 


1500 


TTGTCTTGAT 


CAAGAAAACC 


ATCTTGTGCA 


CGTCCCATAA 


TGAGCGCACC 


TTTGGAATCC 


1560 


TAAATCATTG 


ATGCAACGNT 


CGAATTCTCG 


GGCTGCGGTT 


CANGCTCAAT 


GGTNGATAAA 


1620 


GTTGGCAAAG 


CCTACAAAGC 


GATTGGGATA 


TTGGCAATAT 


AGTTTOGCAA 


TTGNATCATT 


1680 


TGGTTTGTTG 


GCATGATTCA 


ATGGGTTTTC 


TGACCAACTN 


AAATTTGAGG 


AGGATCCCAA 


1740 



(2) INFORMATION FOR SEQ ID NO: 1159: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159: 



GGCACGAGCG 


GTGCCCATTA 


TAAACCAGAA 


TTCCCAGAGC 


GTAATGATGA AGAGTGGTTA 


60 


AAAACGACAA 


TGGCCTCATT 


CCAAGGCGCA 


TTTGAAAAAC 


CACAGTTTAC 


TTATGATGAC 


120 


GTCGATGTGA 


GTTTAATACC 


ACCTCGTAAA 


CGTGATTACA 


CAAGTAAGTC 


TAAAGGGGGT 


180 


AAAAAATAAT 


GACTGAACAA 


TCAGTGAAAA 


ACACTCCACA 


ACATGAAACA 


CAATCTAAAC 


240 


CGAAACAAAA 


AACAGTAAAA 


TTAATTATTA 


AACGACAAGA 


TACAAGTGAT 


TCTAAGCCTT 


300 


ATGAAGAAAC 


ATTTGAAATT 


CCATATCGTG 


AAAATTTAAA 


CGTCATTGCT 


TGTTTAATGG 


360 


AAATTAGACG 


TAACCCAGTT 


AATATTAAAG 


GTGAAAAAAC 


AACACCTGTT 


GTCTGGGATA 


420 


TTGAACTGCT 


TAGAAGAAGT 


ATGTGGAGCA 


TGTTCTATGG 


NTATCAATGG 


GCCGGGCAAA 


480 


GACAATCTTG 


GTCCTTGCGA 


TTGGTGAGTC 


CAAATTAGGA 


CAAACCTATT 


CGGTTTAGAG 


540 


CCAATGATAC 


TTCCCCAGGT 


TNTCCCGTGA 


CTACAAAGTG 


ATCGTCCAGA 


TGTCGATAAC 


600 


TTAAACGTAT 


GAAAGCATGG 


TCCCAATGAT 


GGACGTATGA 


TTTAGG 




646 



(2) INFORMATION FOR SEQ ID NO: 1160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160: 



CTATTGCTTT TCAATAAATA AGATGATTTC GGCGATATTA ATAAAGAAAA ATGCATCATG 60 

TAAAAAGTCG CGTACTAAAC GTTCGTCATC TGNTTCAATA AATGAATTAC TATTCACTTT 120 

TTTATGTGCT TCTGGCATTG GCTTTAATGT CAGGTGTGAA GCTGCTTCAC TTAAATGCTC 180 

ACGCTTTAAA ACGAAACACC ATCGCGGGAA ATCTTTNTAA GGGCAATAAC GTGGTAGGGC 240 

CAAACCCATT NTTCAATGAA ATGGAGGCAT CAATATNTTT GGTGCATTGC GCATTCAAAA 300 

GGGCAAATAC CCGCGGATTA NTAAAGGCAT 330 



(2) INFORMATION FOR SEQ ID NO: 1161: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161: 



ACCAATCCGA 


TTNCGTNCGG 


TTGTCGCCAA 


TAAGACNCCG 


CCCAAAATAT 


TAATTTANNG 


60 


GGTNCCCAAT 


AAAGAATANC 


CGGACCACCT 


CCACCGAAAC 


CGAGTGTTGC 


CNCCTCAATA 


120 


CAATTGAAAA 


TGCAGCCCAA 


ATTCCGGATT 


GGNTAAAGCA 


GATACAGCAA 


CAAGATATGT 


180 


ATTTANAAGA 


TGAATTAAAA 


GACAAGTTTT 


TCTAGGGGGA 


AGTCTTAGGA 


CAATCTTATT 


240 


TAAAATACAC 


AACTTTCGCC 


TTATAAACCA 


AACTCCAAGT 


TTATGGTGCG 


TTAGGTGTTA 


300 


ATATGCCGTG 


NNAAATATAT 


ATCAATATGT 


TAATCGATGA 


AGAGGATGCG 


ATACCATTTA 


360 


ATGCACTTTA 


TGCAAGTGAT 


AAGGATGGTT 


TACCATTCAT 


TGAAAAGTGG 


ATTAAACAAT 


420 




AAGCTTGGAC 


CAAAGCAATT 


TTTAGCTGTA 


GCGATTCGTC 


CATTGATTCA 


480 


TATGCTTTAT 


TATCACGGTA 


TTGCCTTTGA 


ATCGCATGCA 


CCACAATATG 


ATGCTCAATT 


540 


CATGAAAATG 


GTTGCCCTAC 


ACGTATTGCC 


TTAAAAAGAT 


TTCCACGATG 


TTGTTCGTTT 


600 


TAAGCGTGAG 


CATTTAAGTG 


AAGCAGCTTC 


ACACCTGACA 


TTAAAGCCAA 


TGCCAGAAGC 


660 


ACATAAAAAA 


GTGAATAGTA 


ATTCATTTAT 


TGAAACAGAT 


GACGAACGTT 


TAGTACGCGA 


720 


CTTTTTACAT 


GATGCATTTT 


TCTTTATTAA 


TATCGCCGAA 


ATCATCTTAT 


TTATTGAAAA 


780 


GCAATATGGT 


ATCGATGAGC 


AGCGACAATG 


GCAATGGGTT 


AAAGACATTA 


TCGAGGCGTA 


840 


TCAAGAAGCA 


TTTCCAGAGT 


TGAATAACTA 


TCAACATTTC 


GATTTGTTTG 


AACCTACGAT 


900 


TCAAGTTGAA 


AAGTTAACGA 


CACGTCGATT 


ATTAAGTGAC 


TCCGAGTTAA 


GANATC 


956 



(2) INFORMATION FOR SEQ ID NO:1162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 980 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162: 
TTTNCGCCCA CAAATCCCAN AGAAGACTTT GCCTTTAATT CCANCNACAT GGGNCATTAC 60 
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CTAAGACGAT 


GTNGGTACCA 


TGTCAGTCGT 


ACTCCCACAT GGTGTATTAT TCCGTGGTGC 


120 


CGCAGANGGT 


GTGATTCGTC 


GCTATTTAAT 




AGCTNCTTAG 


AANCCGTGNT 


180 


TGGGTTACCA 


NCCAATATTT 


TCTATGGGNC 


AALrlAI 1\-LA 


ACATGTATTT 


TAGTATTTAA 


240 


AAAATGTCGC 


CAACAAGACG 


ACAACGTACT 


ATTTATCGAT 


CaCATCCAATG 


ATTTTGNAAA 


300 


AGGAAAAAAT 


CAAAACCATT 


TAAGCGATGC 


CCAAGTCGAA 


CGTATTATAG 


ACACATATAA 


360 


GCGTAAAGCA 


ACGATTGATA 


AATATAGCTA 


CAGCGCGACA 


TTACAAGAGA 


TCGCCGATAA 


420 


CGATTACAAC 


TTAAAGGCAT 


ACCGAGCGAT 


ATGTTGATAC 


ATTCGAAGAA 


GAAGCACCAA 


480 


TTGATTTAGA 


TCAAGTCCAA 


CAAGATTTGA 


AAAATATCGA 


CAAAGAAATC 


GCAGAAATTG 


540 


AACAAGAAAT 


CAATGCATAC 


CTGAAAGAAC 


TTGGGGTGTT 


GAAAGATGAG 


TAATACACAA 


600 


ACGAAAAATG 


TGCCAGAGTT 


GAGATTCCCA 


GGATTTGAGG 


GCGAATGGGA 


AGAGAAGAAG 


660 


TTAGGGAATC 


TTACTACCAA 


AATAGGTAGT 


GGAAAGACTC 


CCAAAGGTGG 


AAGTGAAAAC 


720 


TATACAAACA 


AAGGCATACC 


ATTTTTAAGG 


AGTCAAAATA 


TTAGAAATGG 


TAAATTAAAT 


780 


CTTAATGACT 


TAGTTTATAT 


TAGTAAAGAT 


ATAGATTGG A 


TGAGATTGAA 


AAATAGTAGA 


840 


ACGTACTATG 


GTGATGTTCC 


TTTTNNATAT 


TACAGGAGCA 


TCAATNGGTA 


GGACAGGCCA 


900 


TTAATTCGAT 


AGGTNGAAAC 


GCATGTCTAA 


TTTANTCAAC 


ATGTTTGTAT 


TATTAGATTA 


960 


ANAAAGAGTA 


TTATTATTAT 










980 



(2) INFORMATION FOR SEQ ID NO: 1163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163: 



CCGTACATTA 


ACAACCTGGA 


TTAATTACAG 


ATTACGTTTT 


GTCTCATCTA 


AAAATTGTTG 


60 


ATAATCTTTG 


ATCAATTTCC 


AAATCTGTAC 


GTATGTATTA 


TCTTAGTAGT 


AGTTTAATAT 


120 


GCACTAAATA 


CTTGCAATTG 


ATATTTACCA 


TATTTATTGT 


CAAATCCAAT 


TATCTTGTGT 


180 


TTTTCATAAA 


ACGATTGCTT 


TAAATAATCT 


TCTAACACAT 


CAAACATCGT 


ATTATCACCG 


240 


ACATGGTGCC 


CGTATAAAAT 


AGTATTATGA 


TTTAAATTCT 


NCAATTCATT 


TCTAAAATCC 


300 


ATAAAAATAC 


TACCTTTACG 


TCGATGTTCT 


CGCTCAAAAT 


CTAAATTTAA ATAATCGTGA 


360 


TTTGTCTTAC 


CTTGTAGTAC 


TGGATAANTT 


AATGATGTTC 


CTGATAATTT 


TATCCATCCA 


420 


ACAATGTCTT 


TATTTATTTT 


TTCAAGTGAT 


TCAAATTGTG 


GTCTCACATG 


TTCTTGATGT 


480 


TTGNTCATCA 


GCATNTGAAA 


TTTNTGTTGT 


AATTTCTCAT 


AANTTGCGCG 


TTCTTGCTTG 


540 


TCTTCAATAT 


ATGTTTGAAC 


AATTTTGTAA 


CCAAAAATGA 


TAATAATTAC 


AACCAATAAA 


600 


ATTTGTACAA 


TAGTTAAAAA 


TCGCTTCATT 


CTCATAAAAA 


CCCTCTTTAA TAAACAANTT 


660 


TCTTCAGTCA 


TCACTAAACC 


AGTTGTTGTA 


CCGTTTTAGA 


TTCGATTTCG 


TTGACTTTGA 


720 


CAAATTAAGT 


AAATTAGCAT 


TGGACCACCG 


ACAATCATTA 


AAATAGCATT 


GGCTGGAATT 


780 
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IV. 1 AAAVjvAV? 


CPTPT ATP a p 


1\<V3 1 X AA 1 


AAATTARTrA 


CTAACAATAG CCATGCACC^ 




ITU APTOTAfl 
A 1 AA\~ X 1 AVj 


AAA APf2f2A AT 


A AP/P APTPTfJ 




CAACTAGCTT 


TCTAACCACA 


900 


tptwip apa a 


T A ATA PPT A A 
X AA X Av. v* X AA 


A A AfSTSTT ACT 
AAAuvTl. Xn\j 1 


TfJTPPAAPAA 


TCGCAACAGT 


TGCACTTGCT 


QfiO 


A AA A ATAPTT2 


p*p A AT A A APP 
l> x AA X nnnLL 


TPJTTA APP AT 
X \j X X nnLLn x 


PTC2TAAPPAT 


CAATATTAAA ACCGATACTT 






TfSTPPTV^F* A A 
lu 1\.Lt IV X AA 


A TTTSf""'!' A A A 
Ail 1 A vi 1 AAA 


TTP A A TTT A fZ. 
1 X\_AAX 1 X AVJ 


GGGACAATAG 


TAATGTTAAT 


XUOU 


AX 1 AAXv-t-l-A 


AT A ATrJPTTS A 
A X AA X \ lun 


X AV_ luL X AA X 


ATGTATAPPT 


CGCTCCATAT 


TTTCATTGTT 




A AflPPTTf: AC 


fti ATTTTV* AT 
vXAAX X X XwAX 


TAAAfSnflTTT 
X nnnuVju X x x 


TRAfiTTAAAA 


1 1 1 Aw 1 AAAAC 


ACCATTTAAT 




A ATAPPMATA 
M 1 AIAmN A 1 A 


ALuUviUtLU 


T APTA AT ATP 
1AL I AAXAXV- 


ATAPTTAPAfl 
A InV. X X n^no 


CATTGAATCT 


AAATTTAGAA 


1 OCA 


rn/T* a A/"*A A«PA 
IxjUAAV. An 1 A 


1AA1 1A1 1AA 


A A ATV2PTATT 
AnAiuulAl 1 


A A APPTPP A A 


TAAAACTTAA 


TAATGGTAAG 


1 1 O ft 


1 AAAAb 1 Av.A 


Al XXjlvavjAAi 


A A Af* A 

AAAUAALA X A 


LLAAu X viv* 1 L. 


TCATTATAAG 


TGCACCTGAG 


n o o f\ 


GAAACGCCAA 


TGATATTCGC 


CTCTGCCAAA 


GGATTTTGTA 


GTCCTGCTTG 


TAATAATGCT 


1440 


CCAGAAACTG 


CTAACATTGC 


GCCANCCATC 


AATGCAANTA 


ATAGACGTGC 


CAATCCCAAA 


1500 


TCAATGATTG 


AATCCACTGC 


TTCATTGCTA 


CCAGTTGTAA 


ATTTTGTAAA 


TAGGTCATTA 


1560 


AATGACAATT 


TAATTGTACC 


GGTTACAAAC 


GAAATATAAG 


CAGTTGCGAT 


TAAAATGACT 


1620 


AACAAACATA 


AAAATAGTAG 


TTTCTTTTTA 


TTTTTTATCA 


TCAG 




1664 



(2) INFORMATION FOR SEQ ID NO: 1164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164: 



TGAACCAGGA 


AAOTTGACAC 


GAGACATCTA 


ATCGACATGA 


CTATGCGTTA 


AAACCATTTG 


60 


GTAAATNCTG 


TCTTAGATCA 


TTTGAAAGAC 


GGTGGTTATG 


ATGTTATTNC 


CATCGGTAAA 


120 


ATTAATGACA 


TTTATGATGG 


TGAAGGTGTA 


ACAGAAGCGG 


TTCGTNCGAA 


GAGTAACATG 


180 


GACGGTATGG 


ATCAATTGAT 


GAAAATTGTT 


AAGAAAGATT 


TCACAGGTAT 


TAGCTTCTTA 


240 


AACTTAGTAG 


ACTTTGATGC 


ATTATACGGT 


CATCGTCGTG 


AGAAACCAGG 


TTATGCACAA 


300 


GCAATTAAAG 


ATTTCGATGA 


TNCGCTTGCC 


AGAAGCTTGT 


TTAGCAACTT 


ANAAGGAAGA 


360 


CGATTTAGTA 


ATTATTACAA 


GCAGACCATG 


GTAATGACCC 


CGACAGCGCC 


AGGGTNCGGG 


420 


ACCAATACGA 


GAGAATATAT 


CCCAGTAATT 


ATTGTNAAGT 


CCCGANATTT 


AAAGGTGNGT 


480 


CAATGCACTT 


AGANAGTGGA 


TACTACATCC 


AAGTTCTATC 


GGTGCANCTA 


TAGCAGATAA 


540 


TTTCAACGTA 


ACATTACCAG 


AGTTCGGTAA 


AAGTTATTTA 


AAGGAATTGN 


NATAGAATAA 


600 


ATTTAGATAT 


TATAAAAACA 


GCAGTGTAGT 


TAACTATAAC 


AATAGTTTTC 


TTCACTGCTG 


660 


TTTTTATTAT 


AATAGAGANA 


CX5TGTCGGTA 


GGACCTCTTA 


TTTAGGAGTA 


TCCTGATTTA 


720 


ATGTTAAACA 


ATACOTTTTC 


GGATTGANCC 


GGAAATTAAA 


TCGACAATTG 


CGACCATTAG 


780 


TCTCGTGCC 












789 
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(2) INFORMATION FOR SEQ ID NO: 1165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 : 



GGCACGAGAT 


ATATAAATAT 


GTATATACTT 


GTTTAATTTA 


TCTCTTTCTA 


TAAATAAAGA 


60 


CCTACCAGCA 


CTCGACTGAT 


AGGCCTTTTA 


ATATCTATAA 


TTATTTAATT 




120 


CGGCTAACTC 


TTTGTACCAA 


TAAGCACTTT 


TCTTAGGATA 


ACGTTCTTGA 


GTCTCAAAAT 


180 


CGACATAGAA 


TAAACCATAT 


CGTTTTTCAT 


AACCATTTGA 


CCAAGAGAAC 


ACATCCATTA 


240 


ATGACCAAAT 


AAAGTAACCT 


TTAACATTTG 


CACCATCTTT 


AATAGCATCT 


GCAATAACGT 


300 


TCAAATGTTG 


GCTTACATAA 


TCAATACGTG 


CATCATCATG 


AACTGTNTTT 


TCAGATTCAA 


360 


TAAATTCATC 


TTNATATCCT 


AAACCATNTT 


CAGTGATATA 


AATCTTGTGA 


TAGGNTAGGA 


420 


TAATCTTTNA 


ACAACACGCA 


TGATTGGATC 


ATATAAACCT 


TGAGGATAGA 


TCATCCAGGT 


480 


CCAGGCTTGT 


GCGAGGTACG 


TCGACATCAA 


ATCA 






514 



(2) INFORMATION FOR SEQ ID NO: 1166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1166: 
GCTCCTAAAA GGTTACTCCA CCGGC 25 
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WHAT IS CLAIMED IS: 

1. An isolated polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 8,10, 14,19, 25, 27, 28, 30, 32 and 36. 

2. An isolated nucleic acid encoding one of the amino acid sequences of 
5 Claim 1 and nucleic acid sequences capable of hybridizing therewith under stringent 

conditions. 

3. Recombinant vectors comprising the nucleic acid sequences of 
Claim 2 and host cells transformed or transfected therewith. 

4. A method of identifying an antimicrobial compound comprising 

10 contacting candidate compounds with a polypeptide of Claim 1 and selecting those 
compounds capable of inhibiting the bioactivity of said polypeptide. 

5. Antimicrobial compounds identified by the method of Claim 4. 

6. An isolated polypeptide comprising an acid sequence selected from 
the group consisting of SEQ ID NO: 1 through SEQ ID NO:623. 

15 7. An isolated nucleic acid encoding one of the amino acid sequences of 

Claim 6 and nucleic acid sequences capable of hybridizing therewith under stringent 
conditions. 

8. An isolated polynucleotide comprising a nucleic acid sequence 
selected from the group consisting of SEQ ID NOS: 631, 633, 636, 641, 647, 649, 

20 650, 652, 654 and 658. 

9. An isolated polynucleotide comprising a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:624 through SEQ ID NO: 1 165. 

10. An antibody against the polypeptide of claim 6. 
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